---
title: "Multi-Agent System Reliability"
type: source
tags: [multi-agent, 可靠性, llm, architecture]
date: 2026-04-13
---

## Source File
- [[raw/AI/Multi-Agent System Reliability.md]]

## Summary
- 核心主题：提高多智能体系统可靠性的四大架构模式
- 问题域：LLM固有的不可靠性（幻觉、逻辑谬误、上下文漂移）如何在多智能体拓扑中传播并被控制
- 方法/机制：等级制度（Hierarchy）、共识投票（Consensus）、对抗性辩论（Adversarial Debate）、淘汰制（Knock-out）
- 结论/价值：从"AI原型"到"企业级AI"的关键转变：停止将LLM当魔法聊天机器人，开始将其视为分布式系统中不可靠的组件

## Key Claims
- 等级制度（Hierarchy）：Planner分解任务，Worker执行，Validator验证，通过依赖图强制协作
- 共识投票（Consensus）：N个LLM独立执行同一任务，选多数票；3个模型同时产生相同幻觉概率仅0.8%
- 对抗性辩论（Adversarial Debate）：Generator提方案，Critic攻击，Judge裁决；用外部批评者模拟"恐惧"
- 淘汰制（Knock-out）：N个Agent竞争，验证器决定淘汰谁；LLM Agent是"cattle not pets"
- 核心原则：构建稳健系统需停止要求模型"小心"，开始强制它"正确"

## Key Quotes
> "To build robust systems, we need to stop asking the model to 'be careful' and start forcing it to be correct" — Alex Ewerlöf
> "Don't anthropomorphize LLMs! Find a way to piggy back on their human-corpus training while being aware of their non-biological differences" — Alex Ewerlöf

## Key Concepts
- [[多Agent可靠性模式]]：等级制度、共识投票、对抗性辩论、淘汰制四种架构模式
- [[共识投票]]：多数票机制降低LLM幻觉概率
- [[对抗性辩论]]：用外部批评者模拟恐惧感，避免模型自我中心
- [[淘汰制]]：对待LLM Agent如"cattle"（可替换）而非"pets"（独特珍贵）
- [[依赖图强制协作]]：Planner→Worker→Validator顺序执行，Worker必须等待Planner输入

## Key Entities
- [[Alex Ewerlöf]]：作者，27年经验的资深工程师，可靠性工程专家
- [[LLM]]：多智能体系统的底层不可靠组件

## Connections
- [[多Agent可靠性模式]] ← 核心主题 ← 可靠性工程
- [[等级制度]] ← 模式1 ← [[多Agent可靠性模式]]
- [[共识投票]] ← 模式2 ← [[多Agent可靠性模式]]
- [[对抗性辩论]] ← 模式3 ← [[多Agent可靠性模式]]
- [[淘汰制]] ← 模式4 ← [[多Agent可靠性模式]]

## Contradictions
- 与传统单LLM使用对比：单LLM依赖模型自身可靠性；多智能体通过架构模式引入冗余和验证机制
- 与拟人化LLM谬误对比：不应将LLM视为有情感和恐惧的实体，而应视为不可靠但可优化的组件