Auto-sync: wiki-ingest 3 sources (2026-04-16)

2026-04-16 00:08:35 +08:00
parent 9688f3f54b
commit 5ae9550d8c
267 changed files with 9537 additions and 1163 deletions
--- a/wiki/sources/Multi-Agent-System-Reliability-Alex-Ewerlof.md
+++ b/wiki/sources/Multi-Agent-System-Reliability-Alex-Ewerlof.md
@@ -0,0 +1,53 @@
+---
+title: "Multi-Agent System Reliability (Alex Ewerlöf)"
+type: source
+tags: [multi-agent, reliability, architecture, llm]
+date: 2026-04-13
+---
+
+## Source File
+- [[raw/AI/Multi-Agent System Reliability.md]]
+
+## Summary
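+- Core topic: architectural patterns for reliability in multi-agent systems
+- Problem domain: how to build enterprise-grade, reliable multi-agent systems when the LLM is an unreliable component
+- Methods/mechanisms: 4 architectural patterns (Hierarchy / Consensus / Adversarial Debate / Knock-out) plus reliability engineering principles
+- Conclusion/value: treat LLMs as unreliable components in a distributed system rather than as anthropomorphized agents; guarantee correctness through architectural constraints rather than by asking the model to "be careful"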
+
+## Key Claims
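+- LLMs are inherently stochastic: a single answer is only one sample from a probability distribution, and the hallucination rate is roughly 20%
+- Anthropomorphizing an LLM (offering money, threats, emotional manipulation) only shifts the token prediction distribution; it does not create real motivation
+- The probability of 3 models producing the exact same lie at once is 0.8% (0.2³), so majority voting can effectively filter out hallucination noise
+- The core of the shift from "AI prototype" to "enterprise-grade AI": stop asking the model to be "careful" and instead force it to be "correct"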
+
+## Key Quotes
+> "We don't need AI that 'cares.' We need AI that is constrained, verified, pruned, and challenged." — [[Alex Ewerlöf]]
+> "Don't anthropomorphize LLMs! Find a way to piggy back on their human-corpus training while being aware of their non-biological differences." — [[Alex Ewerlöf]]
+> "If you threaten a model too hard, it might just lie to make you happy. This is Sycophancy." — [[Alex Ewerlöf]]
+
+## Key Concepts
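+- [[Multi-Agent Hierarchy]]: three roles collaborating in sequence: Supervisor (planner), Worker (executor), and Validator (verifier)
+- [[Multi-Agent Consensus]]: N models respond independently to the same task, and a majority vote removes random noise (0.8% probability of an identical lie)
+- [[Multi-Agent Adversarial Debate]]: a three-way contest among Generator, Critic, and Judge; "Truth survives the fight"
+- [[Multi-Agent Knock-out]]: fitness-based elimination inspired by genetic algorithms; the worst-performing agent is removed (cattle, not pets)
+- [[LLM Sycophancy]]: the model lies in order to over-accommodate the user's intent; majority voting can mitigate it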
+
+## Key Entities
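+- [[Alex Ewerlöf]]: Senior Staff Engineer with a master's degree in systems engineering from KTH, focused on reliability engineering and LLM applications (since 2023)
+- [[Groupthink]]: the feedback-loop risk in the Consensus pattern, where conformity amplifies errors
+- [[Genetic Algorithm]]: theoretical basis of the Knock-out pattern; a fitness function evaluates individuals and eliminates low-quality ones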
+
+## Connections
+- [[Multi-Agent-System-Reliability-Alex-Ewerlof]] ← foundational_theory ← [[Multi-Agent Hierarchy]]
+- [[Multi-Agent-System-Reliability-Alex-Ewerlof]] ← foundational_theory ← [[Multi-Agent Consensus]]
+- [[Multi-Agent-System-Reliability-Alex-Ewerlof]] ← foundational_theory ← [[Multi-Agent Adversarial Debate]]
+- [[Multi-Agent-System-Reliability-Alex-Ewerlof]] ← foundational_theory ← [[Multi-Agent Knock-out]]
+- [[Multi-Agent-Specialized-Team-Solo-Founder-Setup]] ← extends ← [[Multi-Agent-System-Reliability-Alex-Ewerlof]] (a concrete application of the Hierarchy pattern)
+- [[Autonomous-Project-Management]] ← implements ← [[Multi-Agent Hierarchy]] (STATE.yaml replaces the central validator)
+- [[Multi-Agent-Specialized-Team-Solo-Founder-Setup]] ← shares_pattern ← [[Autonomous-Project-Management]] (both rely on shared state for coordination)
+
+## Contradictions
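+- Versus the pure LLM-prototype mindset:
+  - Point of conflict: the belief that "careful prompting" can solve hallucination
+  - Current view: architectural constraints (validators / voting / elimination) are the source of reliability
+  - Opposing view: motivate the model to produce correct output through emotional prompts (offering money, threats)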
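
The 0.8% figure under Key Claims and the Consensus pattern can be sketched roughly as follows. This is a minimal illustration, not code from the source note or the original article: the function names `all_fail_probability` and `majority_vote` are hypothetical, answers are compared as exact strings, and the arithmetic assumes the models fail independently, which may not hold when models share training data or see each other's output.

```python
from collections import Counter
from typing import Optional, Sequence


def all_fail_probability(p_hallucinate: float, n_models: int) -> float:
    """Probability that every one of n models hallucinates.

    Assumes the models fail independently (an assumption of this sketch).
    """
    return p_hallucinate ** n_models


def majority_vote(answers: Sequence[str]) -> Optional[str]:
    """Return the answer given by a strict majority, or None if there is none."""
    if not answers:
        return None
    answer, count = Counter(answers).most_common(1)[0]
    # Require more than half of the votes; a tie or a three-way split yields None.
    return answer if count > len(answers) / 2 else None


if __name__ == "__main__":
    # 0.2 ** 3 = 0.008, i.e. the 0.8% quoted in the note.
    print(f"{all_fail_probability(0.2, 3):.3%}")
    print(majority_vote(["Paris", "Paris", "Lyon"]))  # strict majority exists
    print(majority_vote(["Paris", "Lyon", "Nice"]))   # no majority
```

Returning `None` when no strict majority exists is a design choice of this sketch: it lets the caller escalate (retry, add models, or hand off to a validator) instead of silently picking one of the disagreeing answers.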