Sync: add ai finops and deployment notes

2026-04-26 12:35:45 +08:00
parent f09834b5a5
commit ecdf295ded
14 changed files with 7117 additions and 5832 deletions
--- a/wiki/entities/GoogleGemini.md
+++ b/wiki/entities/GoogleGemini.md
@@ -0,0 +1,30 @@
+---
+title: "GoogleGemini"
+type: entity
+tags: ["llm-provider", "google", "gemini"]
+sources: ["engineering-autonomous-optimization-architect"]
+last_updated: 2026-04-26
+---
+
+## Aliases
+- Gemini
+- Google Gemini
+- Gemini Flash
+- Gemini Pro
+
+## Definition
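+Google Gemini is Google's family of LLMs, spanning versions from highly cost-efficient to high-performance. In the [[AutonomousOptimizationArchitect]] system, Gemini Flash is listed as an important routing target because of its very high cost-effectiveness (roughly 1/10 the cost of Claude Opus).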
+
+## Role in LLM Routing
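+- **Gemini Flash**: a low-cost, high-speed model. If its accuracy reaches 98% of the baseline and its cost is far below that of competing models, [[AutonomousOptimizationArchitect]] automatically routes traffic to Gemini
+- **Gemini Pro**: mid-tier positioning that balances capability and cost
+- Together with [[OpenAI]] and [[Anthropic]], forms a three-way Provider ecosystem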
+
+## Key Properties
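+- **Token cost**: $0.075-0.5 per 1M tokens (extremely low for Gemini Flash)
+- **Latency**: low (Gemini Flash)
+- **Strengths**: very high cost-effectiveness, especially well suited to large-scale, low-cost inference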
+
+## Connections
+- [[OpenAI]]: fellow LLM Provider
+- [[Anthropic]]: high-accuracy baseline Provider