Sync: add ai finops and deployment notes

2026-04-26 12:35:45 +08:00
parent f09834b5a5
commit ecdf295ded
14 changed files with 7117 additions and 5832 deletions

@@ -0,0 +1,30 @@
---
title: "GoogleGemini"
type: entity
tags: ["llm-provider", "google", "gemini"]
sources: ["engineering-autonomous-optimization-architect"]
last_updated: 2026-04-26
---
## Aliases
- Gemini
- Google Gemini
- Gemini Flash
- Gemini Pro
## Definition
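Google Gemini is Google's family of LLMs, spanning versions from highly cost-effective to high-performance. In the [[AutonomousOptimizationArchitect]] system, Gemini Flash is listed as an important routing target because of its very high cost-effectiveness (roughly 1/10 the cost of Claude Opus).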
## Role in LLM Routing
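- **Gemini Flash**: low-cost, high-speed model. If its accuracy reaches 98% of the baseline and its cost is far below that of competing models, [[AutonomousOptimizationArchitect]] automatically routes traffic to Gemini
- **Gemini Pro**: mid-tier positioning, balancing capability and cost
- Together with [[OpenAI]] and [[Anthropic]], forms a three-way Provider ecosystem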
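
The routing rule for Gemini Flash above can be sketched as a simple threshold check. This is a minimal illustration, not the actual [[AutonomousOptimizationArchitect]] logic: `ModelStats`, `should_route_to_candidate`, and `max_cost_ratio` are hypothetical names, and the 0.5 cost ratio is only an assumed reading of "far below". Only the 98% accuracy threshold comes from the note.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelStats:
    """Hypothetical per-model evaluation summary (not from the source notes)."""
    name: str
    accuracy: float            # score on a shared eval set, 0.0 to 1.0
    cost_per_1m_tokens: float  # USD per 1M tokens


def should_route_to_candidate(
    baseline: ModelStats,
    candidate: ModelStats,
    min_accuracy_ratio: float = 0.98,  # "98% of the baseline" from the note
    max_cost_ratio: float = 0.5,       # assumed interpretation of "far below"
) -> bool:
    """Return True when the candidate is accurate enough and much cheaper."""
    if baseline.accuracy <= 0 or baseline.cost_per_1m_tokens <= 0:
        return False  # no meaningful baseline to compare against
    accuracy_ok = candidate.accuracy >= min_accuracy_ratio * baseline.accuracy
    cost_ok = (
        candidate.cost_per_1m_tokens
        <= max_cost_ratio * baseline.cost_per_1m_tokens
    )
    return accuracy_ok and cost_ok
```

Both conditions must hold: a cheap model that misses the accuracy bar is rejected, and so is an accurate model that is not meaningfully cheaper.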
## Key Properties
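- **Token cost**: $0.075-0.5 / 1M tokens (Gemini Flash is extremely low)
- **Latency**: Gemini Flash
- **Strengths**: very high cost-effectiveness, especially well suited to large-scale, low-cost inference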
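
To make the token cost range above concrete, the following sketch estimates spend for a given token volume. It assumes a single flat per-1M-token price with no split between input and output tokens. `estimate_cost_usd` and the 2B-token monthly workload are hypothetical. Only the $0.075 and $0.5 endpoints come from the note.

```python
def estimate_cost_usd(tokens: int, price_per_1m_tokens: float) -> float:
    """Cost in USD for `tokens` tokens at a flat per-1M-token price."""
    if tokens < 0 or price_per_1m_tokens < 0:
        raise ValueError("tokens and price must be non-negative")
    return tokens / 1_000_000 * price_per_1m_tokens


# Range quoted in the note: $0.075 to $0.5 per 1M tokens.
LOW_PRICE, HIGH_PRICE = 0.075, 0.5

monthly_tokens = 2_000_000_000  # hypothetical workload: 2B tokens per month
low = estimate_cost_usd(monthly_tokens, LOW_PRICE)
high = estimate_cost_usd(monthly_tokens, HIGH_PRICE)
print(f"${low:,.2f} to ${high:,.2f} per month")
```

The same function can be run against another provider's price to compare monthly spend across the quoted range.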
## Connections
- [[OpenAI]]: fellow LLM Provider
- [[Anthropic]]: high-accuracy baseline Provider