Sync: add ai finops and deployment notes
This commit is contained in:
30
wiki/entities/GoogleGemini.md
Normal file
30
wiki/entities/GoogleGemini.md
Normal file
@@ -0,0 +1,30 @@
|
||||
---
|
||||
title: "GoogleGemini"
|
||||
type: entity
|
||||
tags: ["llm-provider", "google", "gemini"]
|
||||
sources: ["engineering-autonomous-optimization-architect"]
|
||||
last_updated: 2026-04-26
|
||||
---
|
||||
|
||||
## Aliases
|
||||
- Gemini
|
||||
- Google Gemini
|
||||
- Gemini Flash
|
||||
- Gemini Pro
|
||||
|
||||
## Definition
|
||||
Google Gemini 是 Google 的 LLM 系列模型,涵盖从高性价比到高性能的多种版本。在 [[AutonomousOptimizationArchitect]] 系统中,Gemini Flash 因其极高的性价比(成本约为 Claude Opus 的 1/10)而被列为重要的路由目标。
|
||||
|
||||
## Role in LLM Routing
|
||||
- **Gemini Flash**:低成本高速度模型,如果精度达到基准的 98% 且成本远低于竞品,[[AutonomousOptimizationArchitect]] 会将流量自动路由至 Gemini
|
||||
- **Gemini Pro**:中端定位,提供能力与成本的平衡
|
||||
- 与 [[OpenAI]] 和 [[Anthropic]] 共同构成三足鼎立的 Provider 生态
|
||||
|
||||
## Key Properties
|
||||
- **Token 成本**:$0.075-0.5 / 1M tokens(Gemini Flash 极低)
|
||||
- **延迟**:低(Gemini Flash)
|
||||
- **优势**:极高的性价比,特别适合大规模、低成本推理
|
||||
|
||||
## Connections
|
||||
- [[OpenAI]] — 同为 LLM Provider
|
||||
- [[Anthropic]] — 高精度基准 Provider
|
||||
Reference in New Issue
Block a user