整理文件路径:Technical→AI/
This commit is contained in:
33
wiki/concepts/AI配音.md
Normal file
33
wiki/concepts/AI配音.md
Normal file
@@ -0,0 +1,33 @@
|
||||
---
|
||||
id: ai-voice
|
||||
title: "AI配音"
|
||||
type: concept
|
||||
tags: [TTS, voice, audio]
|
||||
sources:
|
||||
- "[[AI配音与声音克隆工具合集]]"
|
||||
last_updated: 2025-03-06
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
AI配音是文本转语音(TTS)技术,将文字内容转化为自然语音。
|
||||
|
||||
## Key Technologies
|
||||
|
||||
- **TTS**:Text-to-Speech,文字转语音
|
||||
- **声音克隆**:用少量样本重建个人声音
|
||||
|
||||
## Popular Tools
|
||||
|
||||
| 平台 | 特点 | 价格 |
|
||||
|------|------|------|
|
||||
| ElevenLabs | 国际顶流,30+语言,情感变化 | 付费较贵 |
|
||||
| 海螺AI | 小白友好,30秒克隆,中文好 | 免费 |
|
||||
| F5-TTS | 开源免费,2秒克隆,技术流 | 免费 |
|
||||
| TTSMaker | 每周3万字,50+语言,300+音色 | 免费限额 |
|
||||
| 剪映 | 抖音官方,短视频首选 | 部分VIP |
|
||||
| AnyVoice | 3秒克隆,中英日韩 | 免费无限 |
|
||||
|
||||
## Connections
|
||||
- [[二创视频]] ← uses ← [[AI配音]]
|
||||
- [[内容创作]] ← uses ← [[AI配音]]
|
||||
42
wiki/concepts/Agent.md
Normal file
42
wiki/concepts/Agent.md
Normal file
@@ -0,0 +1,42 @@
|
||||
---
|
||||
id: agent
|
||||
title: "Agent"
|
||||
type: concept
|
||||
tags: [AI, autonomous, tool-use]
|
||||
sources:
|
||||
- "[[LLM Terms Framework]]"
|
||||
last_updated: 2025-12-20
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
Agent(智能体)是LLM+MCP的组合,LLM负责给出步骤,MCP负责实际执行。
|
||||
|
||||
## How It Works
|
||||
|
||||
1. LLM理解用户意图
|
||||
2. LLM规划执行步骤
|
||||
3. MCP调用外部工具执行
|
||||
4. 结果反馈给LLM
|
||||
5. LLM继续下一步或返回结果
|
||||
|
||||
## Key Capabilities
|
||||
|
||||
- 自主决策
|
||||
- 工具调用
|
||||
- 任务分解
|
||||
- 迭代优化
|
||||
|
||||
## vs Vanilla LLM
|
||||
|
||||
| 维度 | Vanilla LLM | Agent |
|
||||
|------|-------------|-------|
|
||||
| 能力 | 仅生成文本 | 执行实际操作 |
|
||||
| 工具调用 | 无 | 有 |
|
||||
| 自主性 | 低 | 高 |
|
||||
| 幻觉风险 | 高 | 低(可验证) |
|
||||
|
||||
## Connections
|
||||
- [[Agent]] ← combines ← [[LLM]] + [[MCP]]
|
||||
- [[Agent]] ← extends ← [[LLM]]
|
||||
- [[Agent]] ← uses ← [[工具调用]]
|
||||
40
wiki/concepts/AgenticAI.md
Normal file
40
wiki/concepts/AgenticAI.md
Normal file
@@ -0,0 +1,40 @@
|
||||
---
|
||||
id: agentic-ai
|
||||
title: "Agentic AI"
|
||||
type: concept
|
||||
tags: [AI, agent, autonomous, proactive]
|
||||
sources:
|
||||
- "[[Designing for Agentic AI]]"
|
||||
last_updated: 2025-03-02
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
Agentic AI是能够自主行动和决策的AI系统,能够预判用户需求并主动执行任务。
|
||||
|
||||
## Key Characteristics
|
||||
|
||||
- **主动预判**:不需要用户明确指令,主动分析并行动
|
||||
- **实时反馈**:持续向用户展示决策过程
|
||||
- **用户控制**:确保用户对AI行为有最终决定权
|
||||
- **行动执行**:不仅生成内容,而是执行具体操作
|
||||
|
||||
## Five Design Principles
|
||||
|
||||
1. **透明性**:让用户理解AI的决策过程
|
||||
2. **控制权**:用户始终保持对AI行为的最终决定权
|
||||
3. **个性化**:AI适应用户的偏好和习惯
|
||||
4. **对话**:通过自然语言进行持续交互
|
||||
5. **预判**:AI主动识别并满足用户潜在需求
|
||||
|
||||
## vs GenAI
|
||||
|
||||
| 维度 | GenAI | Agentic AI |
|
||||
|------|-------|------------|
|
||||
| 核心能力 | 内容生成 | 行动执行 |
|
||||
| 交互模式 | 被动响应 | 主动预判 |
|
||||
| 反馈机制 | 单次响应 | 实时反馈 |
|
||||
|
||||
## Connections
|
||||
- [[Agentic AI]] ← extends ← [[GenAI]]
|
||||
- [[AI产品设计]] ← uses ← [[Agentic AI设计原则]]
|
||||
55
wiki/concepts/ClaudeSkills.md
Normal file
55
wiki/concepts/ClaudeSkills.md
Normal file
@@ -0,0 +1,55 @@
|
||||
---
|
||||
id: claude-skills
|
||||
title: "Claude Skills"
|
||||
type: concept
|
||||
tags: [Anthropic, Claude, skill, SOP]
|
||||
sources:
|
||||
- "[[Claude Skills最值得研究的AI范式]]"
|
||||
last_updated: 2026-01-05
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
Claude Skills是Anthropic官方发布的AI技能指南,本质是写给Claude的"说明书"和"SOP"。
|
||||
|
||||
## What It Contains
|
||||
|
||||
- Prompt结构定义
|
||||
- 参数含义说明
|
||||
- 容错策略
|
||||
- 使用示例
|
||||
|
||||
## Official Skills Categories
|
||||
|
||||
- 办公自动化四大件(Word/PDF/PPT/Excel)
|
||||
- 开发者工具箱
|
||||
- 创意类Skill
|
||||
|
||||
## Awesome Claude Skills
|
||||
|
||||
三大社区仓库:
|
||||
- ComposioHQ
|
||||
- VoltAgent
|
||||
- BehiSecc
|
||||
|
||||
## Skills聚合站
|
||||
|
||||
- skillsmp.com
|
||||
- aitmpl.com/skills
|
||||
- claudemarketplaces.com
|
||||
|
||||
## Significance
|
||||
|
||||
Skills的爆发标志着从**提示词工程**到**流程工程**的关键转变:
|
||||
- 将经验沉淀为SOP
|
||||
- 交给AI稳定执行
|
||||
- 实现可复用的工作流
|
||||
|
||||
## Connection to Vibe Coding
|
||||
|
||||
Vibe Coding的尽头也是Skills,通过AI编程方式构建的流程最终需要Skills来标准化和复用。
|
||||
|
||||
## Connections
|
||||
- [[提示词工程]] ← evolves_to ← [[流程工程]]
|
||||
- [[Claude Skills]] ← implements ← [[SOP标准化]]
|
||||
- [[Vibe Coding]] ← uses ← [[Claude Skills]]
|
||||
32
wiki/concepts/Embedding.md
Normal file
32
wiki/concepts/Embedding.md
Normal file
@@ -0,0 +1,32 @@
|
||||
---
|
||||
id: embedding
|
||||
title: "Embedding"
|
||||
type: concept
|
||||
tags: [LLM, vector, representation]
|
||||
sources:
|
||||
- "[[RAG从入门到精通系列1:基础RAG]]"
|
||||
- "[[LLM Terms Framework]]"
|
||||
last_updated: 2025-12-18
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
Embedding(向量化)是将文本转换为数值向量的技术,使计算机能够计算词与词之间的距离和语义关系。
|
||||
|
||||
## Mechanism
|
||||
|
||||
- 将文本映射到高维向量空间
|
||||
- 语义相似的文本在向量空间中距离更近
|
||||
- 支持相似度搜索和聚类分析
|
||||
|
||||
## Use Cases
|
||||
|
||||
- RAG系统的文档索引
|
||||
- 语义搜索
|
||||
- 文本相似度比较
|
||||
- 推荐系统
|
||||
|
||||
## Connections
|
||||
- [[LLM]] ← uses ← [[Embedding]]
|
||||
- [[RAG]] ← uses ← [[Embedding]]
|
||||
- [[向量数据库]] ← stores ← [[Embedding]]
|
||||
31
wiki/concepts/GenAI.md
Normal file
31
wiki/concepts/GenAI.md
Normal file
@@ -0,0 +1,31 @@
|
||||
---
|
||||
id: genai
|
||||
title: "GenAI"
|
||||
type: concept
|
||||
tags: [AI, generation, content-creation]
|
||||
sources:
|
||||
- "[[Designing for Agentic AI]]"
|
||||
last_updated: 2025-03-02
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
GenAI(生成式AI)擅长创作内容,如文本、图像、代码、音乐等。
|
||||
|
||||
## Key Characteristics
|
||||
|
||||
- 内容生成能力强
|
||||
- 被动响应用户请求
|
||||
- 适合创意类任务
|
||||
|
||||
## vs Agentic AI
|
||||
|
||||
| 维度 | GenAI | Agentic AI |
|
||||
|------|-------|------------|
|
||||
| 核心能力 | 内容生成 | 行动执行 |
|
||||
| 交互模式 | 被动响应 | 主动预判 |
|
||||
| 代表任务 | 写作、绘画 | 自动化工作流 |
|
||||
|
||||
## Connections
|
||||
- [[Agentic AI]] ← evolves_from ← [[GenAI]]
|
||||
- [[AI产品设计]] ← uses ← [[GenAI]]
|
||||
41
wiki/concepts/LLM.md
Normal file
41
wiki/concepts/LLM.md
Normal file
@@ -0,0 +1,41 @@
|
||||
---
|
||||
id: llm
|
||||
title: "LLM"
|
||||
type: concept
|
||||
tags: [AI, language-model, foundation-model]
|
||||
sources:
|
||||
- "[[LLM Terms Framework]]"
|
||||
last_updated: 2025-12-20
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
LLM(Large Language Model,大语言模型)是参数规模≥1B的深度学习模型,能够理解和生成人类语言。
|
||||
|
||||
## Core Properties
|
||||
|
||||
- **参数规模**:通常≥10亿参数
|
||||
- **语言理解**:能够理解复杂语义
|
||||
- **文本生成**:能够生成连贯、合法的文本
|
||||
- **上下文学习**:能从少量示例中学习
|
||||
|
||||
## Key Metrics
|
||||
|
||||
- **Token**:基本输入单元
|
||||
- 1英文字符 ≈ 0.3 token
|
||||
- 1中文字符 ≈ 0.6 token
|
||||
- **Context Window**:模型能接受的上下文长度
|
||||
|
||||
## Related Concepts
|
||||
|
||||
- [[Token]]:LLM的基本输入单元
|
||||
- [[MCP]]:LLM与外部工具的连接协议
|
||||
- [[Agent]]:LLM+MCP的智能体
|
||||
- [[RAG]]:扩展LLM能力的技术
|
||||
- [[Embedding]]:LLM理解文本的基础
|
||||
|
||||
## Connections
|
||||
- [[LLM]] ← uses ← [[Token]]
|
||||
- [[LLM]] ← uses ← [[MCP]]
|
||||
- [[Agent]] ← combines ← [[LLM]] + [[MCP]]
|
||||
- [[RAG]] ← extends ← [[LLM]]
|
||||
43
wiki/concepts/MCP.md
Normal file
43
wiki/concepts/MCP.md
Normal file
@@ -0,0 +1,43 @@
|
||||
---
|
||||
id: mcp
|
||||
title: "MCP"
|
||||
type: concept
|
||||
tags: [AI, protocol, tool-integration]
|
||||
sources:
|
||||
- "[[LLM Terms Framework]]"
|
||||
last_updated: 2025-12-20
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
MCP(Model Context Protocol,模型上下文协议)是一种标准化接口,用于连接大模型与外部数据和工具。
|
||||
|
||||
## Purpose
|
||||
|
||||
解决LLM无法访问实时数据和外部工具的问题:
|
||||
- LLM给出执行步骤
|
||||
- 实际执行需要配合MCP
|
||||
- 实现智能体(Agent)功能
|
||||
|
||||
## Architecture
|
||||
|
||||
- **Client**:运行在AI应用端
|
||||
- **Server**:运行在外部服务或本地
|
||||
|
||||
## Use Cases
|
||||
|
||||
- 文件系统访问
|
||||
- API调用
|
||||
- 数据库查询
|
||||
- 代码执行
|
||||
|
||||
## Connection to Agent
|
||||
|
||||
Agent = LLM + MCP
|
||||
- LLM负责理解和规划
|
||||
- MCP负责执行具体操作
|
||||
|
||||
## Connections
|
||||
- [[LLM]] ← uses ← [[MCP]]
|
||||
- [[Agent]] ← combines ← [[LLM]] + [[MCP]]
|
||||
- [[MCP]] ← enables ← [[工具调用]]
|
||||
39
wiki/concepts/Prompt能力.md
Normal file
39
wiki/concepts/Prompt能力.md
Normal file
@@ -0,0 +1,39 @@
|
||||
---
|
||||
id: prompt-ability
|
||||
title: "Prompt能力"
|
||||
type: concept
|
||||
tags: [prompt-engineering, communication]
|
||||
sources:
|
||||
- "[[如何写出完美的Prompt]]"
|
||||
last_updated: 2025-12-02
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
Prompt能力是清晰界定需求+结构化思维表达的能力,本质是需求拆解+结构化表达能力。
|
||||
|
||||
## Core Elements
|
||||
|
||||
人与AI的协作协议,定义:
|
||||
- **做什么**:明确任务目标
|
||||
- **为什么**:任务背景和目的
|
||||
- **给谁**:目标受众
|
||||
- **怎么做**:执行方式和约束
|
||||
- **做到什么标准**:质量要求和验收标准
|
||||
|
||||
## Four Key Elements
|
||||
|
||||
1. **角色**:AI扮演的身份
|
||||
2. **受众对齐**:明确目标用户
|
||||
3. **场景对齐**:使用环境上下文
|
||||
4. **目标对齐**:预期成果定义
|
||||
|
||||
## Common Mistakes
|
||||
|
||||
- 越复杂越专业
|
||||
- 说清做什么就行
|
||||
- 一键生成即终点
|
||||
|
||||
## Connections
|
||||
- [[AI协作]] ← requires ← [[Prompt能力]]
|
||||
- [[结构化思维]] ← enables ← [[Prompt能力]]
|
||||
41
wiki/concepts/RAG.md
Normal file
41
wiki/concepts/RAG.md
Normal file
@@ -0,0 +1,41 @@
|
||||
---
|
||||
id: rag
|
||||
title: "RAG"
|
||||
type: concept
|
||||
tags: [LLM, retrieval, augmentation]
|
||||
sources:
|
||||
- "[[RAG从入门到精通系列1:基础RAG]]"
|
||||
- "[[LLM Terms Framework]]"
|
||||
last_updated: 2025-12-18
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
RAG(Retrieval Augmented Generation,检索增强生成)是一种结合检索系统和LLM生成的技术,解决LLM缺乏最新和私有数据的问题。
|
||||
|
||||
## Three-Step Process
|
||||
|
||||
1. **索引(Indexing)**:将文档切分并转换为Embedding向量存入向量数据库
|
||||
2. **检索(Retrieval)**:根据问题语义向量检索相关文档块
|
||||
3. **生成(Generation)**:将问题和相关文档输入LLM生成答案
|
||||
|
||||
## Key Components
|
||||
|
||||
- **Embedding**:将文本转换为数值向量
|
||||
- **向量数据库**:存储和检索向量表示(如Qdrant)
|
||||
- **文档切分**:将长文档分割成符合Embedding窗口的块
|
||||
- **Context Window**:模型能接受的上下文长度限制(512-8192 token)
|
||||
|
||||
## Why It Matters
|
||||
|
||||
解决LLM的幻觉问题,让模型能够:
|
||||
- 访问最新信息
|
||||
- 利用私有数据
|
||||
- 提供可溯源的回答
|
||||
|
||||
## Connections
|
||||
- [[LLM]] ← uses ← [[RAG]]
|
||||
- [[RAG]] ← includes ← [[索引]]
|
||||
- [[RAG]] ← includes ← [[检索]]
|
||||
- [[RAG]] ← includes ← [[生成]]
|
||||
- [[RAG]] ← extends ← [[LLM]]
|
||||
30
wiki/concepts/SourceGrounding.md
Normal file
30
wiki/concepts/SourceGrounding.md
Normal file
@@ -0,0 +1,30 @@
|
||||
---
|
||||
id: source-grounding
|
||||
title: "Source-Grounding"
|
||||
type: concept
|
||||
tags: [NotebookLM, accuracy, grounding]
|
||||
sources:
|
||||
- "[[7 ways I use NotebookLM to make my life easier]]"
|
||||
last_updated: 2025-11-23
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
Source-Grounding是NotebookLM的核心机制,限制知识库仅包含用户上传的文档,确保AI回答准确且可溯源。
|
||||
|
||||
## Mechanism
|
||||
|
||||
- 用户上传文档后,NotebookLM只在这个文档范围内回答
|
||||
- 避免AI幻觉,确保回答有据可查
|
||||
- 每个回答都附带源文档引用
|
||||
|
||||
## Why It Matters
|
||||
|
||||
解决通用LLM的幻觉问题,特别适用于:
|
||||
- 法律文档审查
|
||||
- 学术研究
|
||||
- 精确信息查询
|
||||
|
||||
## Connections
|
||||
- [[NotebookLM]] ← uses ← [[Source-Grounding]]
|
||||
- [[AI准确性]] ← requires ← [[Source-Grounding]]
|
||||
37
wiki/concepts/Token.md
Normal file
37
wiki/concepts/Token.md
Normal file
@@ -0,0 +1,37 @@
|
||||
---
|
||||
id: token
|
||||
title: "Token"
|
||||
type: concept
|
||||
tags: [LLM, tokenization, input-unit]
|
||||
sources:
|
||||
- "[[LLM Terms Framework]]"
|
||||
last_updated: 2025-12-20
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
Token是大模型的基本输入单元,是文本处理的最小单位。
|
||||
|
||||
## Tokenization Rules
|
||||
|
||||
- 1英文字符 ≈ 0.3 token
|
||||
- 1中文字符 ≈ 0.6 token
|
||||
- 标点符号和空格也占用token
|
||||
|
||||
## Why It Matters
|
||||
|
||||
- 影响API调用成本
|
||||
- 决定上下文长度限制
|
||||
- 影响生成速度
|
||||
|
||||
## Context Window
|
||||
|
||||
模型能接受的token数量限制:
|
||||
- 较短的模型:4K-8K tokens
|
||||
- 中等模型:32K-128K tokens
|
||||
- 长上下文模型:1M+ tokens
|
||||
|
||||
## Connections
|
||||
- [[LLM]] ← uses ← [[Token]]
|
||||
- [[Token]] → affects → [[成本计算]]
|
||||
- [[Token]] → affects → [[上下文限制]]
|
||||
32
wiki/concepts/VibeCoding.md
Normal file
32
wiki/concepts/VibeCoding.md
Normal file
@@ -0,0 +1,32 @@
|
||||
---
|
||||
id: vibe-coding
|
||||
title: "Vibe Coding"
|
||||
type: concept
|
||||
tags: [AI, programming, coding]
|
||||
sources:
|
||||
- "[[Claude Skills最值得研究的AI范式]]"
|
||||
last_updated: 2026-01-05
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
Vibe Coding是一种AI编程方式,通过自然语言与AI协作编写代码。
|
||||
|
||||
## Characteristics
|
||||
|
||||
- 自然语言为主
|
||||
- AI生成代码
|
||||
- 人类审核和调整
|
||||
- 降低编程门槛
|
||||
|
||||
## The End State
|
||||
|
||||
Vibe Coding的尽头是Skills:
|
||||
- 通过对话构建的代码和流程
|
||||
- 需要标准化为Skills以便复用
|
||||
- 最终沉淀为可维护的系统
|
||||
|
||||
## Connections
|
||||
- [[Vibe Coding]] ← uses ← [[Claude Skills]]
|
||||
- [[AI编程]] ← extends ← [[Vibe Coding]]
|
||||
- [[提示词工程]] ← relates_to ← [[Vibe Coding]]
|
||||
66
wiki/concepts/Workspace.md
Normal file
66
wiki/concepts/Workspace.md
Normal file
@@ -0,0 +1,66 @@
|
||||
---
|
||||
title: "Workspace"
|
||||
type: concept
|
||||
tags: [openclaw, workspace, configuration]
|
||||
last_updated: 2026-04-14
|
||||
---
|
||||
|
||||
# Workspace
|
||||
|
||||
OpenClaw中Agent的工作台目录,决定Agent如何工作。
|
||||
|
||||
## 核心文件
|
||||
|
||||
### AGENTS.md
|
||||
定义Agent的:
|
||||
- 岗位职责
|
||||
- 行为边界
|
||||
- 多Agent协调规则
|
||||
|
||||
### SOUL.md
|
||||
定义Agent的:
|
||||
- 性格叙事
|
||||
- 沟通风格
|
||||
- 价值观和边界
|
||||
|
||||
### USER.md
|
||||
固化用户的:
|
||||
- 偏好设定
|
||||
- 背景知识假设
|
||||
- 常见任务
|
||||
|
||||
### TOOLS.md
|
||||
声明工具的:
|
||||
- 可用工具
|
||||
- 使用原则
|
||||
- 受限工具
|
||||
|
||||
### IDENTITY.md
|
||||
结构化身份档案:
|
||||
- 名字
|
||||
- 角色类型
|
||||
- Emoji
|
||||
- 头像
|
||||
|
||||
### BOOTSTRAP.md
|
||||
一次性启动引导,完成后应删除。
|
||||
|
||||
### memory/
|
||||
长期记忆目录:
|
||||
- 按日期滚动的记忆笔记
|
||||
- 实现跨会话上下文保留
|
||||
|
||||
## 配置要点
|
||||
|
||||
1. **边界比能力更重要**:明确"不要做什么"
|
||||
2. **场景触发优于通用指令**:具体场景下的具体规则
|
||||
3. **简洁有效**:300-500字的AGENTS.md比2000字的更有效
|
||||
|
||||
## 与openclaw.json的关系
|
||||
- Workspace文件:管"Agent平时怎么干活"
|
||||
- openclaw.json:管"系统怎么跑Agent"
|
||||
|
||||
## 相关概念
|
||||
- [[OpenClaw]]
|
||||
- [[AGENTS.md]]
|
||||
- [[SOUL.md]]
|
||||
36
wiki/concepts/vLLM.md
Normal file
36
wiki/concepts/vLLM.md
Normal file
@@ -0,0 +1,36 @@
|
||||
---
|
||||
id: vllm
|
||||
title: "vLLM"
|
||||
type: concept
|
||||
tags: [LLM, inference, GPU, optimization]
|
||||
sources:
|
||||
- "[[LLM Terms Framework]]"
|
||||
last_updated: 2025-12-20
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
vLLM是一个高效LLM推理框架,通过KV Cache和连续批处理提升GPU利用率。
|
||||
|
||||
## Key Optimizations
|
||||
|
||||
### KV Cache
|
||||
- 缓存已计算的Key-Value矩阵
|
||||
- 避免重复计算
|
||||
- 大幅提升推理速度
|
||||
|
||||
### Continuous Batching
|
||||
- 动态批处理多个请求
|
||||
- 提高GPU利用率
|
||||
- 降低延迟
|
||||
|
||||
## Why It Matters
|
||||
|
||||
- 官方HuggingFace推理速度慢
|
||||
- vLLM可提升10-24倍速度
|
||||
- 支持高并发推理
|
||||
|
||||
## Connections
|
||||
- [[LLM]] ← uses ← [[vLLM]]
|
||||
- [[推理优化]] ← uses ← [[vLLM]]
|
||||
- [[GPU利用率]] ← improves ← [[vLLM]]
|
||||
38
wiki/concepts/九宫格法.md
Normal file
38
wiki/concepts/九宫格法.md
Normal file
@@ -0,0 +1,38 @@
|
||||
---
|
||||
id: nine-grid
|
||||
title: "九宫格法"
|
||||
type: concept
|
||||
tags: [video, AI, image-generation]
|
||||
sources:
|
||||
- "[[固定镜头短视频AI全流程制作]]"
|
||||
last_updated: 2025-03-15
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
九宫格法是一次性生成3x3共九个分镜画面的方法,确保多个镜头之间的画面一致性。
|
||||
|
||||
## Mechanism
|
||||
|
||||
1. 将视频分割为9个分镜
|
||||
2. 一次性生成3x3网格图像
|
||||
3. 每个格子是一个分镜的关键帧
|
||||
4. 确保人物/场景在多个格子中保持一致
|
||||
|
||||
## Why It Works
|
||||
|
||||
- AI在单张图像内保持一致性更容易
|
||||
- 避免逐帧生成导致的人物变形
|
||||
- 提高多镜头视频的整体质量
|
||||
|
||||
## Five-Step Formula
|
||||
|
||||
1. 拆分镜头
|
||||
2. 一致性图像生成(九宫格法)
|
||||
3. 首尾针动画
|
||||
4. 快速剪辑
|
||||
5. 声音设计
|
||||
|
||||
## Connections
|
||||
- [[AI视频制作]] ← uses ← [[九宫格法]]
|
||||
- [[分镜设计]] ← uses ← [[九宫格法]]
|
||||
37
wiki/concepts/固定机位.md
Normal file
37
wiki/concepts/固定机位.md
Normal file
@@ -0,0 +1,37 @@
|
||||
---
|
||||
id: fixed-camera
|
||||
title: "固定机位"
|
||||
type: concept
|
||||
tags: [video-production, cinematography]
|
||||
sources:
|
||||
- "[[固定镜头短视频AI全流程制作]]"
|
||||
last_updated: 2025-03-15
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
固定机位是摄像机位置固定不变的拍摄方式,是固定镜头短视频的核心特征。
|
||||
|
||||
## Key Characteristics
|
||||
|
||||
- 摄像机位置不变
|
||||
- 只有画面内容变化
|
||||
- 适合展示时间流逝
|
||||
- 便于AI生成一致性画面
|
||||
|
||||
## Use Cases
|
||||
|
||||
- 家装视频
|
||||
- 产品展示
|
||||
- 教程演示
|
||||
- 时间压缩视频
|
||||
|
||||
## Connection to AI Video
|
||||
|
||||
固定机位降低AI视频生成的复杂度,通过:
|
||||
- 九宫格法保证画面一致性
|
||||
- 首尾针动画实现平滑过渡
|
||||
|
||||
## Connections
|
||||
- [[AI视频制作]] ← uses ← [[固定机位]]
|
||||
- [[短视频制作]] ← uses ← [[固定机位]]
|
||||
32
wiki/concepts/声音克隆.md
Normal file
32
wiki/concepts/声音克隆.md
Normal file
@@ -0,0 +1,32 @@
|
||||
---
|
||||
id: voice-cloning
|
||||
title: "声音克隆"
|
||||
type: concept
|
||||
tags: [TTS, voice, cloning]
|
||||
sources:
|
||||
- "[[AI配音与声音克隆工具合集]]"
|
||||
last_updated: 2025-03-06
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
声音克隆是用少量音频样本重建个人声音特征的技术。
|
||||
|
||||
## How It Works
|
||||
|
||||
1. 收集目标声音的短音频(2-30秒)
|
||||
2. 提取声音特征
|
||||
3. 生成新的语音内容
|
||||
|
||||
## Speed Comparison
|
||||
|
||||
| 工具 | 克隆速度 | 技术门槛 |
|
||||
|------|----------|----------|
|
||||
| F5-TTS | 2秒 | 高(需代码) |
|
||||
| 海螺AI | 30秒 | 低 |
|
||||
| AnyVoice | 3秒 | 低 |
|
||||
| ElevenLabs | 30秒 | 低 |
|
||||
|
||||
## Connections
|
||||
- [[AI配音]] ← uses ← [[声音克隆]]
|
||||
- [[内容创作]] ← uses ← [[声音克隆]]
|
||||
64
wiki/concepts/多Agent系统.md
Normal file
64
wiki/concepts/多Agent系统.md
Normal file
@@ -0,0 +1,64 @@
|
||||
---
|
||||
title: "多Agent系统"
|
||||
type: concept
|
||||
tags: [multi-agent, collaboration, agent]
|
||||
last_updated: 2026-04-14
|
||||
---
|
||||
|
||||
# 多Agent系统
|
||||
|
||||
多个专业Agent协同工作的架构模式,每个Agent有独特的角色和职责。
|
||||
|
||||
## 核心模式
|
||||
|
||||
### 分散式协调
|
||||
通过共享STATE.yaml文件协调,而非中央orchestrator:
|
||||
- Agent读写共享状态文件
|
||||
- 多子Agent并行工作
|
||||
- 主会话保持精简(CEO模式)
|
||||
|
||||
### STATE.yaml
|
||||
项目协调文件,作为单一事实来源:
|
||||
```yaml
|
||||
project: website-redesign
|
||||
tasks:
|
||||
- id: homepage-hero
|
||||
status: in_progress
|
||||
owner: pm-frontend
|
||||
```
|
||||
|
||||
### 团队配置示例
|
||||
- [[Milo]]:策略Lead
|
||||
- [[Josh]]:商业分析
|
||||
- Marketing Agent:营销研究
|
||||
- Dev Agent:开发
|
||||
|
||||
## 关键优势
|
||||
|
||||
1. **专业化分工**:每个Agent专注特定领域
|
||||
2. **并行执行**:多任务同时处理
|
||||
3. **可扩展性**:新增Agent无需修改主逻辑
|
||||
4. **共享记忆**:团队成员共享项目上下文
|
||||
|
||||
## 协作机制
|
||||
|
||||
- **Telegram路由**:通过标签分配到不同Agent
|
||||
- **共享内存**:项目文档、目标、决策
|
||||
- **私有上下文**:每个Agent独有会话历史
|
||||
- **定时任务**:Agent主动工作
|
||||
|
||||
## Race Condition处理
|
||||
|
||||
当多个Agent编辑同一文件时:
|
||||
1. AUTONOMOUS.md:仅主会话编辑
|
||||
2. memory/tasks-log.md:仅追加,子Agent只添加新行
|
||||
|
||||
## 使用场景
|
||||
|
||||
- [[多Agent专业团队]]
|
||||
- [[多Agent内容工厂]]
|
||||
- [[自主项目管理]]
|
||||
- [[动态仪表板]]
|
||||
|
||||
## 相关链接
|
||||
- [Anthropic: Building Effective Agents](https://www.anthropic.com/research/building-effective-agents)
|
||||
59
wiki/concepts/工作流自动化.md
Normal file
59
wiki/concepts/工作流自动化.md
Normal file
@@ -0,0 +1,59 @@
|
||||
---
|
||||
title: "工作流自动化"
|
||||
type: concept
|
||||
tags: [automation, workflow, n8n]
|
||||
last_updated: 2026-04-14
|
||||
---
|
||||
|
||||
# 工作流自动化
|
||||
|
||||
使用工具自动执行重复性任务,减少人工干预。
|
||||
|
||||
## 核心概念
|
||||
|
||||
### 工作流(Workflow)
|
||||
由多个任务节点按一定顺序执行的自动化流程。
|
||||
|
||||
### 节点(Node)
|
||||
工作流中的单个操作单元:
|
||||
- 触发器:启动工作流
|
||||
- 动作:执行具体操作
|
||||
- 工具:辅助功能
|
||||
- 代码:自定义逻辑
|
||||
- AI节点:嵌入AI能力
|
||||
|
||||
## 与AI Agent的关系
|
||||
|
||||
- **Workflow**:预定义自动化,一致输出
|
||||
- **Agent**:基于LLM动态决定工具和输出
|
||||
- **Agentic系统**:结合两者优势
|
||||
|
||||
## 平台
|
||||
|
||||
### N8N
|
||||
- 可视化拖拽界面
|
||||
- 400+预构建集成
|
||||
- 支持自托管
|
||||
|
||||
### OpenClaw
|
||||
- 通过skill扩展能力
|
||||
- 自然语言配置
|
||||
- 记忆和上下文保留
|
||||
|
||||
## 安全集成模式
|
||||
|
||||
[[OpenClaw + n8n工作流编排]]:
|
||||
- Webhook调用n8n
|
||||
- 凭证隔离在n8n
|
||||
- 工作流可锁定
|
||||
|
||||
## 使用场景
|
||||
|
||||
- [[会议纪要自动化]]
|
||||
- [[邮件管理自动化]]
|
||||
- [[日历聚合]]
|
||||
- [[社交媒体自动化]]
|
||||
|
||||
## 相关链接
|
||||
- [N8N官网](https://n8n.io/)
|
||||
- [OpenClaw文档](https://docs.openclaw.ai)
|
||||
31
wiki/concepts/思维链引导.md
Normal file
31
wiki/concepts/思维链引导.md
Normal file
@@ -0,0 +1,31 @@
|
||||
---
|
||||
id: chain-of-thought
|
||||
title: "思维链引导"
|
||||
type: concept
|
||||
tags: [prompt-engineering, reasoning]
|
||||
sources:
|
||||
- "[[如何写出完美的Prompt]]"
|
||||
last_updated: 2025-12-02
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
思维链引导是一种提示词技术,让AI逐步推理而非直接给出答案。
|
||||
|
||||
## Mechanism
|
||||
|
||||
通过在提示词中要求AI展示推理过程:
|
||||
- 先分析问题
|
||||
- 再列出步骤
|
||||
- 最后给出答案
|
||||
|
||||
## Benefits
|
||||
|
||||
- 提高AI推理准确性
|
||||
- 减少幻觉发生
|
||||
- 让用户理解决策过程
|
||||
- 便于发现AI思维漏洞
|
||||
|
||||
## Connections
|
||||
- [[Prompt能力]] ← uses ← [[思维链引导]]
|
||||
- [[需求拆解]] ← extends ← [[思维链引导]]
|
||||
62
wiki/concepts/技能系统.md
Normal file
62
wiki/concepts/技能系统.md
Normal file
@@ -0,0 +1,62 @@
|
||||
---
|
||||
title: "技能系统"
|
||||
type: concept
|
||||
tags: [skill, openclaw, extension]
|
||||
last_updated: 2026-04-14
|
||||
---
|
||||
|
||||
# 技能系统
|
||||
|
||||
OpenClaw的扩展机制,通过技能包添加新能力。
|
||||
|
||||
## 技能结构
|
||||
|
||||
```
|
||||
skills/
|
||||
├── skill-name/
|
||||
│ └── SKILL.md
|
||||
```
|
||||
|
||||
## 常用技能
|
||||
|
||||
### 集成技能
|
||||
- [[Telegram]]:消息通道
|
||||
- [[Discord]]:协作平台
|
||||
- [[Slack]]:团队通讯
|
||||
|
||||
### 数据技能
|
||||
- [[YouTube]]:视频内容获取
|
||||
- [[Reddit]]:社区内容聚合
|
||||
- [[GitHub]]:代码和项目数据
|
||||
|
||||
### 工具技能
|
||||
- [[arxiv-reader]]:学术论文读取
|
||||
- [[latex-compiler]]:LaTeX编译
|
||||
- [[youtube-full]]:YouTube完整集成
|
||||
|
||||
### MCP技能
|
||||
- [[n8n-mcp]]:N8N节点访问
|
||||
- [[idea-reality-mcp]]:创意验证
|
||||
|
||||
## 技能安装
|
||||
|
||||
通过ClawHub安装:
|
||||
```bash
|
||||
npx clawhub@latest install skill-name
|
||||
```
|
||||
|
||||
或通过OpenClaw:
|
||||
```text
|
||||
Install the youtube-full skill
|
||||
```
|
||||
|
||||
## 技能开发
|
||||
|
||||
技能是包含SKILL.md的目录,定义:
|
||||
- 工具列表
|
||||
- 使用方法
|
||||
- 配置要求
|
||||
|
||||
## 使用场景
|
||||
|
||||
详见各use case中的"Skills you Need"部分。
|
||||
33
wiki/concepts/提示词框架.md
Normal file
33
wiki/concepts/提示词框架.md
Normal file
@@ -0,0 +1,33 @@
|
||||
---
|
||||
id: prompt-framework
|
||||
title: "提示词框架"
|
||||
type: concept
|
||||
tags: [Nano Banana, prompt-engineering, image-generation]
|
||||
sources:
|
||||
- "[[Nano Banana提示词框架]]"
|
||||
last_updated: 2025-03-15
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
提示词框架是结构化描述图像生成需求的模板,通过标准化字段确保生成质量可控。
|
||||
|
||||
## Framework Types
|
||||
|
||||
### 物件描述框架
|
||||
- shot:镜头类型
|
||||
- subject:包含item/materials/details/condition
|
||||
- environment:环境描述
|
||||
- lighting:光线
|
||||
- camera:相机设置
|
||||
- color_grade:色彩分级
|
||||
- style:风格
|
||||
- quality:质量参数
|
||||
- negatives:负面提示
|
||||
|
||||
### 人物描述框架
|
||||
- subject:包含age/appearance/pose等字段
|
||||
|
||||
## Connections
|
||||
- [[AI图像生成]] ← uses ← [[提示词框架]]
|
||||
- [[Nano Banana]] ← supports ← [[提示词框架]]
|
||||
34
wiki/concepts/流程工程.md
Normal file
34
wiki/concepts/流程工程.md
Normal file
@@ -0,0 +1,34 @@
|
||||
---
|
||||
id: workflow-engineering
|
||||
title: "流程工程"
|
||||
type: concept
|
||||
tags: [AI, workflow, SOP, engineering]
|
||||
sources:
|
||||
- "[[Claude Skills最值得研究的AI范式]]"
|
||||
last_updated: 2026-01-05
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
流程工程是将重复任务拆解为AI能理解、稳定复用的流程,并通过Skills实现标准化的工程化方法。
|
||||
|
||||
## vs 提示词工程
|
||||
|
||||
| 维度 | 提示词工程 | 流程工程 |
|
||||
|------|------------|----------|
|
||||
| 核心 | 单次Prompt优化 | 全流程标准化 |
|
||||
| 稳定性 | 依赖模型表现 | SOP固化 |
|
||||
| 复用性 | 低 | 高 |
|
||||
| 目标 | 一次好结果 | 稳定可重复 |
|
||||
|
||||
## Key Elements
|
||||
|
||||
- **SOP标准化**:将经验沉淀为操作步骤
|
||||
- **Skills封装**:AI技能的模块化
|
||||
- **自动化执行**:交给AI稳定运行
|
||||
- **反馈迭代**:持续优化流程
|
||||
|
||||
## Connections
|
||||
- [[提示词工程]] ← evolves_to ← [[流程工程]]
|
||||
- [[Claude Skills]] ← implements ← [[流程工程]]
|
||||
- [[SOP标准化]] ← enables ← [[流程工程]]
|
||||
31
wiki/concepts/结构化表达.md
Normal file
31
wiki/concepts/结构化表达.md
Normal file
@@ -0,0 +1,31 @@
|
||||
---
|
||||
id: structured-expression
|
||||
title: "结构化表达"
|
||||
type: concept
|
||||
tags: [prompt-engineering, communication]
|
||||
sources:
|
||||
- "[[如何写出完美的Prompt]]"
|
||||
last_updated: 2025-12-02
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
结构化表达是用清晰逻辑组织信息的方法,确保AI准确理解人类意图。
|
||||
|
||||
## Principles
|
||||
|
||||
- 层次分明:按重要性和逻辑顺序组织
|
||||
- 格式统一:使用一致的标记和分隔符
|
||||
- 信息完整:不遗漏关键上下文
|
||||
- 表达精准:避免歧义和模糊表述
|
||||
|
||||
## Techniques
|
||||
|
||||
- 使用编号列表组织要点
|
||||
- 使用标题区分不同部分
|
||||
- 使用表格呈现结构化数据
|
||||
- 使用引用标记重要信息
|
||||
|
||||
## Connections
|
||||
- [[Prompt能力]] ← enables ← [[结构化表达]]
|
||||
- [[结构化思维]] ← implements ← [[结构化表达]]
|
||||
63
wiki/concepts/记忆系统.md
Normal file
63
wiki/concepts/记忆系统.md
Normal file
@@ -0,0 +1,63 @@
|
||||
---
|
||||
title: "记忆系统"
|
||||
type: concept
|
||||
tags: [memory, openclaw, context]
|
||||
last_updated: 2026-04-14
|
||||
---
|
||||
|
||||
# 记忆系统
|
||||
|
||||
AI Agent跨会话保留上下文和知识的能力。
|
||||
|
||||
## OpenClaw记忆机制
|
||||
|
||||
### 内置方案(builtin)
|
||||
原始记忆存储在Markdown文件中,系统维护本地索引方便检索。
|
||||
|
||||
### QMD方案
|
||||
围绕workspace中的Markdown文件,使用更强的检索/索引方式。
|
||||
|
||||
### 记忆流程
|
||||
```
|
||||
对话发生
|
||||
↓
|
||||
Agent通过普通文件工具把重要信息写入memory/或MEMORY.md
|
||||
↓
|
||||
下次对话开始
|
||||
↓
|
||||
Agent通过memory_search/memory_get检索相关记忆
|
||||
↓
|
||||
相关记忆被注入到当前对话上下文
|
||||
↓
|
||||
Agent表现出"我记得你说过……"的能力
|
||||
```
|
||||
|
||||
## 向量语义搜索
|
||||
|
||||
[[Semantic Memory Search]]使用memsearch:
|
||||
- 索引Markdown记忆文件到向量数据库
|
||||
- 通过含义搜索而非关键词
|
||||
- SHA-256内容哈希避免重复嵌入
|
||||
|
||||
## Workspace记忆文件
|
||||
|
||||
- [[memory/]]:按日期滚动的记忆笔记
|
||||
- [[MEMORY.md]]:长期知识总表
|
||||
- 与memory/目录兼容
|
||||
|
||||
## 关键洞察
|
||||
|
||||
- 对Agent来说,真正算数的长期记忆是Markdown文件
|
||||
- 向量索引只是派生缓存,可以随时重建
|
||||
- 文件永不修改
|
||||
|
||||
## 使用场景
|
||||
|
||||
- [[第二大脑]]
|
||||
- [[个人CRM]]
|
||||
- [[健康症状追踪]]
|
||||
|
||||
## 相关工具
|
||||
|
||||
- [[memsearch]]:向量语义搜索工具
|
||||
- [[Milvus]]:向量数据库后端
|
||||
36
wiki/concepts/需求拆解.md
Normal file
36
wiki/concepts/需求拆解.md
Normal file
@@ -0,0 +1,36 @@
|
||||
---
|
||||
id: requirement-decomposition
|
||||
title: "需求拆解"
|
||||
type: concept
|
||||
tags: [prompt-engineering, structured-thinking]
|
||||
sources:
|
||||
- "[[如何写出完美的Prompt]]"
|
||||
last_updated: 2025-12-02
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
需求拆解是将模糊目标转化为具体可执行子任务的过程。
|
||||
|
||||
## Methods
|
||||
|
||||
### 基础方法
|
||||
- **需求拆解法**:将复杂任务分解为简单步骤
|
||||
- **上下文补全法**:补充背景信息让AI理解场景
|
||||
- **格式定义法**:明确输出格式要求
|
||||
- **示例引导法**:提供参考案例
|
||||
|
||||
### 进阶策略
|
||||
- **思维链引导**:让AI逐步推理
|
||||
- **任务拆分**:大任务分解为子任务
|
||||
- **角色赋能**:赋予AI特定专业角色
|
||||
- **预填回复**:提供初始回答框架
|
||||
|
||||
### 高阶技巧
|
||||
- **跨模态联动**:结合多种输入输出形式
|
||||
- **领域知识注入**:嵌入专业知识
|
||||
- **反馈循环嵌入**:建立迭代优化机制
|
||||
|
||||
## Connections
|
||||
- [[Prompt能力]] ← requires ← [[需求拆解]]
|
||||
- [[结构化表达]] ← enables ← [[需求拆解]]
|
||||
29
wiki/concepts/音频概览.md
Normal file
29
wiki/concepts/音频概览.md
Normal file
@@ -0,0 +1,29 @@
|
||||
---
|
||||
id: audio-overview
|
||||
title: "音频概览"
|
||||
type: concept
|
||||
tags: [NotebookLM, learning, podcast]
|
||||
sources:
|
||||
- "[[7 ways I use NotebookLM to make my life easier]]"
|
||||
last_updated: 2025-11-23
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
音频概览是NotebookLM的功能,将文档转化为AI双人播客格式,适合被动学习。
|
||||
|
||||
## Mechanism
|
||||
|
||||
- AI分析文档内容
|
||||
- 生成两个AI声音的对话
|
||||
- 用户可以收听而非阅读
|
||||
|
||||
## Use Cases
|
||||
|
||||
- 通勤时学习
|
||||
- 视觉疲劳时继续学习
|
||||
- 将长文档转化为可听的摘要
|
||||
|
||||
## Connections
|
||||
- [[NotebookLM]] ← implements ← [[音频概览]]
|
||||
- [[被动学习]] ← uses ← [[音频概览]]
|
||||
37
wiki/concepts/首尾针动画.md
Normal file
37
wiki/concepts/首尾针动画.md
Normal file
@@ -0,0 +1,37 @@
|
||||
---
|
||||
id: frame-interpolation
|
||||
title: "首尾针动画"
|
||||
type: concept
|
||||
tags: [video, AI, animation]
|
||||
sources:
|
||||
- "[[固定镜头短视频AI全流程制作]]"
|
||||
last_updated: 2025-03-15
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
首尾针动画是通过AI自动补齐首尾帧之间中间动作的技术,实现平滑过渡效果。
|
||||
|
||||
## Mechanism
|
||||
|
||||
1. 确定起始帧和结束帧
|
||||
2. AI分析首尾帧的差异
|
||||
3. 自动生成中间过渡帧
|
||||
4. 输出流畅视频
|
||||
|
||||
## Tools
|
||||
|
||||
- 海螺AI
|
||||
- KAI
|
||||
- 其他AI动效工具
|
||||
|
||||
## Connection to Fixed-Camera Video
|
||||
|
||||
固定机位视频天然适合首尾针动画,因为:
|
||||
- 背景固定,减少AI生成负担
|
||||
- 只需关注主体变化
|
||||
- 更容易保持一致性
|
||||
|
||||
## Connections
|
||||
- [[AI视频制作]] ← uses ← [[首尾针动画]]
|
||||
- [[固定机位]] ← enables ← [[首尾针动画]]
|
||||
Reference in New Issue
Block a user