Files
nexus/wiki/sources/Personal-Knowledge-Base-RAG.md

46 lines
1.9 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
---
title: "Personal Knowledge Base (RAG)"
type: source
tags: [agent, rag, knowledge-base, memory]
date: 2026-04-15
---
## Source File
- [[raw/Agent/usecases/knowledge-base-rag.md]]
## Summary
- 核心主题:语义可搜索个人知识库,自动从任意 URL文章/tweets/YouTube/PDF摄取内容
- 问题域:书签堆积无法检索,看过的内容找不到
- 方法/机制Drop URL 到 Telegram/Slack → 自动抓取内容 → 向量嵌入 → 语义搜索返回 ranked 结果+来源
- 结论/价值RAG 驱动的个人第二大脑;其他工作流可查询知识库获取相关已存内容
## Key Claims
- Drop any URL 自动摄取:文章/tweets/YouTube transcripts/PDFs
- 语义搜索:"What did I save about agent memory?" 返回 ranked 结果+来源引文
- 喂入其他工作流:视频创意管线构建 research cards 时自动查询知识库
- Zero maintenanceURL 即摄入触发器
## Key Quotes
> "You read articles, tweets, and watch videos all day but can never find that one thing you saw last week." — 核心痛点
## Key Concepts
- [[RAG]]Retrieval-Augmented Generation基于向量嵌入的语义检索
- [[个人知识库]]:第二大脑,内容自动积累+语义检索
- [[语义搜索]]:自然语言查询,返回 ranked 相关结果
- [[内容摄取]]URL → 结构化文本 → 向量嵌入全流程
## Key Entities
## Connections
- [[NotebookLM]] ← 共享 source-grounding 理念
- [[向量数据库]] ← 底层存储检索基础设施
- [[Embedding]] ← 语义表示核心机制
- [[Agentic AI]] ← 自主触发知识库查询
- [[Content Factory]] ← 可查询知识库获取背景信息
## Contradictions
- 与传统书签/笔记工具冲突:
- 当前观点:语义搜索+自动摄取优于手动书签整理
- 对方观点:手动整理确保质量和结构
- 结论:摄取自动化+人工审核结合最优