Files
nexus/wiki/concepts/语义搜索.md
2026-04-23 04:02:48 +08:00

25 lines
1011 B
Markdown
Raw Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
---
title: "语义搜索"
type: concept
tags: [ai, search, vector, rag]
sources: [google-神级生产力工具-所有-github-开源平替都找到了, semantic-memory-search]
last_updated: 2026-04-23
---
## Definition
语义搜索Semantic Search是通过理解查询意图和文档语义的搜索方式超越传统关键词匹配基于向量相似度找到语义相关的内容。
## How It Works
1. **向量化**:将文档切分为 chunk通过 Embedding 模型转换为向量
2. **语义匹配**:将用户查询也转为向量,计算与文档向量的相似度
3. **结果排序**:按相似度得分返回最相关的内容
## Limitations
- 对专有名词(如人名、产品名、技术术语)召回率低
- 精确匹配需求场景表现不如 BM25
## Related Concepts
- [[混合搜索]] — 语义搜索 + BM25 的组合,互补各自局限
- [[RAG]] — 语义搜索是 RAG 检索层的核心
- [[重排序]]Re-ranking— 对语义搜索初筛结果进行二次精排