Files
nexus/wiki/concepts/Content-Aggregation.md
2026-04-27 20:02:52 +08:00

29 lines
1.1 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
---
title: "Content-Aggregation"
type: concept
tags: [RSS, Data-Pipeline, Information-Retrieval]
sources: [multi-source-tech-news-digest.md]
last_updated: 2026-04-27
---
# Content-Aggregation
内容聚合——将来自多个异构来源的信息统一收集、去重、标准化后呈现的机制,是解决信息碎片化问题的核心手段。
## Definition
从多个来源RSS、社交媒体、API、Web 抓取等)收集内容,通过合并、去重、排序等处理,最终生成统一的结构化输出。
## Key Characteristics
- **多来源合并**支持不同协议和格式RSS/Atom、JSON API、HTML 爬取等)
- **标准化**统一内容格式标题、摘要、URL、时间戳、来源标签
- **时序整合**:按时间线重新排序跨来源的内容
- **质量分层**:按来源权威性、用户偏好等对内容分级
## Related Concepts
- [[Content-Deduplication]]:内容聚合的前置步骤
- [[Quality-Scoring]]:内容聚合的后置筛选
- [[RSSHub]]:生成标准化 RSS 的工具,使不原生支持 RSS 的来源可被聚合