feat(wiki): ingest remaining subdirectories batch (51 files)

- Others: ChinaTextbook, Obsidian笔记系列, YouTube Channel ID, TikTok PM Django
- Skills: GOG CLI, Last30Days, baoyu-skills
- Vibe Coding: Cursor 2.0, Trae远程开发, Vibe-Kanban+OpenCode, vibe coding经验
- 微信公众号: 养虾日记1-5, AI时代赚钱
- 跨境电商: TikTok数据抓取, 选品策略, Superset Dashboard
- AI目录补充: 20个文件

Source pages: 51
Entities: TapXWorld, VibeKanban, OpenCode, Trae, SourceGrounding等
Concepts: 自举Meta生成, 5大设计原则, MD5去重, 混合搜索等
This commit is contained in:
2026-04-14 20:48:34 +08:00
parent 631b34fa88
commit f9ac3145ab
79 changed files with 2920 additions and 153 deletions

View File

@@ -0,0 +1,35 @@
---
title: "TikTok数据抓取"
type: source
tags: []
date: 2025-12-19
---
## Source File
- [[raw/跨境电商/Scrapy + Playwright 抓取TikTok Shop Data.md]]
## Summary
- 核心主题Scrapy+Playwright抓取TikTok Shop数据技术指南
- 问题域:电商数据采集、动态页面抓取
- 方法/机制创建venv安装scrapy+scrapy-playwrightplaywright install chromium
- 结论/价值实现TikTok Shop店铺数据的自动化抓取
## Key Claims
- 推荐创建虚拟环境(venv)隔离依赖避免与系统Python冲突
- scrapy-playwright结合Scrapy和Playwright优势支持动态页面抓取
- Docker容器需额外配置venv环境
## Key Concepts
- [[动态页面抓取]]Playwright处理JavaScript渲染
- [[Python虚拟环境]]venv隔离项目依赖
## Key Entities
- [[Scrapy]]Python爬虫框架
- [[Playwright]]:浏览器自动化工具
- [[TikTokShop]]:电商平台数据来源
## Connections
- [[Scrapy]] ← enhanced_by ← [[Playwright]]
## Contradictions
- 无冲突