Auto-sync: 2026-04-28 12:03
This commit is contained in:
@@ -1,46 +1,35 @@
|
||||
---
|
||||
title: "Playwright"
|
||||
type: entity
|
||||
tags: ["browser-automation", "web-scraping", "playwright", "javascript"]
|
||||
sources: ["marketing-carousel-growth-engine"]
|
||||
last_updated: 2026-04-26
|
||||
tags: [browser-automation, open-source, microsoft]
|
||||
sources: [可自动化-可扩展-ai增强的电商数据采集与处理系统]
|
||||
last_updated: 2025-11-11
|
||||
---
|
||||
|
||||
## Definition
|
||||
|
||||
微软出品的浏览器自动化框架,支持 Chromium/Firefox/WebKit 多浏览器引擎。在 [[marketing-carousel-growth-engine]] 中用于抓取任意网站 URL,提取品牌信息和内容。
|
||||
|
||||
## Key Details
|
||||
|
||||
- **Engine**: Playwright with Chromium(完整 JavaScript 渲染页面抓取)
|
||||
- **Usage**: 导航目标 URL 及内部页面(pricing、features、about、testimonials)
|
||||
- **Script**: `analyze-web.js` 执行完整业务研究,输出 `analysis.json`
|
||||
- **Requires**: `playwright install chromium`
|
||||
|
||||
## Output (analysis.json)
|
||||
|
||||
- **品牌提取**: 名称、Logo、配色、字体、Favicon
|
||||
- **内容分析**: 标题、标语、功能、定价、客户评价、数据、CTA
|
||||
- **内部页面导航**: pricing / features / about / testimonials
|
||||
- **竞品检测**: 从网站内容识别 20+ 已知 SaaS 竞品
|
||||
- **业务分类**: 业务类型和细分市场识别
|
||||
- **钩子生成**: 针对细分市场的痛点钩子
|
||||
- **视觉上下文**: 用于幻灯片生成的视觉参考定义
|
||||
|
||||
## Usage in Carousel Growth Engine Pipeline
|
||||
|
||||
```
|
||||
Phase 2: Research & Analyze
|
||||
1. analyze-web.js → Playwright 完整分析目标 URL
|
||||
2. 品牌提取(颜色/字体/Logo/Favicon)
|
||||
3. 内容挖掘(功能/评价/定价/CTA)
|
||||
4. 细分检测(业务类型分类)
|
||||
5. 竞品映射(竞品识别)
|
||||
↓
|
||||
Phase 3: Generate & Verify
|
||||
Gemini 根据 analysis.json 提示词生成幻灯片
|
||||
```
|
||||
|
||||
## Aliases
|
||||
- Playwright
|
||||
- playwright
|
||||
- Microsoft Playwright
|
||||
- playwright-python
|
||||
- playwright-node
|
||||
|
||||
## Summary
|
||||
微软开源的浏览器自动化工具,支持 Chromium/Firefox/WebKit,提供可靠动态页面渲染能力。
|
||||
|
||||
## Description
|
||||
Playwright 是微软开发的浏览器自动化框架,支持多种编程语言(Python、Node.js)。
|
||||
|
||||
### 核心特性
|
||||
- 跨浏览器支持(Chromium、Firefox、WebKit)
|
||||
- 可靠等待机制,避免时序问题
|
||||
- 自动截图、PDF 生成
|
||||
- 无头模式和有头模式
|
||||
- 模拟移动端视图
|
||||
|
||||
### 电商场景适用性
|
||||
解决 JavaScript 渲染页面的抓取问题(如电商网站的无限滚动、懒加载、登录墙)。
|
||||
|
||||
## Use Cases
|
||||
- [[可自动化-可扩展-ai增强的电商数据采集与处理系统]] — 动态页面渲染解决方案
|
||||
|
||||
## Connections
|
||||
- [[Scrapy]] — scrapy-playwright 插件集成
|
||||
|
||||
|
||||
Reference in New Issue
Block a user