- Sources: 5个新文档 - Concepts: ProxyChains, SOCKS5代理, Docker Daemon代理 - Index: 更新至 Batch 9 - 累计 sources: 108/182
31 lines
1.2 KiB
Markdown
31 lines
1.2 KiB
Markdown
---
|
||
title: "Playwright"
|
||
type: concept
|
||
tags: [browser-automation, testing, scraping, python]
|
||
date: 2025-09-29
|
||
---
|
||
|
||
## Definition
|
||
Playwright,Microsoft 开源浏览器自动化工具,支持 Chromium、Firefox、WebKit 三大渲染引擎,通过一致的 API 控制真实浏览器加载动态内容。
|
||
|
||
## Key Properties
|
||
- **三大引擎**:Chromium(Chrome)、Firefox、WebKit(Safari),跨浏览器一致性测试
|
||
- **无头模式(Headless)**:`playwright install chromium` 安装无头浏览器,无需图形界面
|
||
- **API 风格**:同步(sync_api)+异步(async_api)两套接口
|
||
- **自动等待**:Playwright 自动等待元素可操作后才执行操作,减少 flaky tests
|
||
- **scrapy-playwright**:将 Playwright 注册为 Scrapy 下载器中间件,处理 JavaScript 动态渲染页面
|
||
|
||
## Use Cases
|
||
- 动态网页爬取(JavaScript 渲染内容)
|
||
- 端到端测试(E2E Testing)
|
||
- 截图和 PDF 生成
|
||
- 自动化填表和交互
|
||
|
||
## Related Concepts
|
||
- [[Scrapy]]:通过 scrapy-playwright 集成作为动态内容爬取解决方案
|
||
- [[浏览器自动化]]:Playwright 属于浏览器自动化工具类别
|
||
- [[Playwright]](Entity):工具开发方 Microsoft
|
||
|
||
## Source
|
||
[[Scrapy-Playwright-抓取TikTok-Shop-Data]]
|