nexus/wiki/sources/testing-performance-benchmarker.md at b40abbcd473a7093d8261e212e3d6de97c1e516a

ishenwei/nexus

Fork 0

Files

weishen 111bc65b7b Update nexus wiki content

2026-05-03 05:42:12 +08:00

2.7 KiB

Raw Blame History

title, type, tags, date

title

type

Source File

Agent/agency-agents/testing/testing-performance-benchmarker.md

Summary（用中文描述）

核心主题：AI Agent 角色定义 —— Performance Benchmarker（性能基准测试专家）
问题域：系统性能测试、Web 性能优化、容量规划与可扩展性评估
方法/机制：通过数据驱动的性能测试方法，建立基线、识别瓶颈、提供优化建议并持续监控
结论/价值：为所有系统设定性能 SLA 标准（95% 置信度），通过 Core Web Vitals 和负载测试确保用户体验

Key Claims（用中文描述）

所有系统必须满足性能 SLA，要求 95% 置信度（Performance-First Methodology）
Core Web Vitals 达标标准：LCP < 2.5s、FID < 100ms、CLS < 0.1
性能优化前必须建立基线，并使用统计分析和置信区间进行测量
优先考虑用户感知性能，而非单纯的技术指标

Key Quotes

"Always establish baseline performance before optimization attempts" — 性能测试核心原则 "95th percentile response time improved from 850ms to 180ms through query optimization" — 数据驱动的优化案例 "Page load time reduction of 2.3 seconds increases conversion rate by 15%" — 用户体验影响量化

Key Concepts

PerformanceBenchmarker：AI Agent 角色，专注于测量、分析和改进系统性能，通过数据驱动的方法确保系统满足性能 SLA
CoreWebVitals：Web 性能核心指标，包括 LCP（最大内容绘制）、FID（首次输入延迟）、CLS（累计布局偏移），是衡量用户体验的关键标准
LoadTesting：负载测试，通过模拟真实用户行为在不同负载条件下测试系统性能，识别瓶颈
CapacityPlanning：容量规划，基于增长预测和Usage模式预测资源需求，评估水平/垂直扩展能力
ErrorBudget：错误预算，在可靠性工程中允许一定范围的故障时间，用于平衡创新与稳定性
RealUserMonitoring：真实用户监控（RUM），通过采集真实用户数据而非合成测试来衡量 Web 性能

Key Entities

k6：开源负载测试工具，Performance Benchmarker 在示例代码中使用的性能测试框架，支持多协议和自定义指标
Lighthouse：Google 开发的 Web 性能审计工具，可用于测量 Core Web Vitals 分数

Connections

TestingRealityChecker ← tests → PerformanceBenchmarker
TestingWorkflowOptimizer ← optimizes → PerformanceBenchmarker
APITester ← complements → PerformanceBenchmarker
TestingToolEvaluator ← evaluates → PerformanceBenchmarker

Contradictions

无已知冲突

2.7 KiB Raw Blame History Unescape Escape

Source File

Summary（用中文描述）

Key Claims（用中文描述）

Key Quotes

Key Concepts

Key Entities

Connections

Contradictions

2.7 KiB

Raw Blame History