Files
nexus/wiki/sources/testing-evidence-collector.md
2026-04-21 04:02:47 +08:00

3.7 KiB
Raw Blame History

title, type, tags, date
title type tags date
Evidence Collector source
testing
qa
evidence
agency
2026-04-21

Source File

Summary

  • 核心主题QA 证据收集智能体 EvidenceQA 的角色定义与测试方法论
  • 问题域AI Agent 开发中的质量保证流程,避免无证据的"幻想式"报告
  • 方法/机制:基于 Playwright 截图、可复现命令、事实检查的证据驱动 QA
  • 结论/价值:建立现实的质量评估标准,默认发现 3-5 个问题,要求视觉证据

Key Claims

  • 视觉证据是唯一真相:无法截图证明的功能视为不存在
  • 默认发现问题:首次实现总有 3-5+ 个问题,"零问题"是危险信号
  • 一切需证明:每个声明都需要截图证据支撑
  • 诚实质量评估Basic/Good/Excellent 级别,不接受虚假的 A+ 评分

Key Quotes

"Screenshots Don't Lie" — 视觉证据是唯一真相 "Default to Finding Issues" — 首次实现总有 3-5+ 个问题 "Prove Everything" — 每个声明都需要截图证据

Key Concepts

Key Entities

Connections

QA Report Template

# QA Evidence-Based Report

## 🔍 Reality Check Results
**Commands Executed**: [List actual commands run]
**Screenshot Evidence**: [List all screenshots reviewed]
**Specification Quote**: "[Exact text from original spec]"

## 📸 Visual Evidence Analysis
**Comprehensive Playwright Screenshots**: responsive-desktop.png, responsive-tablet.png, responsive-mobile.png, dark-mode-*.png
**What I Actually See**:
- [Honest description of visual appearance]

**Specification Compliance**:
- ✅ Spec says: "[quote]" → Screenshot shows: "[matches]"
- ❌ Spec says: "[quote]" → Screenshot shows: "[doesn't match]"

## 📊 Issues Found (Minimum 3-5)
1. **Issue**: [Specific problem]
   **Evidence**: [Screenshot reference]
   **Priority**: Critical/Medium/Low

## 🎯 Honest Quality Assessment
**Realistic Rating**: C+ / B- / B / B+ (NO A+ fantasies)
**Design Level**: Basic / Good / Excellent
**Production Readiness**: FAILED / NEEDS WORK / READY

## 🔄 Required Next Steps
**Status**: FAILED (default unless overwhelming evidence)
**Re-test Required**: YES

Testing Protocol

Accordion Testing

  • 对比展开前后的截图
  • 验证内容是否正确显示

Form Testing

  • 截图空表单、填写后表单
  • 验证提交、验证、错误提示

Mobile Responsive Testing

  • 1920x1080、768x1024、375x667 三种分辨率
  • 验证汉堡菜单、布局、配色

Dark Mode Testing

  • 验证深色模式切换功能
  • 检查截图中的 dark-mode-*.png

Automatic Fail Triggers

Fantasy Reporting Signs

  • 声称"零问题"
  • 完美评分A+, 98/100
  • 无证据的"豪华/高级"声明
  • 未测试就声称"生产就绪"

Visual Evidence Failures

  • 无法提供截图
  • 截图与声明不符
  • 截图中可见功能损坏
  • 基础样式被声称 为"豪华"