Auto-sync: 2026-04-16 17:30
28
wiki/concepts/AI-driven-RCA.md
Normal file
@@ -0,0 +1,28 @@
---
title: "AI-driven RCA"
type: concept
tags: [AI, root-cause-analysis, incident-management]
sources: [How-Agentic-AI-can-help-for-Cloud-DevOps]
last_updated: 2026-04-16
---

## Summary
AI-driven RCA (AI-driven root cause analysis) uses machine learning to analyze logs and metrics and automatically identify the root cause of a failure.

## Definition
AI algorithms analyze logs, metrics, and event data from multiple sources to automatically locate the root cause of a system failure.

## Key Techniques
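- **Log correlation analysis**: correlates log events across services and across time
- **Anomaly pattern recognition**: identifies patterns that resemble historical outages
- **Causal chain inference**: builds the fault propagation chain to determine cause and effect
- **Multi-dimensional analysis**: analyzes the compute, network, storage, and application layers together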
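
Causal chain inference can be illustrated with a minimal Python sketch. It assumes a known service dependency graph and a set of services already flagged as anomalous; the names `depends_on`, `find_root_causes`, and `propagation_chain`, along with the sample services, are hypothetical and not taken from the source or from any monitoring product. The heuristic used here (an anomalous service whose dependencies are all healthy is a root-cause candidate) is a simplification, not the method of any particular tool.

```python
from collections import deque


def find_root_causes(depends_on, anomalous):
    """Return anomalous services whose own dependencies are all healthy."""
    return sorted(
        svc for svc in anomalous
        if not any(dep in anomalous for dep in depends_on.get(svc, []))
    )


def propagation_chain(depends_on, anomalous, symptom):
    """Breadth-first walk from a symptomatic service through its anomalous
    dependencies; returns the shortest chain ending at a root-cause
    candidate, or an empty list if none is reachable."""
    roots = set(find_root_causes(depends_on, anomalous))
    queue = deque([[symptom]])
    seen = {symptom}
    while queue:
        path = queue.popleft()
        if path[-1] in roots:
            return path
        for dep in depends_on.get(path[-1], []):
            if dep in anomalous and dep not in seen:
                seen.add(dep)
                queue.append(path + [dep])
    return []


# Hypothetical topology: each service maps to the services it calls.
DEPENDS_ON = {
    "frontend": ["checkout", "search"],
    "checkout": ["payments", "database"],
    "search": ["database"],
    "payments": [],
    "database": [],
}
# Hypothetical output of an upstream anomaly detector.
ANOMALOUS = {"frontend", "checkout", "database"}

print(find_root_causes(DEPENDS_ON, ANOMALOUS))
# ['database']
print(propagation_chain(DEPENDS_ON, ANOMALOUS, "frontend"))
# ['frontend', 'checkout', 'database']
```

In practice the anomalous set would come from the log and metric analysis described above, and a real system would likely also weigh event timing and confidence scores rather than rely on graph structure alone.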

## Tools
- [[CloudWatch]] (AWS)
- [[Stackdriver]] / Cloud Monitoring (GCP)
- Azure Monitor (Azure)

## Connections
- [[Agentic AI]] ← uses ← [[AI-driven RCA]]: Agentic AI integrates RCA capabilities
- [[MTTR]] ← reduces ← [[AI-driven RCA]]: AI-driven RCA shortens mean time to repair