---
title: "Problem Management"
type: concept
tags: [itsm, incident-management, operations]
date: 2025-03-01
---
## Definition
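Problem Management is one of the core processes of [[ITSM]]. It focuses on **identifying and analyzing the root causes of IT service problems** so that the same kind of incident does not recur. Unlike Incident Management, which treats symptoms, Problem Management addresses root causes.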
## Problem Management vs Incident Management
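| Dimension | Incident Management | Problem Management |
|-----------|---------------------|--------------------|
| Goal | Restore service quickly | Eliminate the root cause |
| Focus | Symptoms | Root causes |
| KPI | MTTR | Problem elimination rate |
| Time horizon | Immediate | Medium to long term |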
## Problem Management Process
```
┌──────────────┐     ┌──────────────┐     ┌──────────────┐
│   Problem    │  →  │  Root Cause  │  →  │ Known Error  │
│  Detection   │     │   Analysis   │     │   Document   │
└──────────────┘     └──────────────┘     └──────────────┘
        ↓                    ↓                    ↓
   AI Anomaly           ML-enhanced          Known Error
   Detection            RCA Process        Database (KEDB)
```
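The last stage of this flow, documenting a known error and finding it again when a matching incident arrives, can be sketched as a small in-memory structure. This is only an illustration under stated assumptions: the `KnownError` and `KnownErrorDatabase` classes, their fields, and the keyword-matching rule are all hypothetical and do not come from any particular ITSM tool.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class KnownError:
    """One hypothetical KEDB entry: a diagnosed root cause plus its workaround."""
    symptom_keywords: set[str]
    root_cause: str
    workaround: str


@dataclass
class KnownErrorDatabase:
    """Hypothetical in-memory KEDB; a real one would live in the ITSM platform."""
    entries: list[KnownError] = field(default_factory=list)

    def record(self, entry: KnownError) -> None:
        """Store the outcome of a finished root cause analysis."""
        self.entries.append(entry)

    def lookup(self, incident_summary: str) -> KnownError | None:
        """Return the first entry whose keywords all occur in the summary."""
        words = set(incident_summary.lower().split())
        for entry in self.entries:
            if entry.symptom_keywords <= words:
                return entry
        return None


kedb = KnownErrorDatabase()
kedb.record(KnownError(
    symptom_keywords={"login", "timeout"},
    root_cause="Connection pool exhausted on the auth service",
    workaround="Restart the auth service and raise the pool size",
))

match = kedb.lookup("Users report login timeout on the portal")
miss = kedb.lookup("Printer offline on floor 3")
```

Here `match` is the recorded entry and `miss` is `None`. Exact keyword matching is deliberately naive; it only shows where a KEDB lookup sits relative to incident handling.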
## Modern Problem Management (ITSM 2.0)
In [[ITSM 2.0]], problem management is AI-driven:
### AI-Driven Features
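- **Anomaly Detection**: automatically identifies anomalous patterns
- **Predictive Analytics**: predicts potential problems
- **ML-enhanced RCA**: uses machine learning to speed up root cause analysis
- **Automated KEDB Updates**: automatically updates the Known Error Database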
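
To give a rough feel for what automated anomaly detection does, the sketch below flags outliers in a metric series with a plain z-score test. This is a simple statistical stand-in rather than the AI or ML approach a real platform would use, and the function name, threshold, and sample data are assumptions made for illustration.

```python
from statistics import mean, stdev


def find_anomalies(values, threshold=3.0):
    """Return the indices whose z-score magnitude exceeds the threshold."""
    if len(values) < 2:
        return []  # stdev needs at least two points
    mu = mean(values)
    sigma = stdev(values)
    if sigma == 0:
        return []  # a flat series has no outliers
    return [i for i, v in enumerate(values) if abs(v - mu) / sigma > threshold]


# Hypothetical hourly error counts for one service.
error_counts = [4, 5, 3, 6, 4, 5, 4, 48, 5, 4, 6, 5]
anomalies = find_anomalies(error_counts, threshold=2.5)
print(anomalies)  # [7]
```

The spike at index 7 is the kind of signal that could open a problem record before users report repeated incidents.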
## Key Metrics
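| Metric | Description |
|--------|-------------|
| Problem Resolution Rate | Share of recorded problems that are resolved |
| Mean Time to Diagnose (MTTD) | Average time taken to diagnose a problem |
| Recurring Incidents | Number of incidents that recur |
| Known Error Accuracy | Accuracy of the recorded known errors |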
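
As a worked example, three of these metrics can be computed from plain records. The formulas below are one reasonable reading of the metric names, not definitions taken from the source, and the record layout and sample data are hypothetical. Known Error Accuracy is left out because the page does not say how it is measured.

```python
from collections import Counter
from datetime import datetime, timedelta

# Hypothetical problem records: (opened, diagnosed, resolved or None).
problems = [
    (datetime(2025, 3, 1, 9, 0), datetime(2025, 3, 1, 13, 0), datetime(2025, 3, 2, 9, 0)),
    (datetime(2025, 3, 3, 10, 0), datetime(2025, 3, 3, 12, 0), None),
]

# Hypothetical incident log: the category each incident was filed under.
incident_categories = ["auth-timeout", "disk-full", "auth-timeout", "auth-timeout"]

# Mean Time to Diagnose: average of (diagnosed - opened).
diagnose_times = [diagnosed - opened for opened, diagnosed, _ in problems]
mttd = sum(diagnose_times, timedelta()) / len(diagnose_times)

# Problem Resolution Rate: resolved problems divided by all problems.
resolved_count = sum(1 for _, _, resolved in problems if resolved is not None)
resolution_rate = resolved_count / len(problems)

# Recurring Incidents: every incident after the first in the same category.
recurring = sum(count - 1 for count in Counter(incident_categories).values())

print(mttd, resolution_rate, recurring)  # 3:00:00 0.5 2
```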
## Related Concepts
- [[ITSM]]: parent framework
- [[Incident-Management]]: incident management
- [[Root-Cause-Analysis]]: root cause analysis
- [[AIOps]]: AI-driven analytics capabilities
- [[MTTD]]: Mean Time to Diagnose
- [[Event-Correlation]]: event correlation
## Sources
- [[understanding-complete-itsm]] — AI-driven Problem Management