Update nexus: fix conflicts and sync local changes

2026-04-26 12:06:50 +08:00
parent 191797c01b
commit f09834b5a5
2443 changed files with 254323 additions and 255154 deletions
--- a/wiki/concepts/MTTA.md
+++ b/wiki/concepts/MTTA.md
@@ -1,71 +1,71 @@
-# MTTA (Mean Time to Acknowledge)
-
-## Definition
-MTTA (Mean Time to Acknowledge) is the average time from when a problem is detected to when a team member actively begins working on resolving it. It measures the speed of human response after an alert is triggered.
-
-MTTA is a component of MTTR, sitting between MTTD and Mean Time to Repair.
-
-## Why MTTA Matters
-
-MTTA measures:
- On-call response effectiveness
- Alert severity and clarity
- Incident management process efficiency
- Team availability and readiness
-
-A short MTTA ensures that once a problem is detected, the recovery process begins promptly.
-
-## Across DevOps Maturity Levels
-
-| Maturity | Acknowledgment Capability |
-|----------|--------------------------|
-| Phase 1 | Long MTTA — unclear ownership, manual processes, reactive responses |
-| Phase 2 | Improving — essential monitoring alerts team when issues affect users, ops staff manually intervene |
-| Phase 3 | Better process — ops team adopts automation techniques, but monitoring unchanged |
-| Phase 4 | Efficient acknowledgment — continuous monitoring with clear escalation paths, root cause analysis starts quickly |
-| Phase 5 | Rapid — high collaboration, rapid data-driven decision-making, minimal customer interruptions |
-
-## Key Factors Affecting MTTA
-
-### On-Call Practices
- Clear on-call rotations
- Fast escalation policies
- Adequate staffing levels
- Compensation for on-call duty
-
-### Alert Quality
- Actionable alerts (not noise)
- Clear severity levels
- Sufficient context in alerts
- Pre-configured runbook links
-
-### Incident Response Process
- Clear ownership and accountability
- Pre-defined roles (incident commander, communications lead)
- Escalation procedures
- Communication channels
-
-## MTTA as Part of MTTR
-
-```
-MTTR = MTTD + MTTA + Mean Time to Repair
-```
-
-All three components must be optimized for minimal MTTR. Even with perfect MTTD (instant detection), a long MTTA will result in poor overall recovery times.
-
-## How to Improve MTTA
- Implement PagerDuty, Opsgenie, or similar incident management tools
- Create clear escalation policies
- Practice incident response with regular game days
- Improve alert quality to reduce noise and fatigue
- Ensure adequate on-call coverage
- Pre-build runbooks for common incidents
-
-## Sources
- [[sources/devops-maturity-model-from-traditional-it-to-advanced-devops.md]]
-
-## Related Concepts
- [[concepts/MTTR]]
- [[concepts/MTTD]]
- [[concepts/DORA-Metrics]]
- [[concepts/DevOps-Maturity]]
+# MTTA (Mean Time to Acknowledge)
+
+## Definition
+MTTA (Mean Time to Acknowledge) is the average time from when a problem is detected to when a team member actively begins working on resolving it. It measures the speed of human response after an alert is triggered.
+
+MTTA is a component of MTTR, sitting between MTTD and Mean Time to Repair.
+
+## Why MTTA Matters
+
+MTTA measures:
+- On-call response effectiveness
+- Alert severity and clarity
+- Incident management process efficiency
+- Team availability and readiness
+
+A short MTTA ensures that once a problem is detected, the recovery process begins promptly.
+
+## Across DevOps Maturity Levels
+
+| Maturity | Acknowledgment Capability |
+|----------|--------------------------|
+| Phase 1 | Long MTTA — unclear ownership, manual processes, reactive responses |
+| Phase 2 | Improving — essential monitoring alerts team when issues affect users, ops staff manually intervene |
+| Phase 3 | Better process — ops team adopts automation techniques, but monitoring unchanged |
+| Phase 4 | Efficient acknowledgment — continuous monitoring with clear escalation paths, root cause analysis starts quickly |
+| Phase 5 | Rapid — high collaboration, rapid data-driven decision-making, minimal customer interruptions |
+
+## Key Factors Affecting MTTA
+
+### On-Call Practices
+- Clear on-call rotations
+- Fast escalation policies
+- Adequate staffing levels
+- Compensation for on-call duty
+
+### Alert Quality
+- Actionable alerts (not noise)
+- Clear severity levels
+- Sufficient context in alerts
+- Pre-configured runbook links
+
+### Incident Response Process
+- Clear ownership and accountability
+- Pre-defined roles (incident commander, communications lead)
+- Escalation procedures
+- Communication channels
+
+## MTTA as Part of MTTR
+
+```
+MTTR = MTTD + MTTA + Mean Time to Repair
+```
+
+All three components must be optimized for minimal MTTR. Even with perfect MTTD (instant detection), a long MTTA will result in poor overall recovery times.
+
+## How to Improve MTTA
+- Implement PagerDuty, Opsgenie, or similar incident management tools
+- Create clear escalation policies
+- Practice incident response with regular game days
+- Improve alert quality to reduce noise and fatigue
+- Ensure adequate on-call coverage
+- Pre-build runbooks for common incidents
+
+## Sources
+- [[sources/devops-maturity-model-from-traditional-it-to-advanced-devops.md]]
+
+## Related Concepts
+- [[concepts/MTTR]]
+- [[concepts/MTTD]]
+- [[concepts/DORA-Metrics]]
+- [[concepts/DevOps-Maturity]]