Update nexus wiki content
@@ -1,44 +1,55 @@
---
title: "Prometheus"
type: entity
tags: [monitoring, time-series, devops, observability]
last_updated: 2026-04-26
---

## Prometheus: Open-Source Monitoring System and Time-Series Database

**Official website:** https://prometheus.io/

**Type:** Open-source project / monitoring system

**Aliases:**
- prom
- Prometheus TSDB

---

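## Overview

Prometheus is an open-source monitoring system originally developed at SoundCloud and now hosted by the CNCF. It uses a **pull model** to collect metrics from configured targets, stores them as time-series data, and provides the PromQL query language and a flexible alerting rule engine.

**Core features:**
- Multi-dimensional data model (metric + labels)
- PromQL, a query language for selecting and aggregating time series
- Pull model rather than push (the server initiates each scrape, which keeps network access controllable and leaves targets non-intrusively instrumented)
- HTTP API (easy to integrate)
- Alertmanager integration

**Typical deployment port:** `9090` (Web UI + API)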
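
The pull model and the default port described above can be sketched as a minimal `prometheus.yml`. This is an illustrative fragment, not this wiki's actual configuration: the job name, the intervals, and the assumption that Prometheus runs on `localhost` are all placeholders. Because the server lists its own targets, adding or removing a monitored endpoint is a change to this file rather than to the monitored application.

```yaml
# prometheus.yml: minimal sketch; job name, intervals, and host are assumptions
global:
  scrape_interval: 15s      # how often each target is scraped
  evaluation_interval: 15s  # how often recording and alerting rules are evaluated

scrape_configs:
  - job_name: "prometheus"  # attached to every scraped series as the `job` label
    static_configs:
      # Prometheus scraping its own /metrics endpoint on the default port
      - targets: ["localhost:9090"]
```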

---

## Used By
- [[家庭监控方案-prometheus-grafana-node-exporter-cadvisor-blackbox]]

## Related Sources
- [[家庭监控方案-prometheus-grafana-node-exporter-cadvisor-blackbox]]
- [[家庭网络环境概览_2026-04-03]]
- [[ctp-topic-8-implementation-of-cloud-monitoring-using-micro-focus-operations-brid]]
- [[ctp-topic-60-monitor-aws-using-hyperscale-observability-with-grafana]]
- [[ctp-topic-67-cloud-native-observability-using-opentelemetry]]
- [[public-cloud-learning-sessions-observability-with-opentelemetry]]

---
title: "Prometheus"
type: entity
tags: [Monitoring, Observability, DevOps]
sources: [engineering-devops-automator]
last_updated: 2026-05-01
---

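# Prometheus

## Basic Information
- **Type**: Open-source monitoring system
- **Maintainer**: Hosted by the CNCF (Cloud Native Computing Foundation); originally developed at SoundCloud
- **Website**: https://prometheus.io

## Definition
Prometheus is an open-source systems monitoring and alerting toolkit. It periodically scrapes (pulls) metrics and provides a multi-dimensional data model, a query language (PromQL), and alert management.
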
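## Core Features
- **Multi-dimensional data model**: metric name plus a set of labels (key-value pairs)
- **PromQL**: a query language for selecting and aggregating metrics
- **Active scraping**: pulls metrics periodically over HTTP rather than passively receiving them
- **Alert management**: integrates with Alertmanager, which supports grouping, inhibition, and silencing
- **Service discovery**: automatically discovers monitoring targets, with support for Kubernetes, DNS, and others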
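
To make the service discovery item above concrete, here is a hedged sketch of two scrape jobs, one using Kubernetes discovery and one using DNS discovery. The job names, the domain `app.example.internal`, and the port are hypothetical, and the `prometheus.io/scrape` pod annotation is a widely used convention rather than anything built into Prometheus.

```yaml
scrape_configs:
  # Kubernetes: discover pods through the cluster API server
  - job_name: "kubernetes-pods"          # hypothetical job name
    kubernetes_sd_configs:
      - role: pod
    relabel_configs:
      # Keep only pods annotated with prometheus.io/scrape: "true"
      # (a common convention, assumed here for illustration)
      - source_labels: [__meta_kubernetes_pod_annotation_prometheus_io_scrape]
        action: keep
        regex: "true"

  # DNS: discover targets by resolving A records
  - job_name: "dns-discovered"           # hypothetical job name
    dns_sd_configs:
      - names: ["app.example.internal"]  # placeholder domain
        type: A
        port: 8080                       # A records carry no port, so one must be given
```

In both cases the target list is refreshed by Prometheus itself, so instances that appear or disappear are picked up without editing a static target list.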

## Role in DevOps Automator
- Core component of the monitoring and alerting stack
- Detects problems proactively through alerting rules (such as HighErrorRate and HighResponseTime)
- Integrates with Grafana to provide visualization dashboards
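
The Grafana integration mentioned above is typically wired up by registering Prometheus as a data source. The following is a minimal sketch of a Grafana data source provisioning file; the file location, the data source name, and the URL (Prometheus on the same host, default port `9090`) are assumptions, not details taken from the DevOps Automator setup.

```yaml
# e.g. provisioning/datasources/prometheus.yaml under Grafana's config directory (assumed path)
apiVersion: 1
datasources:
  - name: Prometheus            # display name inside Grafana (illustrative)
    type: prometheus
    access: proxy               # Grafana's backend issues the queries to Prometheus
    url: http://localhost:9090  # assumes Prometheus on the same host, default port
    isDefault: true
```

Dashboards then query this data source with PromQL, so the expressions used in alerting rules can be reused in panels.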

## Key Alert Example
```yaml
alert: HighErrorRate
# Fires when any series' 5xx request rate stays above 0.1 requests/second
expr: rate(http_requests_total{status=~"5.."}[5m]) > 0.1
for: 5m
labels:
  severity: critical
annotations:
  summary: "High error rate detected"
```
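
A single rule like the one above has to live inside a rule group before Prometheus can load it. The sketch below wraps it in a complete rules file and adds a possible shape for the HighResponseTime rule named earlier. The group name, the histogram metric `http_request_duration_seconds_bucket`, the 0.5 second threshold, and the `warning` severity are all hypothetical; the source does not define that rule.

```yaml
# alerts.yml: rules file sketch; group name and HighResponseTime details are assumptions
groups:
  - name: http-alerts
    rules:
      - alert: HighErrorRate
        expr: rate(http_requests_total{status=~"5.."}[5m]) > 0.1
        for: 5m
        labels:
          severity: critical
        annotations:
          summary: "High error rate detected"

      - alert: HighResponseTime
        # 95th-percentile latency over 5 minutes, from an assumed histogram metric
        expr: |
          histogram_quantile(
            0.95,
            sum by (le) (rate(http_request_duration_seconds_bucket[5m]))
          ) > 0.5
        for: 5m
        labels:
          severity: warning
        annotations:
          summary: "High response time detected"
```

A file like this is referenced from `rule_files` in `prometheus.yml`, and `promtool check rules alerts.yml` can validate its syntax before it is loaded.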

## Related Concepts
- [[Observability]]
- [[Grafana]]

## Related Tools
- Alertmanager (alert handling and routing)
- Grafana (metrics visualization)
- node-exporter (host metrics)
- cAdvisor (container metrics)
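
As a rough illustration of how the tools listed above connect to Prometheus, the fragment below scrapes node-exporter and cAdvisor and forwards alerts to Alertmanager. It assumes every component runs on `localhost` with its default port; the job names are illustrative.

```yaml
# prometheus.yml fragment; hosts and job names are assumptions
scrape_configs:
  - job_name: "node"
    static_configs:
      - targets: ["localhost:9100"]    # node-exporter default port (host metrics)
  - job_name: "cadvisor"
    static_configs:
      - targets: ["localhost:8080"]    # cAdvisor default port (container metrics)

alerting:
  alertmanagers:
    - static_configs:
        - targets: ["localhost:9093"]  # Alertmanager default port
```

Grafana is not configured here because it reads from Prometheus rather than being scraped or notified by it.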

## Aliases
- Prometheus
- Prometheus Monitor
- Prometheus Monitoring