Sync: add kubernetes observability notes
This commit is contained in:
87
wiki/log.md
87
wiki/log.md
@@ -1,3 +1,75 @@
|
||||
## [2026-04-29] ingest | Public Cloud Learning Sessions - OpenText GIS Security Policies - 20241015
|
||||
- Source file: Cloud & DevOps/Public-Cloud-Learning-Sessions/07_Security/public-cloud-learning-sessions-opentext-gis-security-policies-20241015-160257-me.md
|
||||
- Status: ✅ 成功摄入
|
||||
- Summary: OpenText 全球信息安全团队(GIS)安全策略全景——Mike & Ed 主讲。GIS 分层组织架构(安全运营/合规/治理风险验证/隐私);OpenText 分层方法定义安全策略;ISO 27001 姿态框架(2022年更新);Global Information Security Policy(GISP)是最高纲领性政策,季度审查;每月处理 2250 亿条日志,分诊约 350 个案例;FedRAMP 等多项认证支撑多垂直市场销售。
|
||||
- Concepts identified: [[ISO-27001]], [[FedRAMP]], [[Global-Information-Security-Policy]], [[Security-Awareness-Training]], [[Third-Party-Penetration-Testing]], [[Threat-Intelligence]], [[BrightCloud]](均以 wikilink 形式记录于 Source page,各仅出现 1 次,暂不创建独立页面)
|
||||
- Entities identified: [[Mike]](GIS Team 主讲人,仅出现 1 次,以 wikilink 形式记录于 Source page), [[Ed]](GIS Team 主讲人,仅出现 1 次,以 wikilink 形式记录于 Source page)
|
||||
- Source page: wiki/sources/public-cloud-learning-sessions-opentext-gis-security-policies-20241015-160257-me.md
|
||||
- Notes:
|
||||
- 新增 1 个 Source Page(wiki/sources/public-cloud-learning-sessions-opentext-gis-security-policies-20241015-160257-me.md)
|
||||
- index.md 更新:Sources 节新增条目(日期 2026-04-14,置顶于所有条目最前)
|
||||
- overview.md 更新:新增 GIS Security Policies 摘要条目(置于 Thor Platform 之后,CTP Topic 28 之前);Key Concepts 新增 ISO-27001/FedRAMP(已有条目)、BrightCloud 等
|
||||
- Connections 已建立:与 [[ctp-topic-10-aws-landing-zone-lz-data-collection-tagging-related-security]] 建立 related_to 关系
|
||||
- 冲突检测:与 [[ctp-topic-10]] 的互补而非冲突关系已记录于 Source Page Contradictions 节——GISP 定义全局政策纲领,Landing Zone 层面通过标签和 SCP 实现技术落地
|
||||
|
||||
## [2026-04-25] ingest | CTP Topic 64 Scaling out with Amazon EKS
|
||||
- Source file: Cloud & DevOps/Public-Cloud-Learning-Sessions/04_EKS/ctp-topic-64-scaling-out-with-amazon-eks.md
|
||||
- Status: ✅ 成功摄入
|
||||
- Summary: Amazon EKS 工作负载扩缩容完整方法论——Pod 层:HPA(标准指标)+ KEDA(事件驱动);Node 层:Cluster Autoscaler(ASG 联动)+ Karpenter(直接 EC2 API);IP 耗尽解决方案:IPv6 双栈 VPC;集群稳定性:API Server PPF + CoreDNS 扩缩容。Suravpul 主讲。
|
||||
- Concepts identified: [[Horizontal Pod Autoscaler (HPA)]](已在 ctp-topic-59 提及), [[KEDA]](新), [[Cluster Autoscaler]](已在 ctp-topic-70 提及), [[Karpenter]](已在 Part 1 提及)
|
||||
- Entities identified: [[Suravpul]](AWS 高级解决方案架构师,ctp-topic-59/64/67 三专题讲师)
|
||||
- Source page: wiki/sources/ctp-topic-64-scaling-out-with-amazon-eks.md
|
||||
- Notes: 与 ctp-topic-59(EKS 可靠性,HPA/VPA)和 ctp-topic-70(IaC 部署,Cluster Autoscaler)形成互补知识链路。与 Part 3 EKS Auto Mode 共享 Karpenter 知识节点。
|
||||
|
||||
## [2026-04-25] ingest | CTP Topic 67 Cloud native observability using OpenTelemetry
|
||||
- Source file: Cloud & DevOps/Public-Cloud-Learning-Sessions/04_EKS/ctp-topic-67-cloud-native-observability-using-opentelemetry.md
|
||||
- Status: ✅ 成功摄入
|
||||
- Summary: AWS 解决方案架构师 Surav 分享的 EKS/ECS 云原生可观测性深度实践。涵盖可观测性三信号模型(Traces/Metrics/Logs)、OpenTelemetry Collector 架构(Receivers → Processors → Exporters)、ADOT 的多种 EKS/ECS 部署模式。核心观点:构建可观测的应用是开发者的责任;Trace 捕获调用栈各层处理耗时;Correlation ID 实现跨信号关联。
|
||||
- Concepts identified: [[OpenTelemetry]], [[Three Signals]], [[SIGV4 Auth Extension]], [[Correlation ID]]
|
||||
- Source page: wiki/sources/ctp-topic-67-cloud-native-observability-using-opentelemetry.md
|
||||
- Notes: 与 ctp-topic-60(Hyperscale Observability with Grafana)同属可观测性专题,与 public-cloud-learning-sessions-observability-with-opentelemetry-20240402 同属 OpenTelemetry 主题
|
||||
|
||||
## [2026-04-24] ingest | Public Cloud Learning Sessions - EKS Optimization Part 2 of 3 - Running Containers with Bottlerocket OS
|
||||
- Source file: Cloud & DevOps/Public-Cloud-Learning-Sessions/04_EKS/public-cloud-learning-sessions-eks-optimization-part-2-of-3-running-containers-w.md
|
||||
- Status: ✅ 成功摄入
|
||||
- Summary: Bottlerocket OS(火箭瓶)深度解析——AWS 专为容器工作负载优化的最小化开源 Linux 发行版。核心设计理念:最小化(去除包管理器/Shell/SSH,仅打包必要内核组件)、安全更新(分区镜像 A/B 切换确保原子性)、安全加固(dm-verity 根文件系统加密验证 + SE Linux enforcing 模式 + 根文件系统默认只读)。Variant 机制通过平台+架构+工作负载组件组合在构建时定制功能,支持 Bottlerocket for EKS AMI(自管理节点组)、托管节点组(Managed Node Groups)和 Carpenter 节点池三种集成方式。
|
||||
- Concepts identified: [[Immutable-Root-Filesystem]], [[dm-verity]], [[SE-Linux-Enforcing]], [[Partition-Updates]], [[CIS-Benchmark]]
|
||||
- Entities identified: [[Bottlerocket]], [[Amazon EKS]], [[AWS]]
|
||||
- Source page: wiki/sources/public-cloud-learning-sessions-eks-optimization-part-2-of-3-running-containers-w.md
|
||||
- Notes: EKS 优化三专题 Part 2(Part 1 = Karpenter 计算优化,Part 3 = EKS Auto Mode)。Bottlerocket Entity 和 5 个 Concept 均为新增。Part 3 的 EKS Auto Mode 默认使用 Bottlerocket 作为节点操作系统,形成知识链路补充。
|
||||
|
||||
## [2026-04-24] ingest | CTP Topic 42 Grafana Observability Dashboard
|
||||
- Source file: Cloud & DevOps/Public-Cloud-Learning-Sessions/04_EKS/ctp-topic-42-grafana-observability-dashboard.md
|
||||
- Status: ✅ 成功摄入
|
||||
- Summary: 企业级 Grafana 可观测性平台在 AWS 多账户环境下的架构设计与 Terraform IaC 自动化实践。涵盖 Grafana 核心定位(不存储数据,仅从数据源可视化)、基础设施架构(监控账户部署 Grafana,通过 IAM 角色跨账户访问产品团队 AWS 账户)、用户和团队访问控制、示例仪表盘(CPU/I/O/Network/EBS/Estimated Charges)、告警系统(Microsoft Teams 通知)、Terraform 模块化供给(数据源模块 + 组织模块 + LZSAP 自动化接入)、Prometheus 网络监控(Checkpoint/防火墙 SNMP 指标)。
|
||||
- Concepts identified: [[Observability(可观测性)]], [[Prometheus]], [[SNMP(Simple Network Management Protocol)]], [[IAM Role(跨账户角色)]]
|
||||
- Entities identified: [[AWS CloudWatch]], [[AWS Landing Zone]], [[Micro Focus Operations Bridge Manager]]
|
||||
- Source page: wiki/sources/ctp-topic-42-grafana-observability-dashboard.md
|
||||
- Notes: 该视频与 [[ctp-topic-60]] 均介绍 Grafana,视角互补(Grafana 本身 vs Hyperscale 场景),与 [[ctp-topic-54]] 和 [[ctp-topic-67]] 同属可观测性专题,共同构成监控知识体系。长期目标是构建应用级仪表盘替代 Micro Focus OBM。Entity 和 Concept 已有 Grafana/Prometheus/Terraform/Checkpoint 等,无需新建。
|
||||
|
||||
## [2026-04-25] ingest | CTP Topic 54 ESM SaaS Log Analytics
|
||||
- Source file: Cloud & DevOps/Public-Cloud-Learning-Sessions/04_EKS/ctp-topic-54-esm-saas-log-analytics.md
|
||||
- Status: ✅ 成功摄入
|
||||
- Summary: ITOM ESM SAS 架构师 Jackie 主讲的企业级日志分析解决方案——ELK/OpenSearch 技术栈架构(BEATS/Filebeat → Logstash → Elasticsearch/OpenSearch → Kibana)、双 VPC 隔离架构、Redis 缓冲层、GDPR 合规区域分割。安全:NVMe 静态加密、TLS 1.2、VPC 私有流量、RBAC。方案对比:AWS OpenSearch(~$1,500/月,SLA 99.9%,推荐)vs Logz.io(~$4,000/月,SLA 99.8%)vs 自托管 ELK vs Microfocus OBA。
|
||||
- Concepts identified: [[ELK Stack]], [[OpenSearch]], [[Logstash]], [[Kibana]], [[BEATS]], [[Filebeat]], [[Centralized-Logging]], [[Redis缓存]], [[RBAC]], [[TLS]], [[GDPR]]
|
||||
- Entities identified: [[AWS OpenSearch]], [[Jackie]]
|
||||
- Source page: wiki/sources/ctp-topic-54-esm-saas-log-analytics.md
|
||||
- Notes: 新建 Concept 页面 ELK-Stack.md、BEATS.md;新建 Entity 页面 AWS-OpenSearch.md;已更新 overview.md(Sources 条目 + Key Concepts);Key Concepts 列表中已有 Centralized-Logging、Redis缓存(Redis缓存.md)、TLS,未发现冲突内容
|
||||
|
||||
## [2026-04-26] ingest | CTP Topic 59 Achieving reliability with Amazon EKS
|
||||
- Source file: Cloud & DevOps/Public-Cloud-Learning-Sessions/04_EKS/ctp-topic-59-achieving-reliability-with-amazon-eks.md
|
||||
- Status: ✅ 成功摄入
|
||||
- Summary: Amazon EKS 可靠性最佳实践——Surav Paul(AWS 高级解决方案架构师)主讲。涵盖 ECS vs EKS 选型、可靠性五维度(故障检测/优雅降级/确定性故障/自愈/按需扩缩)、Shared Responsibility Model(Fargate 免除节点管理)、应用层可靠性(AZ 分散/拓扑约束/HPA/VPA/部署策略/健康探针/PodDisruptionBudget)、控制平面可靠性(指标监控/认证加固/Webhook 管理/集群升级)和数据平面可靠性(节点问题检测/资源预留/QoS/配额/Pod 优先级)。
|
||||
- Concepts identified: [[Reliability(系统可靠性)]], [[Application Reliability(应用可靠性)]], [[Control Plane Reliability(控制平面可靠性)]], [[Data Plane Reliability(数据平面可靠性)]], [[Shared Responsibility Model(EKS)]], [[Pod Anti-Affinity]], [[Topology Spread Constraints]], [[Horizontal Pod Autoscaler (HPA)]], [[Vertical Pod Autoscaler (VPA)]], [[Liveness/Readiness/Startup Probes]], [[PodDisruptionBudget]], [[Rolling/Blue-Green/Canary Deployment]](均以 wikilink 形式记录于 Source page;均仅出现 1 次,暂无独立页面)
|
||||
- Entities identified: [[Surav Paul]], [[Amazon EKS]], [[Amazon ECS]], [[AWS Fargate]](均以 wikilink 形式记录于 Source page;仅 [[Amazon EKS]] 在多个页面中反复出现,符合独立页面创建条件,其余仅出现 1 次,暂无独立页面)
|
||||
- Source page: wiki/sources/ctp-topic-59-achieving-reliability-with-amazon-eks.md
|
||||
- Notes:
|
||||
- 新增 1 个 Source Page(wiki/sources/ctp-topic-59-achieving-reliability-with-amazon-eks.md)
|
||||
- index.md 更新:新增 CTP Topic 59 条目于 Sources 节顶部
|
||||
- overview.md 更新:新增 CTP Topic 59 条目于 Cloud Transformation & DevOps → EKS 知识链路
|
||||
- Contradictions 记录:与 ctp-topic-39(EKS Lab LZ 网络部署)存在视角差异——Topic 39 面向受限网络环境的自定义网络方案,Topic 59 提供通用 EKS 可靠性最佳实践,互为补充而非冲突
|
||||
- 无需新建 Concept/Entity 独立页面(所有概念和实体仅在本页面出现 1 次;Amazon EKS 虽在多个其他页面提及,但本页面无新增独立维度,不单独创建)
|
||||
|
||||
## [2026-04-26] ingest | CTP Topic 29 Cloud Monitoring – SaaS LZ accounts
|
||||
- Source file: Cloud & DevOps/Public-Cloud-Learning-Sessions/04_EKS/ctp-topic-29-cloud-monitoring-saas-lz-accounts.md
|
||||
- Status: ✅ 成功摄入
|
||||
@@ -2177,3 +2249,18 @@
|
||||
- index.md 更新:在 Sources 节顶部添加新条目;在 Concepts 节添加 3 个新条目;移除 "source missing" 标记
|
||||
- overview.md 更新:添加新条目,位于 EKS Auto Mode 条目之后
|
||||
- 冲突检测:与 ctp-topic-59-achieving-reliability-with-amazon-eks 可能存在内容重叠(侧重点不同:Topic 70 侧重部署方法,Topic 59 侧重可靠性实践)
|
||||
|
||||
## [2026-04-27] ingest | Public Cloud Learning Sessions - Observability with OpenTelemetry
|
||||
- Source file: Cloud & DevOps/Public-Cloud-Learning-Sessions/04_EKS/public-cloud-learning-sessions-observability-with-opentelemetry-20240402-160113-.md
|
||||
- Status: ✅ 成功摄入
|
||||
- Summary: Jay Comer(AWS 解决方案架构师)主讲 OpenTelemetry 可观测性全景——三信号模型(Metrics/Logs/Traces)、OTLP 协议 + 11 种语言 SDK + Collector 架构、AWS Distribution for OpenTelemetry(统一代理 + EKS Operator 自动注入)、Fluent Bit → OTel Collector(端口 55681)→ Amazon OpenSearch 端到端管道演示。
|
||||
- Concepts created: [[OpenTelemetry]], [[Observability(可观测性)]], [[Three Signals]], [[OTLP(OpenTelemetry Protocol)]], [[Fluent Bit]]
|
||||
- Entities identified: [[Jay Comer]]
|
||||
- Source page: wiki/sources/public-cloud-learning-sessions-observability-with-opentelemetry-20240402-160113.md
|
||||
- Notes:
|
||||
- 新增 1 个 Source Page
|
||||
- index.md 更新:新增条目(日期 2024-04-02)
|
||||
- overview.md 更新:新增条目于 Cloud Transformation & DevOps → EKS 知识链路;Key Concepts 新增 5 个条目
|
||||
- 新增 Entity 页面:Jay-Comer.md
|
||||
- 新增 Concept 页面:OpenTelemetry.md
|
||||
- 冲突检测:与 ctp-topic-54-esm-saas-log-analytics(ELK 日志)、ctp-topic-67(CTP Topic 67 OpenTelemetry)互补,无冲突
|
||||
|
||||
Reference in New Issue
Block a user