Files
nexus/wiki/concepts/StackSets-Deployment-Visibility.md
2026-04-22 04:03:04 +08:00

2.5 KiB
Raw Blame History

title, type, tags, date
title type tags date
StackSets Deployment Visibility concept
AWS
CloudFormation
StackSets
Observability
CloudOps
2025-10-24

Definition

StackSets Deployment VisibilityStackSets 部署可观测性)是指在 AWS 多账户/多区域场景下,通过 EventBridge + CloudWatch Logs 实现对 CloudFormation StackSets 部署状态的集中监控和故障排查能力。核心目标是消除多账户部署中的监控盲区,提供跨账户的统一可观测性视图。

Core Properties

  • 事件捕获EventBridge Rules 捕获所有 CloudFormation 操作事件CREATE/UPDATE/DELETE
  • 跨账户转发EventBridge Custom Event Bus 将事件从成员账户转发到管理账户
  • 集中存储CloudWatch Log Group 聚合所有账户的 CloudFormation 日志
  • 统一查询CloudWatch Logs Insights 支持跨账户、跨区域的结构化日志分析

Event Flow

Member Account CloudFormation (CREATE/UPDATE/DELETE)
  → EventBridge Rule (pattern: CloudFormation events)
  → Event Bus (Custom, in Management Account)
  → CloudWatch Log Group (central-cloudformation-logs)
  → CloudWatch Logs Insights (aggregated queries)

Monitorable Events

  • Stack CREATE operation started/completed/failed
  • Stack UPDATE operation started/completed/failed
  • Stack DELETE operation started/completed/failed
  • Resource creation/update/deletion events
  • Stack set operation preferences (parallelism, fault tolerance)

Query Patterns (CloudWatch Logs Insights)

fields @timestamp, account, region
| parse @message /"resource-type":"(?<resource_type>[^"]+)"/ 
| parse @message /"status":"(?<status>[^"]+)"/ 
| parse @message /"logical-resource-id":"(?<logical_resource_id>[^"]+)"/ 
| filter status = "FAILED"
| sort @timestamp desc

Key Metrics to Track

  • Deployment success/failure rate by account
  • Time-to-deploy by resource type
  • Regional distribution of deployments
  • Failed operations and affected accounts
  • Deployment timeline and operation duration