77 lines
4.2 KiB
Markdown
77 lines
4.2 KiB
Markdown
---
|
|
title: Cloud Service Delivery
|
|
tags:
|
|
- cloud
|
|
- devops
|
|
- it-operations
|
|
created: 2026-04-22
|
|
---
|
|
|
|
# Cloud Service Delivery
|
|
|
|
## Definition
|
|
|
|
Cloud Service Delivery encompasses **the entire lifecycle of making cloud services operational, available, secure, performant, and valuable to end-users and customers.**
|
|
|
|
**In essence, Cloud Service Delivery is the bridge between the raw capabilities of cloud technology (IaaS, PaaS, SaaS) and the reliable, secure, performant, and cost-effective services that businesses and users actually consume.**
|
|
|
|
## The Bridge Concept
|
|
|
|
```
|
|
┌─────────────────────────────────────────────────────────────────┐
|
|
│ Cloud Service Delivery │
|
|
│ (The Bridge) │
|
|
│ │
|
|
│ Raw Cloud Capabilities ──────► Business Value for End Users │
|
|
│ (IaaS, PaaS, SaaS) (Reliable, Secure, Performant) │
|
|
└─────────────────────────────────────────────────────────────────┘
|
|
```
|
|
|
|
## 12 Operational Domains
|
|
|
|
1. **Service Provisioning & Deployment** — Setting up cloud infrastructure, automating deployments, configuring services, managing resource allocation and scaling
|
|
2. **Infrastructure Management** — Monitoring health/performance/capacity, patching, managing physical data center aspects, ensuring HA and DR
|
|
3. **Platform Management (PaaS)** — Managing middleware, databases, development tools, runtime environments, platform scalability/security/performance
|
|
4. **Application Operations & Management** — Monitoring app performance, deploying updates, managing configuration and secrets, ensuring scalability and resilience
|
|
5. **Security & Compliance Management** — Implementing security controls (firewalls, IDS/IPS, encryption, IAM), vulnerability scanning, incident response, regulatory compliance
|
|
6. **Performance & Availability Monitoring** — 24/7 monitoring, SLA/SLO tracking, proactive detection, incident response
|
|
7. **Incident & Problem Management** — Responding to alerts, troubleshooting, incident management, problem management (root cause analysis)
|
|
8. **Change & Configuration Management** — Change control, Infrastructure as Code (IaC), testing and rollback plans
|
|
9. **Cost Management & Optimization** — Monitoring consumption, eliminating waste, right-sizing, reserved instances/savings plans
|
|
10. **Customer Onboarding & Support** — User setup, documentation, helpdesk/service desk, billing inquiries
|
|
11. **Service Governance & Lifecycle Management** — Service catalogs, SLAs, service lifecycle, continuous improvement, vendor management
|
|
12. **Backup, Recovery & Disaster Management** — Backup strategies, restore testing, DR plans, failover/failback procedures
|
|
|
|
## Cloud Service Delivery Team Roles
|
|
|
|
- **Cloud Infrastructure Engineer**
|
|
- **Cloud Operation Engineer (DevOps/SRE)**
|
|
- **Cloud Security Specialists**
|
|
- **Cloud Support Engineer**
|
|
- **Cloud FinOps Engineer**
|
|
|
|
## Related Concepts
|
|
|
|
- [[Cloud DevOps Maturity Model]] — Maturity framework for evaluating cloud DevOps capabilities
|
|
- [[AIOps]] — Artificial Intelligence for IT Operations
|
|
- [[SLA]] / [[SLO]] — Service Level Agreements/Objectives
|
|
- [[FinOps]] — Cloud financial management
|
|
- [[DevOps]] — Development and Operations integration
|
|
- [[SRE]] — Site Reliability Engineering
|
|
- [[ITSM]] — IT Service Management
|
|
|
|
## Related Sources
|
|
|
|
- [[what-i-know-about-cloud-service-delivery-1]]
|
|
|
|
## Best Practices
|
|
|
|
| Domain | Best Practice |
|
|
|--------|---------------|
|
|
| Infrastructure Monitoring | AWS CloudWatch as data source in Grafana |
|
|
| Security | Cloud Application WAF management, IP whitelist to tenant level |
|
|
| Availability | APM/BPM, New Relic, AWS CloudWatch Synthetic, Health Page |
|
|
| Uptime | SLA 99.9% vs 99.99% ([uptime.is](https://uptime.is/)) |
|
|
| Alerting | Grafana Alerting with different severity levels |
|
|
| Change Management | Planned Change vs Emergency Change |
|