Table of Contents

Kubernetes Design & Code Review Checklist

1. Architecture Review

Cluster Design

Namespace Design

Example:

production-payment
production-order
staging-payment
staging-order

2. Workload Review

Workload Type Selection

Requirement Kubernetes Resource
Stateless API Deployment
Background Worker Deployment
Database StatefulSet
Cache StatefulSet
Scheduled Task CronJob
One-time Task Job
Daemon on every node DaemonSet

Checklist:


3. Deployment Review

Deployment Configuration

strategy:
  type: RollingUpdate
  rollingUpdate:
    maxUnavailable: 1
    maxSurge: 1

Labels

Required labels:

labels:
  app: payment-api
  version: v1.2.0
  env: prod
  team: backend

Checklist:


4. Container Review

Container Image

Checklist:

Bad:

image: api:latest

Good:

image: api:v1.3.5

Security Context

Checklist:

securityContext:
  runAsNonRoot: true
  readOnlyRootFilesystem: true
  allowPrivilegeEscalation: false

5. Resource Management

CPU & Memory

Checklist:

resources:
  requests:
    cpu: "250m"
    memory: "256Mi"
  limits:
    cpu: "500m"
    memory: "512Mi"

Autoscaling

Checklist:

Example:

minReplicas: 2
maxReplicas: 10

6. Health Checks

Liveness Probe

Checklist:

livenessProbe:
  httpGet:
    path: /health
    port: 8080

Readiness Probe

Checklist:

readinessProbe:
  httpGet:
    path: /ready
    port: 8080

Startup Probe

Checklist:


7. Networking Review

Service Review

Checklist:

Ingress Review

Checklist:

Network Policies

Checklist:

kind: NetworkPolicy

8. Storage Review

Persistent Volumes

Checklist:

Stateful Applications

Checklist:


9. Configuration Management

ConfigMap

Checklist:

Secret Management

Checklist:


10. Security Review

RBAC

Checklist:

Bad:

cluster-admin

Good:

Role
RoleBinding

Pod Security

Checklist:

Supply Chain Security

Checklist:


11. Reliability Review

High Availability

Checklist:

podAntiAffinity:

Pod Disruption Budget

Checklist:

minAvailable: 1

Graceful Shutdown

Checklist:


12. Observability Review

Logging

Checklist:

Metrics

Checklist:

Tracing

Checklist:


13. CI/CD Review

Deployment Pipeline

Checklist:

GitOps

Checklist:

Deployment Strategies

Checklist:


14. Cost Optimization

Checklist:


15. Disaster Recovery

Backup

Checklist:

Recovery

Checklist:


16. Production Readiness Scorecard

Category Target Score
Security 9/10
Reliability 9/10
Scalability 9/10
Observability 9/10
Maintainability 9/10
Cost Optimization 8/10+
Disaster Recovery 8/10+

Final Production Review Questions

  1. [ ] Will it survive a Pod crash?
  2. [ ] Will it survive a Node crash?
  3. [ ] Will it survive a Zone failure?
  4. [ ] Can it scale automatically?
  5. [ ] Can it be deployed with zero downtime?
  6. [ ] Can it be rolled back safely?
  7. [ ] Is it secure by default?
  8. [ ] Is it observable?
  9. [ ] Can another engineer maintain it?
  10. [ ] Can it run at 3 AM without waking me up?

If all answers are YES, the Kubernetes platform/workload is considered Production Ready.