cheatsheets Jun 28, 2026 updated Jun 28, 2026

Kubernetes Operational Checklist

A small operational checklist for Kubernetes services and AI workloads.

Status
evergreen
Visibility
public
Category
Infrastructure
Difficulty
intermediate
Published
Jun 28, 2026
Updated
Jun 28, 2026

Deployment

  • Requests and limits are set.
  • Readiness and liveness probes exist.
  • Rollout strategy is understood.
  • Config and secrets are separated.
  • Service account permissions are scoped.

Debugging

  • Logs are searchable by deployment, pod, and request ID.
  • Dashboards show error rate, latency, CPU, memory, and restarts.
  • Runbook includes kubectl commands.
  • Rollback command is documented.

AI Workloads

  • GPU scheduling is explicit.
  • Model artifact storage is documented.
  • Warmup behavior is known.
  • Queue depth and job failure rate are visible.

Source Links

Related Notes

Cheat Sheets Jun 28, 2026 intermediate

FastAPI Production Checklist

A compact checklist for taking a FastAPI service from useful prototype to production-ready backend.

Backlinks