DevOps Engineer Roadmap
Complete 10-phase roadmap from Linux basics to cloud-native engineering. Docker, Kubernetes, AWS, CI/CD, Terraform, monitoring, and security.
Linux & Networking Foundations
The bedrock of all DevOps work
What to learn
- File system navigation, permissions, users
- Process management (ps, top, kill, systemd)
- Networking basics (TCP/IP, DNS, HTTP, SSH)
- Shell scripting (bash, variables, loops, functions)
- Package managers (apt, yum, brew)
- Text processing (grep, awk, sed, jq)
Key tools
Git & Version Control
Collaborate on code like a pro
What to learn
- Commits, branches, merging, rebasing
- Pull requests and code review workflow
- Git hooks and conventional commits
- Monorepo vs multi-repo strategies
- Resolving merge conflicts
Key tools
Resources
Containers with Docker
Package and run anything, anywhere
What to learn
- Containers vs VMs — how Docker works
- Writing efficient multi-stage Dockerfiles
- Image layers, caching, and optimization
- Docker Compose for local dev environments
- Container networking and volumes
- Container security best practices
Key tools
Kubernetes
Orchestrate containers at scale
What to learn
- Control plane and worker node architecture
- Pods, Deployments, StatefulSets, DaemonSets
- Services, Ingress, NetworkPolicies
- ConfigMaps, Secrets, and resource limits
- RBAC, ServiceAccounts, PodSecurityAdmission
- HPA, VPA, Cluster Autoscaler
- Persistent Volumes and StorageClasses
Key tools
CI/CD Pipelines
Automate build, test, and deploy
What to learn
- Pipeline design — stages, jobs, artifacts
- GitHub Actions workflows and reusable workflows
- Jenkins pipelines as code (Jenkinsfile)
- Caching, parallelism, and matrix builds
- Secret management in pipelines
- GitOps with ArgoCD / Flux
- Progressive delivery: canary, blue-green
Key tools
Resources
Cloud — AWS
Deploy and scale on the world's largest cloud
What to learn
- IAM, STS, roles, policies, least-privilege
- VPC, subnets, security groups, NACLs
- EC2, Auto Scaling Groups, ALB/NLB
- S3, lifecycle policies, versioning
- EKS — cluster setup, IRSA, node groups
- Lambda, API Gateway, serverless patterns
- RDS, ElastiCache, DynamoDB basics
Key tools
Infrastructure as Code
Automate your entire infrastructure
What to learn
- Terraform — providers, modules, state, workspaces
- Terraform Cloud / S3 remote state + locking
- Helm charts for Kubernetes deployments
- Kustomize — overlays and patches
- Ansible for configuration management
- Policy as code — OPA, Sentinel
Key tools
Resources
Monitoring & Observability
Know what's happening in production
What to learn
- The 3 pillars: metrics, logs, traces
- Prometheus + Grafana for metrics
- Alertmanager and on-call runbooks
- ELK / Loki stack for log aggregation
- Distributed tracing with Jaeger / Tempo
- SLOs, SLAs, error budgets — SRE concepts
Key tools
Security & DevSecOps
Shift security left — bake it in from day one
What to learn
- SAST / DAST — scan code and running apps
- Container image scanning (Trivy, Snyk)
- Secret scanning — prevent credential leaks
- Kubernetes security hardening (OPA/Gatekeeper)
- Network policies and service mesh (Istio, Cilium)
- Compliance as code — CIS benchmarks
Key tools
AI/MLOps & Platform Engineering
The cutting edge of modern DevOps
What to learn
- ML pipeline orchestration (Kubeflow, Airflow)
- Model serving and inference (Triton, KServe)
- GPU workloads on Kubernetes
- Internal Developer Platforms (Backstage)
- Platform engineering and golden paths
- Cost engineering and FinOps
Key tools
Frequently Asked Questions
Common questions about the DevOps Engineer roadmap