What is an Error Budget?
The acceptable amount of downtime or errors before an SLO is breached.
An error budget is the difference between 100% reliability and your SLO target. If your SLO is 99.9% availability, your error budget is 0.1% of the time period, or about 43 minutes in a 30-day month. Error budgets make reliability a business conversation: while the budget is healthy, you can ship features faster; once it is exhausted, you stop shipping new features and focus on reliability work.
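As a rough illustration of the arithmetic, here is a minimal Python sketch. It assumes a time-based availability SLO and a 30-day period. The function names (`error_budget_minutes`, `budget_remaining`, `can_ship_features`) are hypothetical and not taken from any particular tool.

```python
def error_budget_minutes(slo_percent: float, period_days: int = 30) -> float:
    """Minutes of allowed downtime for an availability SLO over the period."""
    if not 0 < slo_percent < 100:
        raise ValueError("slo_percent must be between 0 and 100 (exclusive)")
    total_minutes = period_days * 24 * 60
    return total_minutes * (100 - slo_percent) / 100


def budget_remaining(slo_percent: float, downtime_minutes: float,
                     period_days: int = 30) -> float:
    """Fraction of the error budget still unspent (negative if overspent)."""
    budget = error_budget_minutes(slo_percent, period_days)
    return (budget - downtime_minutes) / budget


def can_ship_features(slo_percent: float, downtime_minutes: float,
                      period_days: int = 30) -> bool:
    """Assumed policy: keep shipping only while some budget is left."""
    return budget_remaining(slo_percent, downtime_minutes, period_days) > 0


if __name__ == "__main__":
    # 99.9% over 30 days: 43,200 minutes * 0.1% = 43.2 minutes
    print(f"{error_budget_minutes(99.9):.1f} minutes")
    # 50 minutes of downtime overspends the budget, so this prints False
    print(can_ship_features(99.9, downtime_minutes=50))
```

Real policies are usually less binary than `can_ship_features`. Teams often act on how fast the budget is burning rather than waiting for it to reach zero.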
Deep Dive Guide
DORA Metrics Will Define DevOps Teams
More Monitoring Terms
Alertmanager
Prometheus component that handles alert routing, grouping, and notification delivery.
Grafana
An open-source analytics and visualization platform for metrics, logs, and traces.
Loki
Grafana's horizontally scalable log aggregation system inspired by Prometheus.
Observability
The ability to understand the internal state of a system from its external outputs.
OpenTelemetry
An open-source observability framework for generating metrics, logs, and traces.
Prometheus
An open-source monitoring system that scrapes and stores time-series metrics.