What does MTTR include?

MTTR typically includes detection time, diagnosis time, repair/fix time, and verification time. Some definitions only include the actual repair phase. Clarify which phases are included in your organization's MTTR definition.

DORA research classifies elite performers as those with an MTTR under 1 hour. High performers restore service within a day. The right target depends on service criticality: payment systems need sub-minute recovery, while batch processing can tolerate hours.

How do I reduce MTTR?

Invest in observability (logs, metrics, traces), create detailed runbooks, implement automated remediation for known failure modes, practice incident response, and ensure engineers have appropriate access and tooling. Record MTTR for each measurement period so that changes are easy to track over time.

Is MTTR the same as Mean Time to Recovery?

They are often used interchangeably, but some frameworks distinguish them. Mean Time to Repair focuses on the actual fix duration, while Mean Time to Recovery includes the full cycle from failure detection to service restoration.

How does MTTR relate to availability?

Availability = MTBF / (MTBF + MTTR). Reducing MTTR directly improves availability. If MTBF is 1000 hours and MTTR drops from 2 hours to 1 hour, availability improves from 99.8% to 99.9%.
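
The availability formula above can be checked with a few lines of Python. This is a minimal sketch; the function name `availability` is chosen here for illustration and is not part of the calculator.

```python
def availability(mtbf_hours: float, mttr_hours: float) -> float:
    """Steady-state availability as a fraction between 0 and 1."""
    if mtbf_hours <= 0 or mttr_hours < 0:
        raise ValueError("MTBF must be positive and MTTR non-negative")
    return mtbf_hours / (mtbf_hours + mttr_hours)


# The example from the text: MTBF of 1000 hours, MTTR dropping from 2 h to 1 h.
before = availability(1000, 2)
after = availability(1000, 1)
print(f"{before:.2%} -> {after:.2%}")  # prints "99.80% -> 99.90%"
```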

Should I track mean or median for repair time?

Median (p50) is more robust against outliers, but tracking both is valuable. Also track p90 and p95 repair times to understand worst-case scenarios and ensure consistently fast response rather than just average performance.
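
As a rough illustration of why the median and upper percentiles matter, the sketch below compares them with the mean. The six repair durations are hypothetical, and the nearest-rank percentile method is one of several common definitions, chosen here for simplicity.

```python
import statistics

# Hypothetical repair durations in minutes for six incidents (illustrative only).
repair_minutes = [20, 35, 40, 45, 60, 250]


def nearest_rank_percentile(values: list[float], pct: int) -> float:
    """Return the pct-th percentile using the nearest-rank method."""
    if not values:
        raise ValueError("values must not be empty")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in (0, 100]")
    ordered = sorted(values)
    # Ceiling of pct * n / 100, computed with integer arithmetic.
    rank = -(-pct * len(ordered) // 100)
    return ordered[rank - 1]


mean = statistics.mean(repair_minutes)
median = statistics.median(repair_minutes)
p90 = nearest_rank_percentile(repair_minutes, 90)
p95 = nearest_rank_percentile(repair_minutes, 95)
print(f"mean={mean} median={median} p90={p90} p95={p95}")
```

With this sample, the mean is 75 minutes while the median is 42.5 minutes: a single 250-minute incident pulls the mean well above the typical repair time, and it also sets both p90 and p95.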

MTTR Calculator (Mean Time to Repair)

Calculate Mean Time to Repair from total repair time and number of repairs. Measure and improve your incident resolution speed.

Total Repair Time (min)
Number of Repairs
Measurement Period (days)
Downtime Cost ($/hr)
Staff per Incident
Incident Severity

Metric	Value	Detail
MTTR (minutes)	75.0	1.25 hours per incident
MTTR (hours)	1.25	75 minutes
DORA Tier	High	< 1 day MTTR
Availability	98.9583%	450 min downtime in 30 days
Downtime Cost	$37,500.00	$5,000.00/hr × 1.25 hr × 6 incidents
Severity-Adjusted Cost	$56,250.00	1.5× multiplier for major severity
Annual Projected Cost	$456,250.00	~73 incidents/year extrapolated
Staff Hours Consumed	22.5	3 staff × 1.25 hr × 6 incidents

MTTR vs DORA Targets (chart): MTTR of 75 min shown against 10,080 min; availability of 98.9583% shown against 100%.

DORA Metrics Benchmarks

Tier	MTTR	Deploy Frequency	Change Fail Rate	Your Status
Elite	< 1 hour	On-demand	0–15%
High	< 1 day	Daily–Weekly	16–30%	← You are here
Medium	< 1 week	Monthly	16–30%
Low	> 6 months	Fewer than once per 6 months	> 30%
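
The MTTR column of the table can be turned into a simple lookup. This is a sketch only: the function name is hypothetical, and because the table leaves a gap between "< 1 week" and "> 6 months", the code assumes that anything of one week or longer falls into the Low tier.

```python
MINUTES_PER_HOUR = 60
MINUTES_PER_DAY = 24 * MINUTES_PER_HOUR
MINUTES_PER_WEEK = 7 * MINUTES_PER_DAY


def dora_mttr_tier(mttr_minutes: float) -> str:
    """Map an MTTR in minutes to a tier using the thresholds in the table."""
    if mttr_minutes < 0:
        raise ValueError("MTTR cannot be negative")
    if mttr_minutes < MINUTES_PER_HOUR:
        return "Elite"
    if mttr_minutes < MINUTES_PER_DAY:
        return "High"
    if mttr_minutes < MINUTES_PER_WEEK:
        return "Medium"
    # Assumption: everything of one week or longer is treated as Low.
    return "Low"


print(dora_mttr_tier(75))  # prints "High"
```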

Incident Cost Breakdown

Metric	Value
Base downtime cost/incident	$6,250.00
Total downtime cost	$37,500.00
Severity multiplier (major)	1.5×
Severity-adjusted total	$56,250.00
Annual projected incidents	73
Annual projected cost	$456,250.00
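
The figures in this breakdown can be reproduced with the sketch below. The formulas are inferred from the displayed numbers rather than taken from the calculator's source, so treat them as assumptions; in particular, the annual projection appears to scale the unadjusted total by 365 divided by the measurement period. All names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class IncidentCosts:
    base_per_incident: float
    total: float
    severity_adjusted: float
    annual_incidents: float
    annual_cost: float


def cost_breakdown(
    total_repair_minutes: float,
    incidents: int,
    period_days: float,
    cost_per_hour: float,
    severity_multiplier: float,
) -> IncidentCosts:
    """Reproduce the cost table from the calculator inputs (inferred formulas)."""
    if incidents <= 0 or period_days <= 0:
        raise ValueError("incidents and period_days must be positive")
    mttr_hours = total_repair_minutes / incidents / 60
    base = cost_per_hour * mttr_hours
    total = base * incidents
    return IncidentCosts(
        base_per_incident=base,
        total=total,
        severity_adjusted=total * severity_multiplier,
        # Extrapolate the measurement period to a 365-day year.
        annual_incidents=incidents * 365 / period_days,
        # Assumption: the projection uses the total before severity adjustment.
        annual_cost=total * 365 / period_days,
    )


# Inputs matching the example: 450 min, 6 incidents, 30 days, $5,000/hr, 1.5x.
costs = cost_breakdown(450, 6, 30, 5000.0, 1.5)
print(costs)
```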

Planning notes, formulas, and examples

About the MTTR Calculator (Mean Time to Repair)

Mean Time to Repair (MTTR) measures the average time required to restore a system to operational status after a failure. It is one of the most important reliability and incident response metrics, directly impacting service availability and user experience.

This calculator computes MTTR from total repair/recovery time and the number of repair events. A lower MTTR indicates faster incident resolution, which contributes to higher overall availability. Teams use MTTR to benchmark their incident response capabilities, identify process bottlenecks, and track improvement over time.

When This Page Helps

MTTR directly determines how long users experience outages. By tracking and reducing MTTR, teams can significantly improve availability even without reducing failure frequency. This page gives a direct MTTR computation that you can use to benchmark and improve your incident response process.

How to Use the Inputs

Sum the total time spent on all repairs/recoveries in the measurement period.
Enter the total repair time in minutes.
Enter the number of repair events.
Review the MTTR in minutes and hours.
Track MTTR trends over time to measure incident response improvements.

Formula used

MTTR = Total Repair Time / Number of Repairs. For 450 minutes across 6 incidents: MTTR = 75 minutes.

Example Calculation

Result: 75 minutes MTTR

With 450 total minutes spent on 6 repair events, the MTTR is 75 minutes (1.25 hours). This means that, on average, the team takes 1 hour and 15 minutes to restore service after a failure.
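
The same calculation, written as a minimal Python sketch (the function name `mttr_minutes` is hypothetical):

```python
def mttr_minutes(total_repair_minutes: float, repairs: int) -> float:
    """Mean Time to Repair: total repair time divided by number of repairs."""
    if repairs <= 0:
        raise ValueError("repairs must be a positive integer")
    return total_repair_minutes / repairs


mttr = mttr_minutes(450, 6)
hours, minutes = divmod(mttr, 60)
print(f"MTTR: {mttr:.1f} min ({mttr / 60:.2f} h, i.e. {int(hours)} h {int(minutes)} min)")
# prints "MTTR: 75.0 min (1.25 h, i.e. 1 h 15 min)"
```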

Tips & Best Practices

Break MTTR into detection time, diagnosis time, fix time, and verification time to find bottlenecks.
Runbooks and automated remediation dramatically reduce MTTR.
Practice incident response through game days and chaos engineering.
A target MTTR under 1 hour is considered excellent for most services.
On-call handoff procedures should preserve context to avoid MTTR spikes during transitions.
Post-incident reviews should focus on identifying and eliminating the largest MTTR contributors.

Understanding MTTR

MTTR is one of the four key DORA metrics that distinguish elite engineering teams. It measures how quickly your team can respond to and resolve production incidents, directly impacting user experience and business outcomes.

Components of Repair Time

Break down MTTR into its phases: detection (time from failure to alert), triage (time to assign and begin investigation), diagnosis (time to identify root cause), remediation (time to implement the fix), and verification (time to confirm restoration). Each phase offers optimization opportunities.
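
One way to apply this breakdown is to record a timestamp at each phase boundary and compute the duration of each phase. The sketch below assumes a hypothetical incident timeline; the event names and timestamps are invented for illustration.

```python
from datetime import datetime

# Hypothetical timeline for a single incident; all timestamps are illustrative.
timeline = {
    "failure": datetime(2026, 1, 10, 14, 0),
    "alert": datetime(2026, 1, 10, 14, 8),
    "investigation_started": datetime(2026, 1, 10, 14, 15),
    "root_cause_identified": datetime(2026, 1, 10, 14, 45),
    "fix_deployed": datetime(2026, 1, 10, 15, 5),
    "restoration_confirmed": datetime(2026, 1, 10, 15, 15),
}

# Each phase is bounded by a start event and an end event.
PHASES = [
    ("detection", "failure", "alert"),
    ("triage", "alert", "investigation_started"),
    ("diagnosis", "investigation_started", "root_cause_identified"),
    ("remediation", "root_cause_identified", "fix_deployed"),
    ("verification", "fix_deployed", "restoration_confirmed"),
]


def phase_minutes(events: dict[str, datetime]) -> dict[str, float]:
    """Return the duration of each phase in minutes."""
    return {
        name: (events[end] - events[start]).total_seconds() / 60
        for name, start, end in PHASES
    }


durations = phase_minutes(timeline)
bottleneck = max(durations, key=durations.get)
print(durations)
print(f"Largest contributor: {bottleneck}")
```

In this invented timeline the phases add up to 75 minutes, and diagnosis (30 minutes) is the largest contributor, so it would be the first place to look for improvements.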

Strategies for MTTR Reduction

Improve detection with comprehensive monitoring and alerting. Speed triage with clear escalation policies. Accelerate diagnosis with distributed tracing and structured logging. Automate remediation for known failure patterns. Streamline verification with automated health checks.

Benchmarking and Trends

Track MTTR as a rolling average over 30, 60, and 90 days. Compare across services, teams, and incident severity levels. Use trend data to justify investments in observability, automation, and training.
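
A rolling MTTR can be computed from a simple incident log. The sketch below uses a hypothetical log and a hypothetical helper `rolling_mttr`; it treats a window as the N days ending on the reporting date, which is one reasonable convention among several.

```python
from datetime import date, timedelta
from typing import Optional

# Hypothetical incident log: (date resolved, repair minutes).
incidents = [
    (date(2026, 1, 5), 90),
    (date(2026, 1, 20), 60),
    (date(2026, 2, 10), 45),
    (date(2026, 3, 5), 30),
    (date(2026, 3, 25), 75),
]


def rolling_mttr(
    log: list[tuple[date, float]], as_of: date, window_days: int
) -> Optional[float]:
    """Mean repair minutes over the window_days ending on as_of.

    Returns None when no incidents fall inside the window.
    """
    start = as_of - timedelta(days=window_days)
    in_window = [minutes for day, minutes in log if start < day <= as_of]
    if not in_window:
        return None
    return sum(in_window) / len(in_window)


as_of = date(2026, 3, 31)
for window in (30, 60, 90):
    print(f"{window}-day MTTR: {rolling_mttr(incidents, as_of, window)}")
```

Returning `None` for an empty window keeps "no incidents" distinct from "zero repair time", which matters when plotting trends.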

Sources & Methodology

Last updated: February 8, 2026
