Prioritise Recovery Time Over Uptime Guarantees
This standard requires teams to prioritise recovery time over uptime guarantees, so that they can detect, respond to, and recover from failures quickly rather than attempting to prevent every failure.
1. Prioritise Recovery Time Over Uptime Guarantees:
Not every failure can be avoided, so teams should be equipped to detect and recover from failures quickly. Shorter detection and recovery times reduce the downtime each incident causes.
- 1.1 Mean Time Tracking:
- 1.1.1 MTTD and MTTR Definition:
- Define and track Mean Time to Detect (MTTD) and Mean Time to Recover (MTTR).
- Automate the tracking of MTTD.
- 1.1.2 MTTR Tracking:
- Automate the tracking of MTTR.
- Provide tutorials on how MTTD and MTTR are measured and used.
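The tracking described in 1.1 can be sketched as a small calculation over incident records. This is a minimal illustration, not a prescribed implementation: the `Incident` record and its field names are hypothetical, and it assumes both metrics are measured from the moment the incident started (some organisations measure MTTR from detection instead).

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from statistics import mean


@dataclass(frozen=True)
class Incident:
    # Hypothetical record shape; the field names are assumptions.
    started_at: datetime
    detected_at: datetime
    recovered_at: datetime


def mean_time_to_detect(incidents: list[Incident]) -> timedelta:
    """Average delay between the start of an incident and its detection."""
    seconds = [(i.detected_at - i.started_at).total_seconds() for i in incidents]
    return timedelta(seconds=mean(seconds))


def mean_time_to_recover(incidents: list[Incident]) -> timedelta:
    """Average delay between the start of an incident and recovery.

    Assumption: recovery time is measured from the start, not from detection.
    """
    seconds = [(i.recovered_at - i.started_at).total_seconds() for i in incidents]
    return timedelta(seconds=mean(seconds))


# Illustrative data only.
incidents = [
    Incident(
        started_at=datetime(2024, 1, 1, 10, 0),
        detected_at=datetime(2024, 1, 1, 10, 5),
        recovered_at=datetime(2024, 1, 1, 10, 35),
    ),
    Incident(
        started_at=datetime(2024, 1, 2, 14, 0),
        detected_at=datetime(2024, 1, 2, 14, 15),
        recovered_at=datetime(2024, 1, 2, 15, 25),
    ),
]

mttd = mean_time_to_detect(incidents)   # detection delays of 5 and 15 minutes
mttr = mean_time_to_recover(incidents)  # recovery delays of 35 and 85 minutes
```

Both functions raise `statistics.StatisticsError` on an empty list, which is a reasonable signal that there is no data for the reporting period. In practice the timestamps would come from the alerting and incident management tools rather than being entered by hand.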
- 1.2 Incident Response Automation:
- 1.2.1 Playbook Implementation:
- Define playbooks in advance for anticipated incident types.
- Automate the execution of these incident response playbooks.
- 1.2.2 Playbook Management:
- Automate the tracking of playbook usage.
- Collect feedback on playbooks from the responders who use them.
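One way to picture the requirements in 1.2 is a small runner that maps alert types to predefined playbooks, executes their steps, counts usage, and stores responder feedback. This is a sketch under stated assumptions: `Playbook`, `PlaybookRunner`, the alert type names, and the step actions are all hypothetical, and real steps would call operational tooling instead of appending to a list.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Callable

# A step is any zero-argument callable that performs one remediation action.
Step = Callable[[], None]


@dataclass
class Playbook:
    name: str
    steps: list[Step]


@dataclass
class PlaybookRunner:
    # Hypothetical runner; not tied to any particular incident tool.
    playbooks: dict[str, Playbook] = field(default_factory=dict)
    usage: Counter = field(default_factory=Counter)
    feedback: list[tuple[str, str]] = field(default_factory=list)

    def register(self, alert_type: str, playbook: Playbook) -> None:
        """Associate a predefined playbook with an alert type."""
        self.playbooks[alert_type] = playbook

    def handle(self, alert_type: str) -> bool:
        """Run the matching playbook; return False if none is registered."""
        playbook = self.playbooks.get(alert_type)
        if playbook is None:
            return False  # no playbook: the alert needs a human responder
        for step in playbook.steps:
            step()
        self.usage[playbook.name] += 1  # usage tracking
        return True

    def add_feedback(self, playbook_name: str, comment: str) -> None:
        """Record responder feedback against a playbook."""
        self.feedback.append((playbook_name, comment))


# Illustrative usage; the actions only record what they would do.
actions: list[str] = []
runner = PlaybookRunner()
runner.register(
    "disk_full",
    Playbook(
        name="free-disk-space",
        steps=[
            lambda: actions.append("rotate logs"),
            lambda: actions.append("purge temp files"),
        ],
    ),
)

handled = runner.handle("disk_full")
unhandled = runner.handle("unknown_alert")
runner.add_feedback("free-disk-space", "Second step needs a dry-run option.")
```

Returning `False` for an unknown alert type keeps the automation honest: anything without a predefined playbook is escalated rather than silently ignored, and the usage counter shows which playbooks are actually exercised.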
- 1.3 Disaster Recovery (DR) Testing:
- 1.3.1 DR Plan Improvement:
- Regularly test and improve disaster recovery (DR) plans.
- Automate the execution of DR tests.
- 1.3.2 DR Management:
- Automate the tracking of DR test results.
- Provide tutorials on DR plans and test procedures.
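Tracking DR test results, as required in 1.3, could look like the sketch below. It records each test run and reports the scenarios whose most recent run missed its recovery target. The `DrTestResult` record, the scenario names, and the idea of a per-scenario target recovery time are assumptions made for illustration; this standard does not define them.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass(frozen=True)
class DrTestResult:
    # Hypothetical record shape; the field names are assumptions.
    run_on: date
    scenario: str
    recovery_time: timedelta  # measured during the test
    target: timedelta         # assumed per-scenario recovery target

    @property
    def passed(self) -> bool:
        return self.recovery_time <= self.target


def failing_scenarios(results: list[DrTestResult]) -> list[str]:
    """Return scenarios whose most recent test missed its target."""
    latest: dict[str, DrTestResult] = {}
    # Iterate oldest to newest so later runs overwrite earlier ones.
    for result in sorted(results, key=lambda r: r.run_on):
        latest[result.scenario] = result
    return sorted(name for name, r in latest.items() if not r.passed)


# Illustrative data only.
results = [
    DrTestResult(date(2024, 1, 10), "database-failover",
                 timedelta(minutes=50), timedelta(minutes=30)),
    DrTestResult(date(2024, 4, 10), "database-failover",
                 timedelta(minutes=25), timedelta(minutes=30)),
    DrTestResult(date(2024, 4, 12), "region-outage",
                 timedelta(hours=5), timedelta(hours=4)),
]

needs_work = failing_scenarios(results)
```

Looking only at the latest run per scenario means a plan that failed once and was then improved drops off the list, which matches the goal of testing and improving DR plans over time.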
By prioritising recovery time, organisations can ensure rapid incident response and reduce downtime.