Establish Blameless Post-Mortems & Incident Reviews
This standard mandates the establishment of blameless post-mortems and incident reviews to treat failures as learning opportunities to improve systems and processes.
1. Establish Blameless Post-Mortems & Incident Reviews:
Treat failures as learning opportunities to improve systems and processes. This approach ensures that incidents are used to improve reliability and resilience.
- 1.1 Blameless Post-Incident Reviews:
- 1.1.1 Actionable Follow-ups:
- Conduct blameless post-incident reviews with actionable follow-ups.
- Automate the scheduling of post-incident reviews.
- 1.1.2 Review Management:
- Automate the tracking of review action items.
- Implement review tutorials.
- 1.2 Recurring Failure Pattern Tracking:
- 1.2.1 Root Cause Addressing:
- Track recurring failure patterns and proactively address root causes.
- Automate the tracking of failure patterns.
- 1.2.2 Pattern Management:
- Automate the tracking of root cause resolutions.
- Implement pattern feedback collection.
- 1.3 Cross-Team Insight Sharing:
- 1.3.1 Resilience Improvement:
- Share insights across teams to improve resilience and reliability.
- Automate the sharing of incident insights.
- 1.3.2 Sharing Management:
- Automate the tracking of insight sharing.
- Implement sharing tutorials.
By establishing blameless reviews, organisations can improve system resilience and reliability.