# Incident Response — Resources

## Official Guides

- [Google SRE Book](https://sre.google/sre-book/) — Free online. Covers incident response, postmortems, SLOs, and error budgets. Essential reading.
- [Google SRE Book — Managing Incidents](https://sre.google/sre-book/managing-incidents/) — Direct chapter on incident management.
- [PagerDuty Incident Response Guide](https://response.pagerduty.com) — Roles, workflows, communication, and best practices.
- [PagerDuty — Postmortem Best Practices](https://response.pagerduty.com/before/incident-response/postmortem/) — Blameless postmortems.

## Articles

- [Blameless PostMortems and a Just Culture](https://codeascraft.com/2013/11/14/blameless-postmortems-and-a-just-culture/) — Etsy (Code as Craft). Why blameless matters and how to build the culture.
- [Writing an Incident Postmortem](https://www.atlassian.com/incident-management/postmortem) — Atlassian. Template and examples.
- [The Five Whys](https://www.atlassian.com/incident-management/postmortem/blameless) — Using 5 whys in postmortems.
- [Incident Communication Best Practices](https://www.pagerduty.com/blog/incident-communication-best-practices/) — PagerDuty. Status updates and stakeholder communication.
- [On-Call Best Practices](https://www.pagerduty.com/resources/learn/on-call-best-practices/) — PagerDuty. Rotation, runbooks, fatigue prevention.

## Books

- **Site Reliability Engineering** (O'Reilly) — Google SRE book in print. Covers incidents, SLOs, and more.
- **The Phoenix Project** by Gene Kim et al. — Novel about IT ops and incident response; illustrates principles.
- **An Elegant Puzzle** by Will Larson — Includes on-call, incident process, and team reliability.

## Tools

- [PagerDuty](https://www.pagerduty.com/) — Incident alerting, on-call scheduling, escalation.
- [OpsGenie](https://www.atlassian.com/software/opsgenie) — Alerting and on-call management.
- [Statuspage](https://www.atlassian.com/software/statuspage) — Public status pages.
- [Rootly](https://www.rootly.com/) — Incident management and runbooks.
- [Incident.io](https://incident.io/) — Incident response workflow and automation.

## Videos

- [Google SRE — How We Do It](https://www.youtube.com/results?search_query=google+sre+incident) — Search for talks on incident response.
- [Blameless Postmortems at Etsy](https://www.youtube.com/results?search_query=blameless+postmortem+etsy) — Culture and process.

## Podcasts

- [Engineering Culture by InfoQ](https://www.infoq.com/podcasts/engineering-culture/) — Episodes on reliability and incidents.
- [SRE Weekly](https://sreweekly.com/) — Newsletter; often covers incident and postmortem content.
