Hacker News with Generative AI: Incident Response

Meta Uses LLMs to Improve Incident Response (tryparity.com)
In June, Meta released an article titled Leveraging AI for efficient incident response on their engineering blog. In this article, engineers outline how they leveraged large language models to improve Meta's incident response capabilities. The headline metric from this report: Meta was able to use LLMs to successfully root cause incidents with 42% accuracy in their web monorepo. This means that nearly half the time, the mean time to resolution (MTTR) can potentially be reduced from hours to seconds.
Leveraging AI for efficient incident response (engineering.fb.com)
My post-mortem on the CrowdStrike incident (onboardbase.com)
Microsoft technical breakdown of CrowdStrike incident (microsoft.com)
CrowdStrike Incident Preliminary Post Incident Review (crowdstrike.com)
Preliminary Post Incident Review (crowdstrike.com)
CrowdStrike Incident Analysis (twitter.com)
Choose your own adventure style Incident Response (cmdzero.io)