CrowdStrike Incident Report
Date of Incident: 07/19/2024
Time/Date Incident Started: 07/19/2024, 01:10 am EDT
Time/Date Stability Restored: 07/19/2024, 05:47 am EDT
Time/Date Incident Resolved: 07/19/2024, 10:00 am EDT
Users Impacted: All users
Frequency: Continuous
Impact: Major
Incident description:
On 7/19/2024 at 1:10 AM EDT, the ServiceChannel Database Administration (DBA) and Site Reliability Engineering (SRE) teams received alerts from step-based test monitors indicating that multiple ServiceChannel systems were failing their checks. Both teams immediately began investigating the cause.
Root Cause Analysis:
A global outage caused by CrowdStrike, a third-party vendor that provides an Endpoint Detection and Response (EDR) security platform, temporarily impacted the performance of ServiceChannel SaaS applications.
There was no security impact. The degradation of our services was caused by a third-party software component.
Actions Taken:
Upon further investigation, the SRE team identified a mitigation strategy for each affected asset.
The ServiceChannel SRE team applied the mitigation across all affected assets.
Mitigation Measures: