Dashboard Latency - Incident Report
Date of Incident: 5/16/2024
Time/Date Incident Started: 5/16/2024, 10:18 am EDT
Time/Date Stability Restored: 5/16/2024, 11:37 am EDT
Time/Date Incident Resolved: 5/16/2024, 12:00 pm EDT
Users Impacted: Some Users
Frequency: Intermittent
Impact: Minor
Incident description:
Some users of ServiceChannel that were utilizing the dashboard experienced slow loading times.
Root Cause Analysis:
Around 10:18 AM EDT, the ServiceChannel Site Reliability Engineering (SRE) team was alerted to slow response times on the Dashboard, affecting customer experience. The team quickly looked into the matter and identified that one of the ServiceClick application pools was exhibiting unusually high response times. Initial attempts to rectify the issue by restarting individual nodes did not resolve the problem. Further investigation led to the decision to reboot the entire application pool for ServiceClick. This measure effectively reduced response times and returned our services to their standard operational state.
Actions Taken:
Mitigation Measures: