Service disruption for Impact in the Canada (CA-CENTRAL-1) region

Incident Report for Instructure

Postmortem

Date of Incident: August 12, 2025

Overview

On August 12, customers in our CA-CENTRAL-1 region experienced degraded service. The root cause was a significant power outage within one of our cloud provider's (Amazon Web Services) Availability Zones. This outage impacted the underlying infrastructure, including EC2 instances and EBS volumes, which support our SaaS platform. Our engineering team engaged immediately to monitor the situation and mitigate customer impact where possible. Service was fully restored once our provider resolved the power issue and all systems recovered.

Timeline of Events (MDT)

3:58 MDT: Automated monitoring detected service impact for customers in the CA-CENTRAL-1 region, and our on-call engineering team was engaged.
5:21 MDT: Our cloud provider reported that power had been restored to the affected data center. We observed initial signs of recovery across our platform.
5:55 MDT: All systems were confirmed to be fully operational, and normal service levels were restored for all impacted customers.

Resolution

The incident was fully resolved at 5:55 MDT. The total duration of the impact, from detection to full resolution, was 1 hour and 57 minutes.
‌

Next Steps

To improve resilience against future upstream provider issues, we are taking the following actions:

Review Regional Redundancy: We will conduct a thorough review of our regional redundancy and failover procedures to reduce the potential impact of single-region provider outages.
Enhance Provider Collaboration: We will continue to collaborate with our cloud provider to improve strategies for early detection, transparent communication, and faster mitigation of infrastructure-level incidents.

Posted Aug 12, 2025 - 06:55 MDT

Resolved

This incident has been resolved.

Posted Aug 12, 2025 - 05:55 MDT

Monitoring

Our cloud provider has restored power in the affected CA-CENTRAL-1 Availability Zone and is seeing signs of recovery. Some customers may still experience intermittent issues as AWS continues to restore all affected systems. We are monitoring their progress closely and will provide another update once services are fully stable.

Posted Aug 12, 2025 - 05:21 MDT

Investigating

We are currently investigating an issue affecting a subset of customers in the CA-CENTRAL-1 region. The disruption is related to a power issue in one of our cloud provider’s Availability Zones, which is affecting connectivity to some of our services. Our engineering team is working with the provider to restore full functionality. We will share an update as soon as more information becomes available.

Posted Aug 12, 2025 - 04:00 MDT

This incident affected: Impact.