The cloud platform of Microsoft, Azure, had a significant breakdown on Wednesday, resulting in a worldwide disruption of various industries. Microsoft spent several hours restoring its systems, leaving users bewildered and frustrated throughout the disruption.
The outage, which was the cause of the trouble, started at around midday Eastern Time and was going on for more than eight hours. The outage affected Azure Communication Services, Media Services, and Microsoft 365 products that rely on the cloud network. By evening, Microsoft confirmed that performance had stabilized and that error rates had returned to pre-incident levels. However, a few customers continued to face minor issues while the company worked to resolve residual problems.
An outage impacted those situations to several organizations that disclosed the information. Among them, Alaska Airlines went through a push to its web and booking systems. Microsoft’s Azure global delivery network briefly disrupted online services at London’s Heathrow Airport, while Vodafone struggled to maintain its connections. Each company confirmed that they had resumed operations once Microsoft’s systems recovered.
According to data from Downdetector, user reports of Azure issues surged to more than 18,000 during the peak of the outage before dropping below 300 by early evening. Microsoft 365 followed a similar trend, with nearly 20,000 users initially reporting problems before numbers fell sharply later in the day.
Microsoft attributed the disruption to a configuration change within Azure Front Door, its content and application delivery network. The problem resulted in a significant number of customers experiencing timeouts and latency errors, which also affected Microsoft’s cloud-based tools. After the company’s engineering teams pulled the change that caused the issue, they also put in place extra monitoring to avoid such incidents again.
The blackout, which followed a large-scale disruption to Amazon Web Services, was less than a week apart. These two incidents in a row have made people question the reliability of hyperscale cloud infrastructure and its impact on the global digital operations, which is increasing rapidly.
Though the services have been restored, analysts are still warning that such situations expose the vulnerability of the tightly linked systems. In case enterprises rely on cloud networks for critical workloads, the incident is a warning that even the largest providers cannot be free from cascades of disruptions.
