Follow Us

Microsoft blames OS upgrade for Azure outage

Promises to tweak recovery algorithm.

Microsoft has admitted that an outage of its Azure cloud-computing infrastructure over the weekend was caused by a routine operating system upgrade.

In a post on its Windows Azure blog, Microsoft said after the upgrade "the deployment service within Windows Azure began to slow down due to networking issues" on Friday. "This caused a large number of servers to time out and fail," which brought some applications down, according to the post.

Microsoft said the Fabric Controller, a feature built into Azure that manages network resources and performs functions such as load balancing, automatically began taking steps to recover applications that were affected and move them to different servers. Only applications that were running in a single instance on the network went down during the outage, the company said.

"Very few applications running multiple instances went down, although some were degraded due to one instance being down," according to the post.

The ability to perform management tasks from Azure's web portal was also unavailable for many applications during the outage, which lasted between about 10:30 pm Pacific Time Friday and 8:30 pm Pacific Time Saturday.

Microsoft said it is refining and tuning Azure's recovery algorithm so when malfunctions occur in the future they will be handled "quickly and gracefully," it said in the post.

The company also recommends that people running applications in Azure deploy them in multiple instances, and will make two the "default in our project templates and samples," according to the post.

"We will not count the second instance against quota limits, so [users] can feel comfortable running two instances of each application role," Microsoft said.

Microsoft confirmed late Monday that Azure users suffered an overnight outage over the weekend during which their applications weren't available.

Currently only a test release of Azure is available, and some early adopters are running applications on it. Users can't expect an early test release of a product to run smoothly without hiccups. However, Azure is a proving ground for how well Microsoft will be able to support the development and deployment of hosted enterprise applications, for which even a short amount of downtime can pose a big problem.

Moreover, last week both Google and Microsoft had outages on their Gmail and Hotmail email services. Outages raise questions about the ability of these companies and other online service providers to maintain a consistent quality of service for end users over the long term.

Microsoft unveiled Azure at its Professional Developers Conference (PDC) in Los Angeles in October and according to public comments made by CEO Steve Ballmer last month, plans to make the infrastructure generally available by November at this year's PDC.






Send to a friend

Email this article to a friend or colleague:

PLEASE NOTE: Your name is used only to let the recipient know who sent the story, and in case of transmission error. Both your name and the recipient's name and address will not be used for any other purpose.

Techworld White Papers

State of software security report volume 4

If your business has anything worth protecting, be it money, intellectual property or a trusted...

Download Whitepaper

New threats demand innovative responses

Financial institutions in the UK remain susceptible to further systemic problems, as challenging...

Download Whitepaper

Delivering a competitive advantage through IT

IT organisations share a common mission; to optimise investments and streamline operations to...

Download Whitepaper

6 tips to mobilise your existing ERP

Enterprise mobile users throughout the global business community will number 1.19 billion by...

Download Whitepaper

Techworld UK - Technology - Business

Techworld Awards

Techworld Awards Winners 2011


Learn who the winners of this year's Techworld Awards are. Video footage coming soon...

Find out more
Techworld Mobile Site

Access Techworld's content on the move

Get the latest news, product reviews and downloads on your mobile device with Techworld's mobile site.

Find out more...

Site Map

* *