Off-Prem

Google Cloud burst by 12-hour power outage in German region

Loose juice led to cooling issue in one zone, but the pain was widespread


Google Cloud apologized on Thursday after its europe-west3 region – located in Frankfurt, Germany – experienced an outage lasting half a day.

The incident began at 02:30 local time on Thursday, October 24 and ended at 15:09 – a total of 12 hours and 39 minutes.

"We apologize for the inconvenience this service disruption/outage may have caused," wrote the cloudy giant.

Google identified the root cause as a power failure and cooling issue that led to parts of one of the region's three zones, europe-west3-c, to power down. Degraded services inevitably followed.

"Google engineers implemented a fix to return the datacenter to full operation and this mitigated the issue," stated the advisory.

Services and features affected included: Cloud Build, Cloud Developer Tools, Cloud Machine Learning, Google Cloud Dataflow, Google Cloud Dataproc, Google Cloud Pub/Sub, Google Compute Engine, Google Kubernetes Engine, Persistent Disk, and Vertex AI Batch Prediction.

Users experienced a range of issues across several Google Cloud services.

On Google Compute Engine, some users faced VM creation failures, delays in processing deletions, and certain instances in the affected zone were unavailable for operations.

In Google Kubernetes Engine, nodes in the impacted location were inaccessible, and some attempts to create new nodes failed. Persistent Disk instances were unreachable, preventing operations on them.

Users of Google Cloud Dataflow saw delays in scaling workers for batch jobs, and some streaming jobs failed to progress or scale properly. Existing Google Cloud Dataproc clusters remained functional, but some attempts to create new clusters failed. Cloud Build users may have experienced delays in starting custom worker pools.

While most problems were experience at the zonal level, there was some regional-level impact.

"For the other two zones in the same region, less than one percent of the operations that touch instance and disk resources experienced internal errors," insisted Google.

Multi-zonal problems occurred with Vertex AI Batch Prediction, which failed for some with the error message "Unable to prepare an infrastructure for serving within time."

The chocolate factory first notified users of the failure 26 minutes after it began, but did not offer any workaround until almost three hours into the outage. It eventually told impacted users to migrate workloads to other regions or zones and advised those with a degraded regional persistent disk to take regular snapshots.

Although europe-west-3 is not particularly known for outages, it did experience an incident that affected cloud workstations late last year.

This past May, Google Cloud had a very bad day when failed maintenance ops brought pain to users of 33 services – including the Compute Engine and Kubernetes Engine – for approximately two hours and 48 minutes.

That incident occurred around a week after the cloud provider deleted Australian pension fund UniSuper's entire account. That error was attributed to a tragic alignment of a bug and a misconfiguration. ®

Send us news
26 Comments

Alphabet posts big revenue and profit growth, just 1,100 job losses

Google Cloud grows fast thanks to AI, which now writes a quarter of all G-code

Russian court fines Google $20,000,000,000,000,000,000,000,000,000,000,000

Don't hold your breath Putin

Delta officially launches lawyers at $500M CrowdStrike problem

Legal action comes months after alleging negligence by Falcon vendor

Samsung phone users under attack, Google warns

Don't ignore this nasty zero day exploit says TAG

Google reportedly developing an AI agent that can control your browser

Project Jarvis will apparently conduct research, purchase products, and even book a flight on your behalf

European datacenter energy consumption set to triple by end of decade

McKinsey warns an additional 25GW of mostly green energy will be needed

Tech giants set to pay through the nose for nuclear power that's still years away

Google, Amazon, Microsoft dive into costly deals that aren't generating anything yet

US Army should ditch tanks for AI drones, says Eric Schmidt

And what do you know, Google's former CEO just so happens to have a commercial solution

US lawmakers push DoJ to prosecute tax prep firms for leaking taxpayer data to big tech

TaxSlayer, H&R Block, TaxAct, and Ramsey Solutions accused of sharing info with Meta and Google

Amazon to cough $75B on capex in 2024, more next year

Despite extending server lifespans, AI's power demands drive more datacenter builds

Microsoft accused of 'greenwashing' as AI used in fossil fuel exploration

Activists press Redmond to come clean on ‘material reputational, legal, and operational risks’

Dropbox to shed another 500 staff, CEO takes 'full responsibility'

Cloudy concern has also spent over $500M buying back its own shares amid multiple rounds of layoffs