Article

Hyperscale

The Hyperscale Revolution Reshaping Data Centre Architecture

By Marcus Law

September 20, 2025

undefined mins

Share this article

Prioritise Us on Google

Share this article

Prioritise Us on Google

How will the design and construction of data centres evolve with the hyperscale revolution? (Credit: AWS)

AI workloads demanding 50kW+ rack densities are forcing a fundamental rethink of data centre architecture from cooling systems to custom silicon design

In 2015, there were 259 hyperscale data centres worldwide. Today, there are over 1,136 facilities, and the market has grown into a US$167bn industry expected to reach US$600bn by 2030. But the real story isn’t in the numbers – it’s in how AI is forcing a complete rethink of data centre design.

Traditional enterprise data centres, built for 5-10kW rack densities, are hitting a wall. AI workloads need 50kW+ power densities to run GPU clusters, and that changes everything. The cooling systems, the power distribution, the network architecture – none of it was designed for this kind of heat and computational load.

Building at a different scale

Hyperscale facilities operate on a completely different level than typical data centres. Where enterprise facilities might house hundreds of servers consuming 1-5MW of power, hyperscale operations deploy millions of servers across campuses that consume 20-100+ MW. Individual facilities exceed 10,000 square feet with at least 5,000 servers, though modern hyperscale campuses often span multiple buildings.

Sundar Pichai, CEO Google and Alphabet

“When we go through a curve like this, the risk of under-investing is dramatically greater than the risk of over-investing,” Google CEO Sundar Pichai said when announcing the company’s US$75bn commitment to AI infrastructure in 2025 at the company’s 2025 Next conference. “Even if it turns out that we are over-investing, these are infrastructure which are widely useful for us.”

The cooling crisis and liquid solutions

AI has created a thermal problem that traditional cooling can’t solve. The heat generated by thousands of GPUs running AI training workloads requires completely different approaches to heat removal. Air cooling, which has dominated data centres for decades, becomes inadequate when individual racks consume 50kW or more – roughly equivalent to powering 40 typical homes.

This shift has transformed liquid cooling from a niche technology into a rapidly growing market projected to reach US$21.14bn by 2032. Direct-to-chip liquid cooling systems pump coolant directly to processors through cold plates, achieving 30-40% energy efficiency gains whilst handling extreme power densities. Immersion cooling goes further, submerging entire servers in dielectric fluid that doesn’t conduct electricity but efficiently removes heat.

The efficiency gains are substantial. Google’s facilities already achieve 1.1 PUE (Power Usage Effectiveness) compared to industry averages of 1.67-1.8. This means that for every watt used by computing equipment, Google’s facilities only require an additional 0.1 watts for cooling and other infrastructure, whilst typical data centres need 0.67-0.8 watts.

Rajesh Sennik, Head of Data Centre Advisory at KPMG UK

“The industry is now facing unprecedented demand for new infrastructure solutions to efficiently power, cool and support this next generation of compute and as a result, AI is fundamentally reshaping the architecture of IT infrastructure,” says Rajesh Sennik, Head of Data Centre Advisory at KPMG UK.

Custom silicon and the move away from commodity hardware

The hyperscale operators have fundamentally altered their approach to hardware procurement, moving from off-the-shelf servers to custom-designed systems optimised for their specific workloads.

Andy Jassy, President and CEO at Amazon (Credit: Amazon)

Amazon’s Trainium chips are designed specifically for machine learning training workloads, whilst its Inferentia processors handle inference tasks. “Our new Trainium2 chips offer 30-40% better price-performance than the current GPU-powered compute instances generally available today,” Amazon CEO Andy Jassy wrote in his 2024 shareholder letter.

Google has taken custom silicon even further with its Tensor Processing Units (TPUs). Its seventh-generation TPU, codenamed ‘Ironwood’, is designed specifically for Google’s AI workloads and offers performance characteristics that can't be achieved with general-purpose processors.

Custom silicon allows hyperscale operators to reduce their dependence on traditional semiconductor suppliers whilst optimising power consumption for their specific use cases. This vertical integration provides both cost advantages and strategic control over their technology stack.

Network architecture transformation

The shift to AI workloads has necessitated a complete rethink of data centre networking. Traditional three-tier network architectures, designed for north-south traffic patterns between clients and servers, struggle with the east-west communication patterns that dominate AI training workloads.

Spine-leaf topologies have emerged as the dominant architecture for hyperscale facilities. This two-layer design connects every leaf switch (connected to servers) to every spine switch, ensuring that any server can communicate with any other server through a maximum of three network hops. This architecture provides predictable latency and can scale horizontally by adding more spine switches.

The bandwidth requirements are massive. 400G networking, which seemed cutting-edge just a few years ago, is rapidly giving way to 800G and 1.6T connections. The Ultra Ethernet Consortium, formed by Arista, Broadcom, Intel, Meta and Microsoft, is developing standards specifically for AI workloads that could challenge InfiniBand's dominance in high-performance computing.

Inside one of Microsoft's data centres (Credit: Microsoft)

Software-defined networking (SDN) has become essential for managing these complex topologies. Hyperscale operators use SDN controllers to automatically configure network paths, balance traffic loads, and respond to failures without human intervention. This automation is critical given the scale of these networks: a single hyperscale facility might contain hundreds of thousands of network ports.

The power constraint crisis

The industry's growth trajectory has created an unprecedented challenge that threatens to constrain AI development: power availability. US data centre power demand hit 46,000MW in Q3 2024, and projections show another 35 GW will be needed by 2030: the equivalent to adding 35 nuclear power plants.

Gartner expects that 40% of existing AI data centres will hit power constraints by 2027 with Northern Virginia, historically the dominant hyperscale market, hitting power grid limits that are forcing operators to look elsewhere.

The power requirements for AI workloads are fundamentally different from traditional computing. Training large language models requires sustained high power consumption across thousands of GPUs running in parallel for weeks or months. Unlike web servers that can scale up and down based on demand, AI training workloads run at full capacity continuously.

This has led to innovative approaches to power procurement. Hyperscale operators are increasingly partnering directly with utilities to develop new generation capacity, often focusing on renewable sources to meet sustainability commitments. Amazon has signed agreements for over 15 GW of renewable energy capacity, whilst Google has committed to achieving net-zero emissions across its operations by 2030.

The challenge extends beyond simply finding available power capacity to identifying strategic locations that can support long-term growth.

Cara Mascini, Chief Sustainability Officer at Switch Datacenters

“Where do you find the power? Where are those hotspots of take-off?” asked Cara Mascini, Chief Sustainability Officer at Switch Datacenters, during the Global Hyperscale Strategies panel at Data Centre LIVE 2025. “What we’re doing is looking where it makes sense to set up a data centre and where we can integrate with a great company to solve some of their issues.”

However, the environmental challenges are significant. Despite ambitious climate goals, Microsoft’s CO2 output increased 30% since 2020, whilst Google’s rose 48% from 2019.

Geographic expansion and market dynamics

Power constraints and land availability are driving hyperscale expansion into new markets. The traditional locations – Northern Virginia, Silicon Valley and other established tech hubs – are reaching capacity limits that force operators to consider secondary markets.

Atlanta absorbed 95MW in 2024 with another 2,159.3MW under construction, making it one of the fastest-growing hyperscale markets. The city offers relatively abundant power, land availability and good connectivity to major population centres. Similar patterns are emerging in Phoenix, Dallas and other secondary markets.

The scale of this geographic shift reflects broader market dynamics driving hyperscale expansion.

Yashnath Issur, CEO of Nxtra by Airtel Africa

“There’s a 20% to 30% growth globally on movement from in-house to cloud-based solutions, and the AI machine learning craze has taken over the world,” noted Yashnath Issur, CEO of Nxtra by Airtel Africa, during Data Centre LIVE 2025. This migration from traditional enterprise infrastructure to hyperscale facilities is accelerating the search for new locations with adequate power and connectivity infrastructure.

International expansion is accelerating too. Nordic countries attract investment for their combination of cheap, renewable electricity and natural cooling. The cold climate reduces cooling requirements, whilst abundant hydroelectric power provides clean energy at competitive prices.

Asia-Pacific represents the fastest-growing region at 29.1% annually, driven by digitisation efforts across emerging economies. However, regulatory complexity and local data sovereignty requirements create challenges for operators accustomed to globally distributed architectures.

Investment scale and market concentration

The financial commitment required for hyperscale operations continues to grow at an extraordinary pace.

Microsoft plans US$80bn in AI data centre investments for fiscal 2025, whilst Meta is targeting US$60-65bn. “Over half of the expected AI infrastructure spending will take place in the US,” Microsoft President Brad Smith said in January 2025.

This level of investment creates significant barriers to entry. Amazon, Microsoft and Google control 59% of hyperscale capacity, and their massive infrastructure investments make it increasingly difficult for competitors to achieve similar scale. The capital requirements for building hyperscale facilities – often exceeding US$1bn per campus – limit the number of players who can compete effectively.

However, specialised providers focused on AI workloads are finding opportunities. 137 new facilities came online in 2024 with 504 more in planning, indicating continued growth across the sector.

Mark Zuckerberg, CEO of Meta

“I actually think all the companies that are investing are making a rational decision,” says Meta’s Mark Zuckerberg, “because the downside of being behind is that you're out of position for the most important technology for the next 10 to 15 years.”

The Hyperscale Revolution Reshaping Data Centre Architecture

Building at a different scale

The cooling crisis and liquid solutions

Custom silicon and the move away from commodity hardware

Network architecture transformation

The power constraint crisis

Geographic expansion and market dynamics

Investment scale and market concentration

Company portals

Amazon

Google

KPMG

Meta

Microsoft

Switch

Tags