AI Hypercomputer

The supercomputing system underneath every AI workload on Google Cloud. Customize its components using fully integrated hardware, open software, and flexible consumption models.

Overview

AI-optimized hardware

Choose from compute, storage, and networking options optimized for granular, workload-level objectives, whether that's higher throughput, lower latency, faster time-to-results, or lower TCO. Learn more about: Google Cloud TPU, Google Cloud GPU, Google Cloud Storage, Titanium, the Jupiter network.

Open software

AI Hypercomputer is optimized to support the most common tools and libraries, such as PyTorch and JAX. It also lets customers take advantage of technologies such as Cloud TPU Multislice and multihost configurations, and of managed services like Google Kubernetes Engine, enabling turnkey deployment for common workloads like the NVIDIA NeMo framework orchestrated by Slurm.

Flexible consumption

Our flexible consumption models let customers choose fixed costs with committed use discounts or dynamic, on-demand models to match their business needs. Dynamic Workload Scheduler helps customers obtain the capacity they need without over-allocating, so they pay only for what they use. In addition, Google Cloud's cost optimization tools automate resource utilization, reducing manual tasks for engineers.
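As a rough sketch of how Dynamic Workload Scheduler's queued provisioning can be requested from GKE, a workload can ask for all of its accelerator capacity at once through a ProvisioningRequest object. The resource names and API version below follow the GKE queued-provisioning pattern but may differ by release; treat this as an illustrative fragment, not a complete manifest.

```yaml
# Illustrative sketch: ask Dynamic Workload Scheduler (via GKE queued
# provisioning) for atomic, all-or-nothing accelerator capacity.
# API version, class name, and pod template name are assumptions here.
apiVersion: autoscaling.x-k8s.io/v1beta1
kind: ProvisioningRequest
metadata:
  name: training-capacity
spec:
  provisioningClassName: queued-provisioning.gke.io
  podSets:
  - count: 4                      # provision capacity for 4 training pods together
    podTemplateRef:
      name: training-pod-template # hypothetical PodTemplate defined elsewhere
```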

How It Works

Google is a leader in artificial intelligence, having invented technologies like TensorFlow. Learn about Google's history of innovation in AI infrastructure and how you can leverage it for your own workloads.

[Diagram: Google Cloud AI Hypercomputer architecture]

Common Uses

Run large-scale AI training

Powerful, scalable, and efficient AI training

The AI Hypercomputer architecture offers optionality to use the underlying infrastructure that best scales to meet your training needs.

[Chart: Three charts describing AI growth factors]

Powerful, scalable, and efficient AI training

Measure the effectiveness of your large-scale training the Google way with ML Productivity Goodput.

[Chart: Training speed, TPU v4 (bf16) vs. TPU v5 (int8)]
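At its core, a goodput-style productivity metric is the fraction of wall-clock time that produced useful training progress. The helper below is a minimal, hypothetical sketch of that idea, not the actual ML Productivity Goodput measurement library:

```python
# Minimal sketch of a goodput-style metric (hypothetical helper, not the
# Google Cloud ML Productivity Goodput library): productive time as a
# fraction of total wall-clock time, where "lost" time covers failures,
# restarts, and recomputation back to the last checkpoint.

def goodput(total_hours: float, lost_hours: float) -> float:
    """Return productive time as a fraction of total wall-clock time."""
    if total_hours <= 0:
        raise ValueError("total_hours must be positive")
    return (total_hours - lost_hours) / total_hours

# Example: a 100-hour run that lost 8 hours to a failure and the
# rollback to the last checkpoint achieves 92% goodput.
print(f"{goodput(100.0, 8.0):.0%}")
```

A metric like this makes the payoff of faster checkpointing and fewer interruptions directly measurable.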

Character.AI leverages Google Cloud to scale up

"We need GPUs to generate responses to users' messages. And as we get more users on our platform, we need more GPUs to serve them. So on Google Cloud, we can experiment to find what is the right platform for a particular workload. It's great to have that flexibility to choose which solutions are most valuable." Myle Ott, Founding Engineer, Character.AI

Deliver AI powered applications

Leverage open frameworks to deliver AI powered experiences

Google Cloud is committed to ensuring open frameworks work well within the AI Hypercomputer architecture.

[Diagram: High-level RAG architecture]
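The core of a retrieval-augmented generation (RAG) pipeline is straightforward: embed a query, find the nearest documents, and prepend them to the model prompt. The sketch below illustrates only the retrieval step with made-up embeddings and documents; in a real system the vectors would come from an embedding model and the prompt would go to an LLM.

```python
# Minimal sketch of the retrieval step in a RAG pipeline, assuming
# documents have already been embedded. Vectors here are illustrative
# 2-D stand-ins for real embedding-model outputs.
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def retrieve(query_vec, corpus, k=2):
    """Return the k document texts whose embeddings are closest to the query."""
    ranked = sorted(corpus, key=lambda d: cosine(query_vec, d["vec"]), reverse=True)
    return [d["text"] for d in ranked[:k]]

corpus = [
    {"text": "TPUs accelerate large-scale training.", "vec": [0.9, 0.1]},
    {"text": "Cloud Storage holds training data.",    "vec": [0.1, 0.9]},
    {"text": "GPUs serve low-latency inference.",     "vec": [0.8, 0.3]},
]

# Retrieve context for a query and assemble a grounded prompt.
context = retrieve([1.0, 0.0], corpus)
prompt = "Answer using this context:\n" + "\n".join(context) + "\nQ: ..."
```

Production systems typically swap the linear scan for an approximate nearest-neighbor index, but the data flow stays the same.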

Leverage open frameworks to deliver AI powered experiences

Google Cloud's open software ecosystem allows you to build applications with the tools and frameworks you are most comfortable with, while taking advantage of the price-performance benefits of the AI Hypercomputer architecture.

[Diagram: Google Cloud AI tools and frameworks]

Priceline: Helping travelers curate unique experiences

"Working with Google Cloud to incorporate generative AI allows us to create a bespoke travel concierge within our chatbot. We want our customers to go beyond planning a trip and help them curate their unique travel experience." Martin Brodbeck, CTO, Priceline

Cost efficiently serve models at scale

Maximize price/performance for serving AI at scale

Google Cloud provides industry-leading price/performance for serving AI models, with accelerator optionality to address any workload's needs.

[Diagram: Load balancing based on queue depth]
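The idea behind queue-depth-based load balancing is to route each request to the replica with the fewest queued requests, rather than round-robin, so long-running generations don't pile work up behind one replica. The sketch below is a toy illustration of that policy; replica names and depths are made up.

```python
# Minimal sketch of queue-depth-based load balancing for model serving:
# always route to the replica with the smallest queue. Names and initial
# depths are illustrative, not tied to any specific serving stack.

class Replica:
    def __init__(self, name):
        self.name = name
        self.queue_depth = 0  # requests currently waiting on this replica

def route(replicas):
    """Pick the replica with the smallest queue depth and enqueue on it."""
    target = min(replicas, key=lambda r: r.queue_depth)
    target.queue_depth += 1
    return target

replicas = [Replica("gpu-0"), Replica("gpu-1"), Replica("gpu-2")]
replicas[0].queue_depth = 3   # gpu-0 is busy with a long generation
assigned = [route(replicas).name for _ in range(4)]
print(assigned)  # gpu-1 and gpu-2 absorb the new requests first
```

With round-robin, one of these four requests would have landed behind gpu-0's backlog; queue-depth routing keeps it on an idle replica.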

Maximize price/performance for serving AI at scale

Cloud TPU v5e and G2 VM instances powered by NVIDIA L4 GPUs enable high-performance, cost-effective inference for a wide range of AI workloads, including the latest LLMs and generative AI models. Both offer significant price-performance improvements over their predecessors, and Google Cloud's AI Hypercomputer architecture enables customers to scale their deployments to industry-leading levels.

[Chart: Relative performance per dollar, Cloud TPU v4 and v5e]
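Performance per dollar, the metric behind comparisons like this, is simply measured throughput normalized by hourly price. The numbers below are invented for illustration; they are not Google Cloud benchmarks or list prices.

```python
# Hypothetical sketch of comparing accelerators on performance per dollar:
# normalize each platform's measured throughput by its hourly cost.
# Platform names, throughputs, and prices are made-up placeholders.

def perf_per_dollar(throughput_per_hour: float, dollars_per_hour: float) -> float:
    """Units of work per dollar spent."""
    return throughput_per_hour / dollars_per_hour

platforms = {
    "accel-a": perf_per_dollar(12_000, 4.0),  # e.g. requests served per hour
    "accel-b": perf_per_dollar(9_000, 2.0),
}
best = max(platforms, key=platforms.get)
print(best, platforms[best])  # → accel-b 4500.0
```

Note that the raw-throughput leader (accel-a) is not the price/performance leader, which is why benchmarks for serving are usually reported per dollar rather than in absolute terms.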

AssemblyAI leverages Google Cloud for cost efficiency

"Our experimental results show that Cloud TPU v5e is the most cost-efficient accelerator on which to run large-scale inference for our model. It delivers 2.7x greater performance per dollar than G2 and 4.2x greater performance per dollar than A2 instances." Domenic Donato,

VP of Technology, AssemblyAI



Open source models on Google Cloud

Serve a model with GKE on a single GPU

Train common models with GPUs

Scale model serving to multiple GPUs

Serve an LLM using multi-host TPUs on GKE with Saxml

Train at scale with the NVIDIA NeMo framework
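For the single-GPU serving path above, the essential piece is a Kubernetes Deployment that requests one GPU via the standard `nvidia.com/gpu` resource. The image path and names below are placeholders, not taken from a specific tutorial:

```yaml
# Illustrative sketch: serve a model on a single GPU in GKE.
# Image, names, and replica count are hypothetical placeholders.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: model-server
spec:
  replicas: 1
  selector:
    matchLabels:
      app: model-server
  template:
    metadata:
      labels:
        app: model-server
    spec:
      containers:
      - name: server
        image: us-docker.pkg.dev/PROJECT/repo/model-server:latest  # placeholder image
        resources:
          limits:
            nvidia.com/gpu: 1   # schedule onto a node with one available GPU
```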
