Google’s TPU 8t and TPU 8i address agentic AI

They are designed specifically for AI training (TPU 8t) and inference (TPU 8i), says the search giant, which developed them with Google DeepMind.
“These two chips are designed to power our custom-built supercomputers, to drive everything from cutting-edge model training and agent development, to massive inference workloads,” writes Amin Vahdat, Google’s Chief Technologist for AI & Infrastructure.
“TPUs have been powering leading foundation models, including Gemini, for years. These 8th generation TPUs together will deliver scale, efficiency and capabilities across training, serving and agentic workloads.”
The announcement was made at Google Cloud Next ’26, but technical details are scarce.
TPU 8t
Google says a single TPU 8t superpod (its term for a customised network of TPU boards) now scales to 9,600 chips and two petabytes of shared high-bandwidth memory, with double the inter-chip bandwidth of the previous generation. The architecture delivers 121 exaflops of compute, it says, and the shared memory lets the most complex models work from a single, massive pool of memory.
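Google does not break those headline figures down per chip, but the arithmetic is straightforward. A quick back-of-envelope check (decimal units; the per-chip splits are our arithmetic, not Google's spec sheet):

```python
# Back-of-envelope split of Google's stated superpod figures.
chips = 9_600
shared_hbm_pb = 2            # petabytes of shared HBM, per Google
pod_exaflops = 121           # pod compute, per Google

hbm_per_chip_gb = shared_hbm_pb * 1e6 / chips    # PB to GB
pflops_per_chip = pod_exaflops * 1e3 / chips     # exaflops to petaflops

print(f"~{hbm_per_chip_gb:.0f}GB of HBM per chip")      # ~208GB
print(f"~{pflops_per_chip:.1f} petaflops per chip")     # ~12.6
```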
It also uses the company’s Virgo Network, an AI-oriented networking system, together with the JAX and Pathways software stack. This, says Google, lets the TPU 8t scale near-linearly to as many as a million chips in a single logical cluster.
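Google has not published TPU 8t code, but JAX is named in the announcement, and the single-logical-cluster model it describes maps onto JAX's existing sharding APIs. A minimal sketch (the mesh axis name and array sizes are illustrative, not TPU 8t specifics):

```python
import jax
import jax.numpy as jnp
from jax.experimental import mesh_utils
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P

# Treat every visible chip as one logical device mesh.
n = jax.device_count()
mesh = Mesh(mesh_utils.create_device_mesh((n,)), axis_names=("data",))

# One big array, physically split across every chip's memory:
# the "single, massive pool of memory" idea in miniature.
x = jax.device_put(jnp.ones((n * 1024, 1024)),
                   NamedSharding(mesh, P("data", None)))

# jit compiles once; XLA partitions the work to match the sharding.
y = jax.jit(jnp.tanh)(x)
print(y.sharding)
```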
A comparison table of the TPU 8t and its predecessor, codenamed Ironwood, appears in Google’s announcement.
TPU 8i
The TPU 8i, aimed at AI inferencing, delivers nearly 3x the compute performance per pod of the previous generation, Google states.
Each TPU 8i pairs 288GB of high-bandwidth memory with 384MB of on-chip SRAM; the SRAM is 3x larger than the previous generation’s and can keep a model’s active working set entirely on-chip.
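Google does not say what precision that working set is held in. For a sense of scale, here is what 384MB of SRAM can hold under a few assumed precisions (the precisions are our assumption, not Google's):

```python
# Rough capacity of 384MB of on-chip SRAM at assumed precisions.
sram_bytes = 384 * 1024**2

for fmt, bytes_per_param in [("bf16", 2), ("int8", 1), ("int4", 0.5)]:
    params_m = sram_bytes / bytes_per_param / 1e6
    print(f"{fmt}: ~{params_m:.0f}M parameters fit on-chip")
# bf16: ~201M, int8: ~403M, int4: ~805M
```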
Google also says it has doubled the number of physical CPU hosts per server, moving to its custom Arm-based Axion CPUs.
“By using non-uniform memory access (NUMA) for isolation, we have optimized the full system for superior performance,” states Vahdat.
For modern Mixture of Experts (MoE) models, Google states it has doubled the inter-chip interconnect (ICI) bandwidth to 19.2 Tb/s. Its new Boardfly architecture aims to “reduce the maximum network diameter by more than 50%, ensuring the system works as one cohesive, low-latency unit”.
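MoE models stress the interconnect because expert routing is an all-to-all exchange: each chip ships most of its tokens to experts on other chips, every layer. A back-of-envelope sketch of the traffic involved (every model dimension below is an illustrative assumption; only the 19.2 Tb/s figure is Google's):

```python
# Illustrative MoE dispatch traffic per chip, per layer.
tokens = 8_192           # tokens resident on one chip (assumed)
hidden = 8_192           # model hidden dimension (assumed)
bytes_per_elem = 2       # bf16 activations (assumed)
top_k = 2                # experts per token (assumed)

# Dispatch plus combine, worst case where every routed copy
# leaves the chip.
bytes_moved = tokens * hidden * bytes_per_elem * top_k * 2

ici_bits_per_s = 19.2e12  # Google's quoted ICI bandwidth
seconds = bytes_moved * 8 / ici_bits_per_s
print(f"{bytes_moved / 1e9:.2f}GB per layer, "
      f"~{seconds * 1e6:.0f}us on the wire")   # 0.54GB, ~224us
```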
A new on-chip Collectives Acceleration Engine (CAE) offloads global operations, reducing their latency by up to 5x.
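Collectives are the global operations in question: all-reduce-style exchanges where every chip contributes a value and every chip receives the combined result. In JAX terms the pattern looks like this (a sketch of the operation class only; the CAE is hardware, and the announcement does not describe any new API):

```python
import functools

import jax
import jax.numpy as jnp

n = jax.local_device_count()

@functools.partial(jax.pmap, axis_name="chips")
def global_sum(x):
    # Every chip contributes its shard and receives the total back;
    # this all-reduce is the class of operation a CAE accelerates.
    return jax.lax.psum(x, axis_name="chips")

out = global_sum(jnp.arange(float(n)).reshape(n, 1))
print(out)  # each row holds the same global sum
```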
You can read more in this Google blog post.
TPU2
Back in 2017 Google announced its second-generation TPU, the TPU2, delivering a now relatively modest 45 teraflops per chip.
A system board with four TPU2s delivered 180 teraflops, and a customised network of 64 boards, called a TPU pod, 11.5 petaflops.
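Those figures compose directly (the pod number rounds to the quoted 11.5 petaflops):

```python
# TPU2 arithmetic: 4 chips per board, 64 boards per pod.
tflops_per_chip = 45
board_tflops = 4 * tflops_per_chip      # 180 teraflops
pod_pflops = 64 * board_tflops / 1000   # 11.52 petaflops
print(board_tflops, pod_pflops)
```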