AI Hypercomputer updates from Google Cloud Next 25

Share This Post


Our AI Hypercomputer underpins our Cloud customers’ most demanding AI workloads. Its hardware and software layers are optimized to deliver more intelligence per dollar for training and inference.

Today at Google Cloud Next 25, we introduced updates throughout the AI Hypercomputer stack:

  • AI-optimized hardware: Our new seventh-generation TPU, Ironwood, is designed specifically for thinking and inferential AI models. Ironwood offers five times more peak compute capacity and six times the high-bandwidth memory (HBM) capacity compared to the prior-generation TPU.
  • Software advances for inference: Updates to our AI Hypercomputer’s software layer help developers optimize compute resources, while speeding up AI workloads. These advances are shortening the time between training and inference.
  • Flexible consumption options: There are more ways for businesses to control costs with flexible consumption models in Dynamic Workload Scheduler.

Learn more about these AI infrastructure updates on the Google Cloud blog.



Source link

spot_img

Related Posts

The future of engineering belongs to those who build with AI, not without it

Join our daily and weekly newsletters for the...

Scientists Gene-Hack Spider to Produce Bright-Red Silk

Researchers used the popular gene-editing technique CRISPR to...

Meta to handover most of product risk assessments to AI

Mark Zuckerberg's Meta is planning to automate risk...

Samsung Galaxy Buds Explained: Which One is Right for You?

When it comes to wireless earbuds, Samsung's Galaxy...

Micro Center nerd store fills the Fry’s vacuum with its return to Silicon Valley

Silicon Valley nerds have been lonelier since Fry’s...

What I think the Apple Games app needs to work – and why it won’t

The rumour mill is frothing about a dedicated...
spot_img