Agentic AI, AI and Us, AI Business Strategy, AI Hardware & Chips, AI in Action, AI Market Trends, Blackwell, computer-vision, Data Engineering & MLOps, data sovereignty, digital twins, Environment & Sustainability, Featured News, Features, gemini, Google Cloud, governance, Governance, Regulation & Policy, How It Works, Inference, Infrastructure & Hardware, Inside AI, multimodal-ai, Natural Language Processing (NLP), nvidia, Omniverse, Open-Source & Democratised AI, Physical AI, reinforcement-learning, robotics

NVIDIA and Google infrastructure cuts AI inference costs

At the Google Cloud Next conference, Google and NVIDIA outlined their hardware roadmap designed to address the cost of AI inference at scale. The companies detailed the new A5X bare-metal instances, which run on NVIDIA Vera Rubin NVL72 rack-scale systems. Through hardware and software codesign, this architecture aims to deliver up to ten times lower […]

The post NVIDIA and Google infrastructure cuts AI inference costs appeared first on AI News.