AI News Hub Logo

AI News Hub

NVIDIA and Google infrastructure cuts AI inference costs

AI News
Ryan Daws

At the Google Cloud Next conference, Google and NVIDIA outlined their hardware roadmap designed to address the cost of AI inference at scale. The companies detailed the new A5X bare-metal instances, which run on NVIDIA Vera Rubin NVL72 rack-scale systems. Through hardware and software codesign, this architecture aims to deliver up to ten times lower […] The post NVIDIA and Google infrastructure cuts AI inference costs appeared first on AI News.