NVIDIA Unveils GB300 NVL72 with 50x Performance-Per-Watt Improvement Over Hopper
Key Takeaways
- NVIDIA's GB300 NVL72 delivers 50x better performance per watt compared to the Hopper platform
- The new system reduces inference costs by 35x per million tokens, significantly lowering operational expenses for AI deployments
- These improvements address critical industry concerns around energy efficiency and the economic viability of large-scale AI inference
Summary
NVIDIA has announced its GB300 NVL72 system, claiming a substantial generational leap over the previous Hopper platform: 50 times better performance per watt and a 35-fold reduction in cost per million tokens, which the company positions as a major advance in AI inference capability.
The GB300 NVL72 represents NVIDIA's continued focus on optimizing inference performance, a critical component for deploying AI models at scale. The dramatic improvements in both energy efficiency and cost efficiency address two of the most pressing challenges facing enterprises running large language models and other AI workloads in production environments.
These efficiency gains come at a crucial time as AI inference costs have become a significant concern for companies deploying LLMs and other generative AI applications. The 35x reduction in cost per million tokens could dramatically lower the barrier to entry for AI adoption across industries, while the 50x improvement in performance per watt addresses growing concerns about the environmental impact and operational costs of AI infrastructure.
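To make the cost claim concrete, the sketch below shows how cost per million tokens follows from sustained throughput and an all-in hourly system cost. The throughput and dollar figures are hypothetical placeholders, not NVIDIA's numbers; the point is only that a 35x throughput gain at the same hourly cost translates directly into a 35x lower cost per million tokens.

```python
def cost_per_million_tokens(tokens_per_second: float,
                            system_cost_per_hour: float) -> float:
    """USD cost to generate one million tokens, given sustained
    throughput and an all-in hourly system cost (hypothetical inputs)."""
    tokens_per_hour = tokens_per_second * 3600
    return system_cost_per_hour / tokens_per_hour * 1_000_000

# Hypothetical illustration (figures are assumptions, not vendor data):
# same hourly cost, 35x the throughput -> 35x lower cost per million tokens.
baseline = cost_per_million_tokens(tokens_per_second=1_000,
                                   system_cost_per_hour=98.0)
improved = cost_per_million_tokens(tokens_per_second=35_000,
                                   system_cost_per_hour=98.0)
print(round(baseline / improved, 1))  # → 35.0
```

In practice the hourly cost would bundle amortized hardware, power, cooling, and facility overhead, so realized savings depend on how those inputs shift between generations, not on throughput alone.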
Editorial Opinion
NVIDIA's claimed improvements, if validated in real-world deployments, could fundamentally reshape the economics of AI inference. The 35x cost reduction per million tokens is particularly significant, as inference cost has emerged as a major barrier to widespread LLM adoption. However, such figures typically reflect peak performance under optimal conditions, and enterprise deployments may see more modest gains depending on workloads, batch sizes, and system configurations.