Nvidia Grace Blackwell NVLink72: 10x Performance & Lowest Cost AI Tokens
[HPP] Jensen Huang · November 20, 2025 · 9 min
Revolutionary Grace Blackwell Architecture
- Nvidia's Grace Blackwell NVLink72 is a product of extreme co-design, representing a ground-up reinvention in modern computing.
- This architecture integrates 72 GPUs into one giant rack, enabling them to function cohesively as a single, powerful unit.
- It handles gigantic AI models efficiently by distributing their many "experts" across its GPUs.
Unprecedented Performance & Efficiency
- Each Grace Blackwell GPU delivers 10 times the performance of the H200, achieved through extreme co-design, not just more transistors.
- This speed difference arises because each NVLink72 GPU handles only four experts, compared with 32 experts per GPU in older systems.
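The experts-per-GPU figures above can be reproduced with simple arithmetic. The following is a minimal sketch, not Nvidia's actual scheduling logic: it assumes experts are split evenly across GPUs, and the model size of 256 experts and the 8-GPU older system are hypothetical values chosen only because they yield the 32 and 4 quoted in the summary.

```python
import math


def experts_per_gpu(num_experts: int, num_gpus: int) -> int:
    """Largest number of experts any one GPU hosts under an even split."""
    if num_experts <= 0 or num_gpus <= 0:
        raise ValueError("num_experts and num_gpus must be positive")
    return math.ceil(num_experts / num_gpus)


# Hypothetical model size; the source does not state how many experts the model has.
NUM_EXPERTS = 256

# Hypothetical 8-GPU system standing in for the "older systems" in the summary.
older = experts_per_gpu(NUM_EXPERTS, num_gpus=8)

# The 72-GPU rack described in the summary.
rack = experts_per_gpu(NUM_EXPERTS, num_gpus=72)

print(older, rack)
```

Under these assumptions the older system hosts 32 experts per GPU and the 72-GPU rack hosts at most 4, so each GPU has far fewer experts to serve.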
Lowest Cost Token Generation
- Despite being the most expensive computer, the Grace Blackwell NVLink72 generates the world's lowest cost tokens.
- Its superior token generation capability leads to 10 times lower cost per token, creating a powerful virtuous cycle of performance and cost efficiency.
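How a more expensive system can still produce cheaper tokens follows from dividing operating cost by throughput. The sketch below illustrates that relationship only; every number in it (hourly costs and tokens per second) is a hypothetical placeholder, not a figure from the talk, picked so that a system costing twice as much per hour but generating 20 times the tokens ends up roughly 10 times cheaper per token.

```python
def cost_per_million_tokens(cost_per_hour: float, tokens_per_second: float) -> float:
    """Operating cost of generating one million tokens."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    tokens_per_hour = tokens_per_second * 3600
    return cost_per_hour / tokens_per_hour * 1_000_000


# Hypothetical baseline system: cheaper to run, lower throughput.
baseline = cost_per_million_tokens(cost_per_hour=10.0, tokens_per_second=1_000.0)

# Hypothetical newer system: twice the hourly cost, twenty times the throughput.
newer = cost_per_million_tokens(cost_per_hour=20.0, tokens_per_second=20_000.0)

# Ratio of baseline cost to newer cost; roughly 10 under these assumptions.
print(baseline / newer)
```

The point of the sketch is that cost per token depends on the ratio of cost to throughput, so a throughput gain that outpaces the price increase lowers the cost of each token.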
Driving Accelerated Computing Shift
- The industry is undergoing a fundamental shift from general-purpose to accelerated computing, independent of AI.
- This shift benefits various tasks like data processing, image processing, and classical machine learning algorithms, making accelerated computing a broad necessity.
- Major Cloud Service Providers (CSPs) are heavily investing in this new architecture to achieve the best Total Cost of Ownership (TCO).
Strategic Market Position
- Nvidia's GPUs uniquely perform all accelerated computing tasks plus AI, unlike specialized ASICs.
- This versatility positions Nvidia at an inflection point, making its architecture a secure and strategic investment for the future of computing.
Topics Discussed
Grace Blackwell NVLink72, AI Models, GPU Technology, Accelerated Computing, Extreme Co-Design, Token Generation, Total Cost of Ownership (TCO), Cloud Service Providers (CSPs), Machine Learning Algorithms, Performance Optimization, Supply Chain, Inflection Point, ASICs, General-Purpose Computing