Inside Canada's Fastest Supercomputer: Direct Die Liquid Cooling & 82 Million Dollar Build
Linus Tech TipsOctober 16, 202519 min1,004,766 views
23 connections·40 entities in this video→Supercomputer Overview and Scale
- 🚀 The video explores Canada's fastest supercomputer at Simon Fraser University, an $82 million CAD (approx. $60 million USD) facility.
- 💡 This powerhouse is designed to support research in fields ranging from AI to zoology for tens of thousands of scientists.
- 🧠 The supercomputer features a massive scale with 165,000 CPU cores and 20 million GPU cores, complemented by a petabyte of RAM.
GPU Compute Nodes and Cooling
- ⚡ A significant portion of the supercomputer comprises 640 Nvidia H100 GB GPUs, each costing around $31,000 USD.
- 💧 The nodes utilize a direct die liquid cooling system, achieving over 90% heat capture, a substantial improvement over traditional methods.
- 🧊 Every component, including CPUs, GPUs, VRMs, network interfaces, and even system memory, is directly liquid-cooled, eliminating the need for fans.
- 🌡️ The cooling loop involves complex manifold systems for GPUs, networking boards, power delivery, RAM, and CPUs, with specific routing for optimal heat dissipation.
CPU Nodes and Networking
- ⚙️ The supercomputer also includes 192-core CPU nodes (Zen 5 EPYC Turing cores) with 768 GB of memory per node, totaling nearly 400 cores per 1U unit.
- 🌐 Networking is handled by 200 gigabit connections with NDR, dynamically shared between nodes, and inter-rack connections of 800 gigabit for island clusters.
- 💾 Specialized nodes include 49 PB of spinning rust storage and 2 PB of NVMe storage, with some nodes offering up to 8 TB of RAM for specific jobs.
Data Center Infrastructure and Cooling Systems
- 🏭 The data center infrastructure includes racks fed by dual three-phase 60 amp feeds, capable of up to 70,000 watts per rack.
- 💧 A sophisticated cooling distribution system (CDU) from Vertiv manages up to 600,000 watts of cooling capacity per unit, using a liquid-to-liquid heat exchanger.
- 💨 Evaporative cooling towers provide the primary chilled water supply, augmented by mechanical chillers for times when ambient temperatures exceed 33°C.
- 🔊 Acoustic damping is applied to the cooling towers, and the facility's two-foot-thick concrete floor deflects half an inch under the weight of the equipment.
Funding and Partnerships
- 💰 The $82 million CAD budget was funded through contributions from the Digital Research Alliance of Canada, BCKDF, and vendor contributions (significant discounts).
- 🤝 Key partners and contributors mentioned include MSI, Lenovo, DDN, and Vertiv for cooling solutions.
- 💧 The facility uses AquaM tubing for its coolant lines, chosen for its durability and resistance to rust compared to stainless steel.
Knowledge graph40 entities · 23 connections
How they connect
An interactive map of every person, idea, and reference from this conversation. Hover to trace connections, click to explore.
Hover · drag to explore
40 entities
Chapters8 moments
Key Moments
Transcript72 segments
Full Transcript
Topics15 themes
What’s Discussed
SupercomputerDirect Die Liquid CoolingNvidia H100EPYC Genoa CPUsHBM3 MemoryData CenterHigh-Performance ComputingLiquid CoolingCooling Distribution Unit (CDU)Evaporative Cooling TowersNVMe Storage800Gbit NDRSimon Fraser UniversityAI ResearchGenomics Research
Smart Objects40 · 23 links
Companies· 4
Concepts· 15
Products· 20
Location· 1