The Lottery Ticket Hypothesis: Finding Sparse, Trainable Subset Neural Networks
[HPP] Jonathan FrankleNovember 11, 20255 min
9 connections·17 entities in this video→Understanding the Lottery Ticket Hypothesis
- 💡 The Lottery Ticket Hypothesis (LTH) proposes that large, randomly initialized neural networks contain "winning tickets"—sparse subnetworks that, when trained in isolation, can match the performance of the full network.
- 🎯 The core claim is that a trained dense network holds a subnetwork (potentially 5-20% of original parameters) that would achieve comparable accuracy if trained independently from its initial random state.
The Iterative Pruning Algorithm
- 🛠️ Winning tickets are identified using an iterative pruning algorithm which involves training a network, pruning the smallest weights, and crucially, resetting the remaining weights to their initial values.
- 🔑 This process is repeated until a target sparsity is reached, emphasizing that the initialization of weights is key, not just the architecture.
Key Experimental Findings
- 🚀 Winning tickets consistently learn faster and achieve higher test accuracy than networks with random reinitialization of the same sparse architecture, often generalizing better.
- 📊 These efficient subnetworks are typically found at 10% to 20% of the original network size and are effective across various datasets and architectures.
- ✅ The combination of architecture and initialization is critical for the success and efficiency of these winning tickets.
Implications for Neural Network Training
- ⚡ LTH has major implications for training efficiency, suggesting that finding winning tickets early could lead to significantly faster training times.
- 🧠 It also informs architecture design, as understanding the structure of winning tickets can guide the creation of better networks from scratch.
- 🔍 The hypothesis encourages a deeper theoretical understanding of why these winning tickets exist and what makes certain initializations
Knowledge graph17 entities · 9 connections
How they connect
An interactive map of every person, idea, and reference from this conversation. Hover to trace connections, click to explore.
Hover · drag to explore
17 entities
Chapters3 moments
Key Moments
Transcript21 segments
Full Transcript
Topics12 themes
What’s Discussed
Lottery Ticket HypothesisNeural NetworksSparse SubnetworksNetwork PruningWeight InitializationIterative Pruning AlgorithmTraining EfficiencyArchitecture DesignLearning Rate Warm-upOverparameterizationLoss LandscapeDeep Networks
Smart Objects17 · 9 links
Person· 1
Concepts· 12
Medias· 3
Event· 1