Skip to main content

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Subset Neural Networks

[HPP] Jonathan FrankleNovember 11, 20255 min
9 connections·17 entities in this video

Understanding the Lottery Ticket Hypothesis

  • 💡 The Lottery Ticket Hypothesis (LTH) proposes that large, randomly initialized neural networks contain "winning tickets"—sparse subnetworks that, when trained in isolation, can match the performance of the full network.
  • 🎯 The core claim is that a trained dense network holds a subnetwork (potentially 5-20% of original parameters) that would achieve comparable accuracy if trained independently from its initial random state.

The Iterative Pruning Algorithm

  • 🛠️ Winning tickets are identified using an iterative pruning algorithm which involves training a network, pruning the smallest weights, and crucially, resetting the remaining weights to their initial values.
  • 🔑 This process is repeated until a target sparsity is reached, emphasizing that the initialization of weights is key, not just the architecture.

Key Experimental Findings

  • 🚀 Winning tickets consistently learn faster and achieve higher test accuracy than networks with random reinitialization of the same sparse architecture, often generalizing better.
  • 📊 These efficient subnetworks are typically found at 10% to 20% of the original network size and are effective across various datasets and architectures.
  • ✅ The combination of architecture and initialization is critical for the success and efficiency of these winning tickets.

Implications for Neural Network Training

  • ⚡ LTH has major implications for training efficiency, suggesting that finding winning tickets early could lead to significantly faster training times.
  • 🧠 It also informs architecture design, as understanding the structure of winning tickets can guide the creation of better networks from scratch.
  • 🔍 The hypothesis encourages a deeper theoretical understanding of why these winning tickets exist and what makes certain initializations
Knowledge graph17 entities · 9 connections

How they connect

An interactive map of every person, idea, and reference from this conversation. Hover to trace connections, click to explore.

Hover · drag to explore
17 entities
Chapters3 moments

Key Moments

Transcript21 segments

Full Transcript

Topics12 themes

What’s Discussed

Lottery Ticket HypothesisNeural NetworksSparse SubnetworksNetwork PruningWeight InitializationIterative Pruning AlgorithmTraining EfficiencyArchitecture DesignLearning Rate Warm-upOverparameterizationLoss LandscapeDeep Networks
Smart Objects17 · 9 links
Person· 1
Concepts· 12
Medias· 3
Event· 1