Skip to main content

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Subset Neural Networks

[HPP] Jonathan FrankleNovember 11, 20255 min
9 connections·17 entities in this video→

Understanding the Lottery Ticket Hypothesis

  • πŸ’‘ The Lottery Ticket Hypothesis (LTH) proposes that large, randomly initialized neural networks contain "winning tickets"β€”sparse subnetworks that, when trained in isolation, can match the performance of the full network.
  • 🎯 The core claim is that a trained dense network holds a subnetwork (potentially 5-20% of original parameters) that would achieve comparable accuracy if trained independently from its initial random state.

The Iterative Pruning Algorithm

  • πŸ› οΈ Winning tickets are identified using an iterative pruning algorithm which involves training a network, pruning the smallest weights, and crucially, resetting the remaining weights to their initial values.
  • πŸ”‘ This process is repeated until a target sparsity is reached, emphasizing that the initialization of weights is key, not just the architecture.

Key Experimental Findings

  • πŸš€ Winning tickets consistently learn faster and achieve higher test accuracy than networks with random reinitialization of the same sparse architecture, often generalizing better.
  • πŸ“Š These efficient subnetworks are typically found at 10% to 20% of the original network size and are effective across various datasets and architectures.
  • βœ… The combination of architecture and initialization is critical for the success and efficiency of these winning tickets.

Implications for Neural Network Training

  • ⚑ LTH has major implications for training efficiency, suggesting that finding winning tickets early could lead to significantly faster training times.
  • 🧠 It also informs architecture design, as understanding the structure of winning tickets can guide the creation of better networks from scratch.
  • πŸ” The hypothesis encourages a deeper theoretical understanding of why these winning tickets exist and what makes certain initializations
Knowledge graph17 entities Β· 9 connections

How they connect

An interactive map of every person, idea, and reference from this conversation. Hover to trace connections, click to explore.

Hover Β· drag to explore
17 entities
Chapters3 moments

Key Moments

Transcript21 segments

Full Transcript

Topics12 themes

What’s Discussed

Lottery Ticket HypothesisNeural NetworksSparse SubnetworksNetwork PruningWeight InitializationIterative Pruning AlgorithmTraining EfficiencyArchitecture DesignLearning Rate Warm-upOverparameterizationLoss LandscapeDeep Networks
Smart Objects17 Β· 9 links
PersonΒ· 1
ConceptsΒ· 12
MediasΒ· 3
EventΒ· 1