Residual Networks (ResNets): The Breakthrough That Fixed Deep Learning

[HPP] Kaiming He · December 16, 2025 · 11 min

The Deep Learning Degradation Problem

⚠️ In the mid-2010s, stacking more layers onto an already deep neural network paradoxically made it perform worse, a phenomenon known as the degradation problem.
📉 Overfitting was not the cause, because the deeper networks had higher training error as well as higher test error. Vanishing gradients were not the main cause either, since normalized initialization and batch normalization had largely addressed them. The failure was one of optimization: the solver could not find good weights for the deeper network.
🧠 For instance, a 56-layer plain convolutional network had higher training and test error on CIFAR-10 than its 20-layer counterpart, challenging the intuition that more depth means better performance.

The Breakthrough of Residual Learning

💡 Researchers at Microsoft Research proposed a different formulation: instead of having a stack of layers learn the full mapping h(x) directly, have it learn only the residual, the difference f(x) = h(x) - x.
🔑 The output of the stack then becomes f(x) + x, so the layers only need to learn a small adjustment to the input rather than re-learning the whole representation. If the best mapping is close to the identity, the layers just need to push f(x) toward zero.
🌱 This concept gave birth to the Residual Network (ResNet), a pivotal architecture in machine learning history.
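
The reformulation above can be sketched in a few lines of plain Python. This is a toy illustration, not the authors' implementation: the residual branch f(x) is stood in for by two small fully connected layers with a ReLU between them instead of convolutions, and batch normalization and the ReLU usually applied after the addition are left out. The names `relu`, `matvec`, `residual_branch`, `residual_block`, and `plain_block` are hypothetical. The point it shows is that with all branch weights at zero, a residual block passes its input through unchanged, while a plain block with the same weights outputs zeros.

```python
def relu(v):
    """Element-wise ReLU on a list of floats."""
    return [max(0.0, a) for a in v]


def matvec(W, v):
    """Multiply matrix W (a list of rows) by vector v."""
    return [sum(w * a for w, a in zip(row, v)) for row in W]


def residual_branch(x, W1, W2):
    """f(x): two linear layers with a ReLU in between (assumed stand-in for conv layers)."""
    return matvec(W2, relu(matvec(W1, x)))


def residual_block(x, W1, W2):
    """Output = f(x) + x: the branch output plus the untouched input."""
    fx = residual_branch(x, W1, W2)
    return [f + a for f, a in zip(fx, x)]


def plain_block(x, W1, W2):
    """Output = f(x): the same layers without the skip connection."""
    return residual_branch(x, W1, W2)


x = [1.0, -2.0, 0.5]
zeros = [[0.0] * 3 for _ in range(3)]

print(residual_block(x, zeros, zeros))  # [1.0, -2.0, 0.5]
print(plain_block(x, zeros, zeros))     # [0.0, 0.0, 0.0]
```

For a plain block to behave as the identity, its weights would have to be tuned to reproduce x exactly; for the residual block, doing nothing is the default.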

How Skip Connections Transformed Depth

🚀 The core of ResNets is the skip connection (or identity shortcut), which directly adds the original input x to the output of the convolutional layers f(x).
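✅ This simple addition creates a direct path through the network: in the forward pass the input reaches later layers unchanged, and in the backward pass the gradient flows through the addition without being scaled by the layer weights, so it does not shrink away even in very deep networks.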
📈 With skip connections, adding more layers improved performance rather than hurting it, enabling the training of networks with hundreds or even thousands of layers.
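
The effect on gradients can be illustrated with a deliberately simplified scalar model. The sketch below assumes each layer is a single linear scalar operation, which real networks are not: a plain layer computes y = w * x, so it contributes a factor w to the gradient, while a residual layer computes y = x + w * x, so it contributes a factor 1 + w. The function names and the choice of depth and weight are hypothetical, picked only to make the contrast visible.

```python
def plain_gradient(weights):
    """d(output)/d(input) for a chain of layers y = w * x: the product of the weights."""
    grad = 1.0
    for w in weights:
        grad *= w
    return grad


def residual_gradient(weights):
    """d(output)/d(input) for a chain of layers y = x + w * x: the product of (1 + w)."""
    grad = 1.0
    for w in weights:
        grad *= 1.0 + w
    return grad


depth = 50
weights = [0.01] * depth  # small weights, as in a freshly initialized toy model

print(plain_gradient(weights))     # on the order of 1e-100
print(residual_gradient(weights))  # roughly 1.64
```

With small weights the plain chain's gradient collapses toward zero as depth grows, while the identity term keeps each residual factor near 1. Real layers have Jacobian matrices and nonlinearities rather than scalar weights, so this is only an intuition for why the shortcut helps, not a proof.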

ResNet's Impact and Achievements

🏆 An ensemble of ResNets up to 152 layers deep won the ImageNet 2015 (ILSVRC 2015) classification competition with a 3.57% top-five error rate, below the roughly 5% often cited for human annotators.
📊 ResNets also significantly improved object detection: simply replacing the backbone network of Faster R-CNN with a ResNet raised accuracy on Pascal VOC and MS COCO.
🔬 Analysis of layer responses showed that ResNets generally produce smaller responses than their plain counterparts, and deeper ResNets smaller still, which is consistent with each layer making a small refinement to its input.

Legacy and Future Influence

🌐 The idea of shortcut connections and residual pathways is now fundamental, influencing later architectures such as DenseNet, ResNeXt, SENet, and EfficientNet, as well as Transformers, which wrap their attention and feed-forward sublayers in residual connections.
🌟 ResNets didn't just solve a technical problem; they opened a new frontier for deep learning, making today's massive AI systems possible.
✍️ The core concept, Output = f(x) + x, is a simple yet profoundly transformative equation that reshaped the understanding and development of neural networks.

What's Discussed

Deep Neural Networks, Degradation Problem, Residual Networks (ResNets), Residual Learning, Skip Connections, ImageNet Competition, Computer Vision, Optimization, Object Detection, Bottleneck Design, Gradients, Overfitting, Batch Normalization, Transformers, Convolutional Networks
