Modular AI Models: Adding Languages Like Lego Blocks with Pathway's Adrian Kosowski

Super Data Science: ML & AI Podcast with Jon KrohnOctober 13, 20258 min423 views

18 connections·30 entities in this video→

Capture as you watch

Save any video to veridive in one click.

The free veridive Chrome extension pulls the transcript from any YouTube video or podcast you're watching — ready to ask, cite, and connect.

Modular AI Architecture: BDH

💡 The BDH architecture allows for the concatenation of separate language models, such as English and French, to create a multilingual model with sparse activation.
🧠 This approach contrasts with transformers, where connecting models is not straightforward, offering a simpler scaling dimension.

Performance and Scale

🚀 BDH models, like the 1 billion parameter "baby dragon hatchling" model, perform comparably to or outperform existing models of similar scale, such as GPT-2.
⚙️ A key advantage is the energy and compute efficiency achieved by the BDH architecture.
🧪 The focus on moderate scale (1B parameters) facilitates ease and speed of experimentation, particularly for instruction following and basic language model capabilities.

Reasoning Models and Future Potential

🎯 The most promising avenue for BDH is in developing reasoning models that involve multiple phases of refinement and accuracy checks.
📈 These models are adept at working with contextualized inputs and can process vast amounts of data, potentially billions of tokens.
💻 A key use case is an AI coding assistant that can understand and operate within large codebases, internalizing existing code before generating new contributions.
📚 The architecture can ingest and make sense of large datasets, such as private enterprise documentation, in a matter of minutes.

Ask, don't scrub

Discover the spoken web.

veridive answers questions with exact timestamps and citations — across every podcast, video, and article you've saved.

Knowledge graph30 entities · 18 connections

How they connect

An interactive map of every person, idea, and reference from this conversation. Hover to trace connections, click to explore.

Hover · drag to explore

30 entities

Chapters4 moments

Key Moments

Transcript29 segments

Full Transcript

Follow the thread

Find every place these ideas show up.

veridive maps the same people, claims, and topics across thousands of sources — so you can trace an idea from one conversation to the next.

Topics13 themes

What’s Discussed

Modular AI ModelsBDH ArchitecturePathway AIMultilingual ModelsSparse ActivationTransformer LimitationsReasoning ModelsLarge Language ModelsCompute EfficiencyEnergy EfficiencyContextualized InputsAI Coding AssistantScalability

Smart Objects30 · 18 links

Concepts· 20

Products· 6

Medias· 3

Company· 1

Hours of content, seconds to the answer.

Save what you listen to. Ask it anything. Watch the threads between sources surface on their own.

Get started free