The State of Open Models
[HPP] Nathan LambertOctober 16, 202548 min
39 connectionsΒ·40 entities in this videoβThe Shifting Landscape of Open Models
- π‘ The open model ecosystem has seen a significant shift in 2025, with China emerging as a dominant force.
- π Llama's previous dominance has faded, while Chinese models like DeepSeek and Qwen have gained considerable momentum and adoption.
- π The DeepSeek R1 release was a pivotal moment, prompting a recalibration in the American AI ecosystem.
China's Dominance and Strategic Approaches
- π¨π³ Qwen (Alibaba) is rapidly releasing a full-stack offering across various modalities (text, speech, image editing, coding) and actively engaging the community.
- π¬ DeepSeek focuses on cutting-edge AGI research and developing models for enterprise use cases, driven by a world-class research team.
- π A robust ecosystem of Chinese labs (e.g., ZAI, Moonshot AI, Tencent, ByteDance) is growing, with many exploring diverse business models and receiving government funding.
- π Chinese models have surpassed US open models in performance and cumulative adoption, becoming the industry standard.
US Position and Challenges
- β οΈ US efforts like Google's Gemma and AI2's MOO 2 32B are positive but have not yet closed the gap with leading Chinese models.
- πΊπΈ OpenAI's GPT-OSS provides high-level cover for big tech to release open models but lacks consistent follow-through and developer engagement.
- π° Current US investment in open models, such as the NSF grant, is deemed insufficient to compete with frontier models, requiring a significant scale-up.
- π΅οΈββοΈ Many US startups and well-funded labs are using Chinese open-source models (e.g., Qwen) for innovation, often without public acknowledgment.
The Future of Open Models and Risks
- π Open models will persist globally, regardless of US control, as talent and resources for training diffuse across countries.
- π» Within two years, uncensored multimodal models (like a Sora 2 equivalent) will be runnable on personal devices, raising societal concerns.
- π¨ Chinese models may contain propaganda or unprovable vulnerabilities, leading larger American companies to avoid them, while startups prioritize innovation.
- βοΈ The debate between safety and openness is less urgent due to the current performance gap between frontier and open models, allowing for proactive risk assessment.
Recommendations for US Leadership
- π― The US should lead in open models to align with historical values, foster innovation, and prevent the concentration of power.
- π€ There is a need to establish next-generation, scaled-up model centers in the US, potentially through initiatives like the American Truly Open Models (ATOM) project.
- β US efforts should focus on transparency and building trust in model evaluations, leveraging academic connections, and addressing the disadvantage of not using all available data.
Knowledge graph40 entities Β· 39 connections
How they connect
An interactive map of every person, idea, and reference from this conversation. Hover to trace connections, click to explore.
Hover Β· drag to explore
40 entities
Chapters20 moments
Key Moments
Transcript178 segments
Full Transcript
Topics15 themes
Whatβs Discussed
Open ModelsDeepSeekQwenLlamaChinese AI EcosystemUS AI EcosystemGPT-OSSMultimodal ModelsFrontier ModelsAI PolicyHugging FaceModel Performance BenchmarksAI SafetySovereign AIAmerican Truly Open Models (ATOM)
Smart Objects40 Β· 39 links
MediasΒ· 9
CompaniesΒ· 9
ProductsΒ· 6
ConceptsΒ· 12
LocationsΒ· 3
PersonΒ· 1