State of AI: An Empirical 100 Trillion Token Study with OpenRouter
[HPP] Anjney MidhaDecember 5, 202515 min
21 connectionsΒ·40 entities in this videoβThe Evolution of LLMs
- π‘ The field has shifted from single-pass pattern generation to multi-step deliberation inference since the release of the
o1model on December 5th, 2024. - π§ This means models can now stop, think, sketch out a plan, and check their own work, acting as analytical partners rather than just prediction engines.
- π An empirical study by OpenRouter analyzed over 100 trillion tokens of real-world LLM interactions, spanning a year up to November 2025.
Key Usage Trends
- π Creative roleplay accounts for over 50% of open-source model token usage, highlighting a preference for creative freedom over commercial safety layers.
- π» Programming is the second largest use case and the dominant driver of complexity, with queries often exceeding 20,000 input tokens for tasks like feeding entire codebases.
- π The "medium market" of models between 15 and 70 billion parameters (e.g.,
Quen 2.5,Coder 32B) is finding optimal "model market fit" for balancing capability and efficiency.
Rise of Agentic Inference
- β‘ Agentic inference is becoming the default, enabling models to plan, reason, and execute complex multi-step workflows.
- π This is evidenced by over 50% of tokens routed through reasoning-optimized models and consistent adoption of tool use (e.g., search engines, code interpreters).
- π The average number of prompt tokens per request has grown fourfold, indicating users are providing vast amounts of context like code and logs.
Market Dynamics & Retention
- π The Cinderella "Glass Slipper" effect describes persistent user retention when a specific high-value workload perfectly matches a model's capabilities.
- π° There is a very weak correlation between price and demand for LLMs; users prioritize a model's ability to solve problems over its cost.
- π The market is becoming increasingly global, with Asia's share of global spend more than doubling and Simplified Chinese becoming the largest non-English language for tokens.
Future Outlook
- π― The industry is moving away from optimizing for cost per token towards optimizing for the cost per successful outcome.
- β The market is highly segmented into archetypes like mass market volume drivers, efficient giants, premium leaders, and premium workload categories.
- π€ A multi-model, pluralistic ecosystem is emerging, where integration and unique capabilities trump cost, forcing providers to differentiate.
Knowledge graph40 entities Β· 21 connections
How they connect
An interactive map of every person, idea, and reference from this conversation. Hover to trace connections, click to explore.
Hover Β· drag to explore
40 entities
Chapters2 moments
Key Moments
Transcript56 segments
Full Transcript
Topics15 themes
Whatβs Discussed
Large Language Models (LLMs)OpenRouterEmpirical StudyReasoning ModelsAgentic InferenceOpen-source ModelsCreative RoleplayProgramming QueriesModel Market FitTool UseUser RetentionPrice ElasticityMarket SegmentationCost Per Successful OutcomeGlobal LLM Market
Smart Objects40 Β· 21 links
ProductsΒ· 6
ConceptsΒ· 24
LocationΒ· 1
MediaΒ· 1
PeopleΒ· 3
CompaniesΒ· 4
EventΒ· 1