Stanford
University
  • Stanford Home
  • Maps & Directions
  • Search Stanford
  • Emergency Info
  • Terms of Use
  • Privacy
  • Copyright
  • Trademarks
  • Non-Discrimination
  • Accessibility
© Stanford University.  Stanford, California 94305.
Beyond DeepSeek: China's Diverse Open-Weight AI Ecosystem and Its Policy Implications | Stanford HAI

Stay Up To Date

Get the latest news, advances in research, policy work, and education program updates from HAI in your inbox weekly.

Sign Up For Latest News

Navigate
  • About
  • Events
  • Careers
  • Search
Participate
  • Get Involved
  • Support HAI
  • Contact Us
Skip to content
  • About

    • About
    • People
    • Get Involved with HAI
    • Support HAI
    • Subscribe to Email
  • Research

    • Research
    • Fellowship Programs
    • Grants
    • Student Affinity Groups
    • Centers & Labs
    • Research Publications
    • Research Partners
  • Education

    • Education
    • Executive and Professional Education
    • Government and Policymakers
    • K-12
    • Stanford Students
  • Policy

    • Policy
    • Policy Publications
    • Policymaker Education
    • Student Opportunities
  • AI Index

    • AI Index
    • AI Index Report
    • Global Vibrancy Tool
    • People
  • News
  • Events
  • Industry
  • Centers & Labs
policyIssue Brief

Beyond DeepSeek: China's Diverse Open-Weight AI Ecosystem and Its Policy Implications

Date
December 16, 2025
Topics
Foundation Models
International Affairs, International Security, International Development
Read Paper
abstract

Almost one year after the “DeepSeek moment,” this brief analyzes China’s diverse open-model ecosystem and examines the policy implications of their widespread global diffusion.

In collaboration with

Executive Summary

  • After years of lagging behind, Chinese AI models — especially open-weight LLMs — seem to have caught up or even pulled ahead of their global counterparts in advanced AI model capabilities and adoption.

  • We profile and compare the capabilities and distinct features of four notable Chinese open-weight language model families, highlighting that China’s ecosystem of open-weight LLMs is driven by a wide range of actors who are prioritizing the development of computationally efficient models optimized for flexible downstream deployment.

  • Diverse commercial strategies for translating open-weight model adoption into business success are emerging, yet their long-term viability remains uncertain.

  • The Chinese government’s support of open-weight model development — while not the sole determinant of its success — has played a substantial role, though there is no guarantee it will continue.

  • The widespread global adoption of Chinese open-weight models may reshape global technology access and reliance patterns, and impact AI governance, safety, and competition. Policymakers should ground their policy actions in a granular understanding of real-world deployment.

Introduction

In 2022, China’s AI developer community faced dual shocks from the United States. In October, the U.S. government imposed unilateral export controls on semiconductor manufacturing equipment and the most powerful chips for large language model (LLM) training. The following month, OpenAI brought state-of-the-art LLM technology to broad public attention with the launch of ChatGPT. Chinese commentators, noting the government launched a comprehensive plan for AI development five years earlier, asked why breakthroughs were not happening in China, and how Chinese developers could compete with the United States.

As Chinese labs scrambled to develop and release their own models, open-weight LLMs quickly took center stage. Initially, many labs built their models on top of Meta’s open-weight Llama model and its architecture. But China’s state of the art would not be reliant on U.S.-trained models for long. A little more than two years after ChatGPT took many in China by surprise, a Hangzhou-based startup called DeepSeek released its R1 reasoning model, demonstrating capability and efficiency that shocked global investors and caused the AI chip leader Nvidia to suffer the biggest single-day loss of any company in U.S. stock market history. While keen analysts had monitored DeepSeek’s early development, and some of its methods were known to AI scientists around the world, the broader U.S. discourse did not see the lab’s success coming.

Now the strength of Chinese developers in open-weight language models is more widely recognized. DeepSeek has upgraded its models and promises further releases. Alibaba’s Qwen models, which were in development long before DeepSeek’s breakout moment, are widely used by developers around the world. Another Chinese tech giant, Baidu, which had been pursuing a closed-model strategy, has turned to releasing the weights of some of its flagship models openly. Today, Chinese-made open-weight models are unavoidable in the global competitive AI landscape. 

This brief analyzes China’s diverse open-model ecosystem, looking beyond DeepSeek. At a moment when Chinese models are increasingly being adopted around the world, including in the United States, and when policymakers are weighing measures to restrict who can build at the AI cutting edge, we pause to give context to the technical and commercial realities of leading Chinese AI labs against a constantly evolving geopolitical backdrop. We then dive deeper into the implications of Chinese open-weight model diffusion for technological development and policy interests in China, the United States, and the rest of the world — implications that raise a series of policy issues that policymakers, scholars, and AI developers might consider going forward. We focus on language models but note the strong performance of other kinds of open-weight AI such as speech, visual, or video models, which some developers integrate with their leading language models.

Read Paper
Share
Link copied to clipboard!
Authors
  • headshot
    Caroline Meinhardt
  • Sabina Nong
    Sabina Nong
  • Graham Webster
    Graham Webster
  • Tatsunori Hashimoto
    Tatsunori Hashimoto
  • Chris Manning headshot
    Christopher Manning

Related Publications

Moving Beyond the Term "Global South" in AI Ethics and Policy
Evani Radiya-Dixit, Angèle Christin
Quick ReadNov 19, 2025
Issue Brief

This brief examines the limitations of the term "Global South" in AI ethics and policy, and highlights the importance of grounding such work in specific regions and power structures.

Issue Brief

Moving Beyond the Term "Global South" in AI Ethics and Policy

Evani Radiya-Dixit, Angèle Christin
Ethics, Equity, InclusionInternational Affairs, International Security, International DevelopmentQuick ReadNov 19

This brief examines the limitations of the term "Global South" in AI ethics and policy, and highlights the importance of grounding such work in specific regions and power structures.

Yejin Choi’s Briefing to the United Nations Security Council
Yejin Choi
Quick ReadSep 24, 2025
Testimony

In this address, presented to the United Nations Security Council meeting on "Maintenance of International Peace and Security," Yejin Choi calls on the global scientific and policy communities to expand the AI frontier for all by pursuing intelligence that is not only powerful, but also accessible, robust, and efficient. She stresses the need to rethink our dependence on massive-scale data and computing resources from the outset, and design methods that do more with less — by building AI that is smaller and serves all communities.

Testimony

Yejin Choi’s Briefing to the United Nations Security Council

Yejin Choi
International Affairs, International Security, International DevelopmentQuick ReadSep 24

In this address, presented to the United Nations Security Council meeting on "Maintenance of International Peace and Security," Yejin Choi calls on the global scientific and policy communities to expand the AI frontier for all by pursuing intelligence that is not only powerful, but also accessible, robust, and efficient. She stresses the need to rethink our dependence on massive-scale data and computing resources from the outset, and design methods that do more with less — by building AI that is smaller and serves all communities.

Validating Claims About AI: A Policymaker’s Guide
Olawale Salaudeen, Anka Reuel, Angelina Wang, Sanmi Koyejo
Quick ReadSep 24, 2025
Policy Brief

This brief proposes a practical validation framework to help policymakers separate legitimate claims about AI systems from unsupported claims.

Policy Brief

Validating Claims About AI: A Policymaker’s Guide

Olawale Salaudeen, Anka Reuel, Angelina Wang, Sanmi Koyejo
Foundation ModelsPrivacy, Safety, SecurityQuick ReadSep 24

This brief proposes a practical validation framework to help policymakers separate legitimate claims about AI systems from unsupported claims.

Policy Implications of DeepSeek AI’s Talent Base
Amy Zegart, Emerson Johnston
Quick ReadMay 06, 2025
Policy Brief

This brief presents an analysis of Chinese AI startup DeepSeek’s talent base and calls for U.S. policymakers to reinvest in competing to attract and retain global AI talent.

Policy Brief

Policy Implications of DeepSeek AI’s Talent Base

Amy Zegart, Emerson Johnston
International Affairs, International Security, International DevelopmentFoundation ModelsWorkforce, LaborQuick ReadMay 06

This brief presents an analysis of Chinese AI startup DeepSeek’s talent base and calls for U.S. policymakers to reinvest in competing to attract and retain global AI talent.