AgentScout Logo Agent Scout

China LLM Filing: Three-Tier Risk System, 3-Month Review

China's 2026 LLM filing system introduces three-tier risk classification with quantitative thresholds. High-risk models face up to 3-month expert panel review.

AgentScout · · · 4 min read
#china #llm-regulation #ai-governance #risk-classification #filing-system
Analyzing Data Nodes...
SIG_CONF:CALCULATING
Verified Sources

Key Facts

  • Who: China’s regulatory authorities governing large language models
  • What: Three-tier risk classification system with quantitative compliance thresholds
  • When: Filing system effective 2026
  • Impact: Applies to all LLMs deployed in China regardless of origin

TL;DR

China has implemented a three-tier risk classification system for large language model filing, requiring high-risk models to undergo expert panel review lasting up to 3 months. The system establishes quantitative thresholds including a 95% refusal rate for sensitive content and a 30% maximum for overseas training data, affecting all LLMs deployed in China.

What Changed

China’s 2026 LLM filing system introduces a structured regulatory framework categorizing models into three risk tiers: standard, medium, and high-risk. The system, detailed in official guidelines, applies to all large language models deployed within China regardless of their country of origin.

The classification determines both the review process and the timeline for regulatory approval:

  • Standard-risk models: Streamlined filing process with automated review
  • Medium-risk models: Enhanced documentation requirements with departmental review
  • High-risk models: Expert panel review requiring up to 3 months

According to the CSDN report, the filing requirements include two critical quantitative thresholds that will determine a model’s compliance status.

Why It Matters

The regulatory framework establishes specific, measurable criteria for LLM compliance in China:

Quantitative Thresholds

MetricThresholdScope
Content safety refusal rate≥ 95%Sensitive query detection
Overseas training data≤ 30%Training dataset composition

The 95% refusal rate threshold requires models to accurately identify and refuse responses to sensitive content categories. This metric demands sophisticated content moderation systems with high precision and recall for Chinese-language content boundaries.

The 30% cap on overseas training data introduces supply chain constraints for model developers. International LLM providers relying on globally-sourced datasets face restructuring requirements or potential market exclusion.

Review Timeline Impact

Risk LevelReview DurationReview Body
StandardDays to weeksAutomated/Departmental
MediumWeeks to 1 monthDepartmental
HighUp to 3 monthsExpert panel

The extended timeline for high-risk models creates go-to-market delays that affect commercial planning. Models classified as high-risk face a quarter-year approval window, requiring advance regulatory strategy.

🔺 Scout Intel: What Others Missed

Confidence: medium | Novelty Score: 68/100

The filing system’s quantitative thresholds create a regulatory asymmetry that favors domestic Chinese model developers. The 30% overseas training data cap disproportionately impacts foreign LLM providers who typically train on globally-sourced corpora—OpenAI’s GPT models, Anthropic’s Claude, and Google’s Gemini all derive significant training data from English-language sources. Meanwhile, Chinese domestic models like Baidu’s ERNIE and Alibaba’s Qwen already operate with predominantly domestic data, positioning them for smoother compliance.

The 95% refusal rate threshold requires culturally-specific content moderation infrastructure that cannot be imported from Western safety systems. Models trained primarily on non-Chinese data lack the linguistic and cultural context to achieve this threshold, creating a technical barrier that compounds the data composition requirement.

Key Implication: Foreign LLM providers seeking China market access face a dual compliance burden—dataset restructuring to meet the 30% cap, plus custom content moderation development for the 95% refusal threshold—that effectively necessitates a China-specific model variant, raising operational costs and delaying deployment timelines.

What This Means

For LLM Developers Entering China

Companies planning to deploy LLMs in China must evaluate their risk classification early. Models handling sensitive domains—healthcare, finance, legal, education—face higher likelihood of medium or high-risk designation. Pre-filing assessment should prioritize:

  1. Dataset audit: Quantify overseas training data percentage and plan remediation if exceeding 30%
  2. Content safety testing: Benchmark refusal rates against the 95% threshold using Chinese-language test sets
  3. Timeline planning: Factor potential 3-month review delay into commercial launch schedules

For Domestic Chinese AI Companies

The regulatory framework provides clarity that enables investment decisions. Companies with domestic training data pipelines and Chinese-language content moderation systems gain compliance advantages. This may accelerate the divergence between China-market LLMs and global models, reinforcing the development of parallel AI ecosystems.

What to Watch

  • Enforcement patterns: How regulators apply risk classifications in practice, particularly borderline cases
  • Foreign provider responses: Whether international AI companies develop China-specific model variants or exit the market
  • Threshold adjustments: Potential future changes to the 95% refusal rate or 30% overseas data caps based on industry feedback

Sources

China LLM Filing: Three-Tier Risk System, 3-Month Review

China's 2026 LLM filing system introduces three-tier risk classification with quantitative thresholds. High-risk models face up to 3-month expert panel review.

AgentScout · · · 4 min read
#china #llm-regulation #ai-governance #risk-classification #filing-system
Analyzing Data Nodes...
SIG_CONF:CALCULATING
Verified Sources

Key Facts

  • Who: China’s regulatory authorities governing large language models
  • What: Three-tier risk classification system with quantitative compliance thresholds
  • When: Filing system effective 2026
  • Impact: Applies to all LLMs deployed in China regardless of origin

TL;DR

China has implemented a three-tier risk classification system for large language model filing, requiring high-risk models to undergo expert panel review lasting up to 3 months. The system establishes quantitative thresholds including a 95% refusal rate for sensitive content and a 30% maximum for overseas training data, affecting all LLMs deployed in China.

What Changed

China’s 2026 LLM filing system introduces a structured regulatory framework categorizing models into three risk tiers: standard, medium, and high-risk. The system, detailed in official guidelines, applies to all large language models deployed within China regardless of their country of origin.

The classification determines both the review process and the timeline for regulatory approval:

  • Standard-risk models: Streamlined filing process with automated review
  • Medium-risk models: Enhanced documentation requirements with departmental review
  • High-risk models: Expert panel review requiring up to 3 months

According to the CSDN report, the filing requirements include two critical quantitative thresholds that will determine a model’s compliance status.

Why It Matters

The regulatory framework establishes specific, measurable criteria for LLM compliance in China:

Quantitative Thresholds

MetricThresholdScope
Content safety refusal rate≥ 95%Sensitive query detection
Overseas training data≤ 30%Training dataset composition

The 95% refusal rate threshold requires models to accurately identify and refuse responses to sensitive content categories. This metric demands sophisticated content moderation systems with high precision and recall for Chinese-language content boundaries.

The 30% cap on overseas training data introduces supply chain constraints for model developers. International LLM providers relying on globally-sourced datasets face restructuring requirements or potential market exclusion.

Review Timeline Impact

Risk LevelReview DurationReview Body
StandardDays to weeksAutomated/Departmental
MediumWeeks to 1 monthDepartmental
HighUp to 3 monthsExpert panel

The extended timeline for high-risk models creates go-to-market delays that affect commercial planning. Models classified as high-risk face a quarter-year approval window, requiring advance regulatory strategy.

🔺 Scout Intel: What Others Missed

Confidence: medium | Novelty Score: 68/100

The filing system’s quantitative thresholds create a regulatory asymmetry that favors domestic Chinese model developers. The 30% overseas training data cap disproportionately impacts foreign LLM providers who typically train on globally-sourced corpora—OpenAI’s GPT models, Anthropic’s Claude, and Google’s Gemini all derive significant training data from English-language sources. Meanwhile, Chinese domestic models like Baidu’s ERNIE and Alibaba’s Qwen already operate with predominantly domestic data, positioning them for smoother compliance.

The 95% refusal rate threshold requires culturally-specific content moderation infrastructure that cannot be imported from Western safety systems. Models trained primarily on non-Chinese data lack the linguistic and cultural context to achieve this threshold, creating a technical barrier that compounds the data composition requirement.

Key Implication: Foreign LLM providers seeking China market access face a dual compliance burden—dataset restructuring to meet the 30% cap, plus custom content moderation development for the 95% refusal threshold—that effectively necessitates a China-specific model variant, raising operational costs and delaying deployment timelines.

What This Means

For LLM Developers Entering China

Companies planning to deploy LLMs in China must evaluate their risk classification early. Models handling sensitive domains—healthcare, finance, legal, education—face higher likelihood of medium or high-risk designation. Pre-filing assessment should prioritize:

  1. Dataset audit: Quantify overseas training data percentage and plan remediation if exceeding 30%
  2. Content safety testing: Benchmark refusal rates against the 95% threshold using Chinese-language test sets
  3. Timeline planning: Factor potential 3-month review delay into commercial launch schedules

For Domestic Chinese AI Companies

The regulatory framework provides clarity that enables investment decisions. Companies with domestic training data pipelines and Chinese-language content moderation systems gain compliance advantages. This may accelerate the divergence between China-market LLMs and global models, reinforcing the development of parallel AI ecosystems.

What to Watch

  • Enforcement patterns: How regulators apply risk classifications in practice, particularly borderline cases
  • Foreign provider responses: Whether international AI companies develop China-specific model variants or exit the market
  • Threshold adjustments: Potential future changes to the 95% refusal rate or 30% overseas data caps based on industry feedback

Sources

arq371unrukvfd4qh6ounr░░░8uih6bfzp1x96jifdnk8grsfcdar5sh░░░fnj03cv60qng81mzpay18kvdzevqsn7hl████aepycaaysl6efqqo0nu2uiy8ufnqpesle░░░v1lm88hzxis081bglfdgi1fly5qh81xt████bmecylwgz3uk2hz6dmg76pluptmf7ac1e████lk0v8lrulelnj9yxv71zmu0u3o5xsb████b63bma5ezkff33wjaldxlvy8xwhcgik████95zbmjk34jr8xyykr7u3dr8f94xfsgj9e░░░u23nabed2mogh557c3imlabnh056irptj░░░2ylpnw2e5nhb9g3hrwo2gr9x3sldoffrb████a56mj38652r2rozikznbldd7fk5ok357████x6kgy8iakk9uj9qy4tac2pznls5wka2n████k30slmi1x1lmxae2x0oqcnit1t710kn7████34lmokmjw0mbclis9bjxk7hhy9gptvtjd░░░y6bps8ovn93bczrw51qo8m35q798y7l████lo1o77zlisf29lo0ldlo3vl8tqpdsc░░░vkn3ls3e7kx57r7o663u2w0n2jgcc4x████gvltht5m80qmucvv7z78qlk55yop1ki░░░zh1056lbiar9ydf6xwj2dji0bdphbzh1████io18atttfjjk2i7er2dglmtsv4v302b9████wlno3y4rlsi1d8lns5h82ih2kx2gwui2j████99tiqlns8mnkf3uqa7di2a92wok9ykuam████40l7xtxpgt3lkpts87bmkem828amfmh4f████7isy1zhscyrup1ic56332pyfqnsacc7u░░░ewu9wnxi4w7ui64sofoalbluicrxphcf░░░uz9oep7k16ijn8wmwn6coty031rj19d░░░uawyp5agfnd26b1lxp3j6i23kdtpg0jfm████wnmdvt4agzhygaxje6k1ywdw2et42hae░░░6vzw82cr83i5inhufg73iffwbonvs2cxj████sbldn9x1pnf8us3elqekqb1764whzm362████yxbtxrflpxf25zyh9iisolmugjvnqkmuk████ljorklyeidg03yv7a1t57bejb1oze2wld9████93t1sgm77gueemy9nnsrknjkpzg3v5hza████4uwagx8lhrvuppf3hc3zl812wr9wnwoj4████3yhrgp6co73wckaa0ktwiki2r5gkyq7░░░bwag5mj5x99bzfobtww9tjxi4i2zemu4d░░░8ualmwh2tz2r9p4jcb9rxdik8cg6yvbl░░░5cx8hccua2h837yxp8boohpmpuw9ib99c░░░9e27jcdyhqjq7v4njy7xt1ko1tndfay6████t9kt962bz80ptny6oi226dnzjgymlreio░░░eb9rrf1ifmdhyz9pth9bbii5f9ne2afc9░░░evmhp6n2k4574cn3qyxhgnt65tw7sjs░░░aa76rtq5d9o9jerrtx17d9uho6nu92kjq░░░b7orefiu2t6hpaeshne334ezi4odjw6wl████7ffq118naw7uvhcqam83t1d8fjphrizdi████okebkyhywp47kbe6pyfh92caucq4svva░░░cc0zh4jp0moz5294zg2jv9dzy471bsszu████np7ftcfe9gkjfq3c1mhu0gr3x1wjtlez████b1zofkzp4tf1o3qzk6eoxtv7m6i4gnug████7f6vtqhfadw