Introducing the Qwen 3 Family

AI Summary

The Qwen team has released Qwen 3, a family of models that includes:

Mixture-of-experts models: 235 billion parameters (22 billion active) and 30 billion parameters (3 billion active).

Dense models ranging from 0.6 billion to 32 billion parameters.
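To make the "active" figures concrete, the short sketch below computes what share of each mixture-of-experts model's parameters is used per token, taking the counts from the list above. The dictionary labels and the helper name `active_fraction` are illustrative, not official model identifiers.

```python
# (total, active) parameter counts in billions, as quoted in this summary.
# The labels are descriptive only, not official model names.
MOE_MODELS = {
    "235B total / 22B active": (235, 22),
    "30B total / 3B active": (30, 3),
}


def active_fraction(total_b: float, active_b: float) -> float:
    """Return the share of parameters that are active for a single token."""
    if total_b <= 0 or not 0 <= active_b <= total_b:
        raise ValueError("expected 0 <= active_b <= total_b and total_b > 0")
    return active_b / total_b


for label, (total, active) in MOE_MODELS.items():
    print(f"{label}: {active_fraction(total, active):.1%} of parameters active")
```

Both models activate roughly a tenth of their parameters per token, which is why their inference cost is closer to that of a much smaller dense model.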

Key Features:

Hybrid Models: Support an adjustable thinking mode, so a model can reason step by step or answer directly, and performance improves as the token budget allotted to reasoning grows.

Multilingual Support: Covers 119 languages and dialects.

Enhanced Tool Use: The models are better at calling external tools, which broadens the range of tasks they can handle.

Training Improvements: The training corpus has roughly doubled relative to Qwen 2.5, to approximately 36 trillion tokens, and includes extensive multilingual data plus synthetic data for specific tasks.
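The idea of a token budget for reasoning can be illustrated with a small sketch. It is not Qwen's API: the names `GenerationPlan`, `plan_generation`, and `thinking_budget` are hypothetical, and the code only shows one plausible way a caller might split an output limit between reasoning and the final answer.

```python
from dataclasses import dataclass


@dataclass
class GenerationPlan:
    """Hypothetical split of an output limit between reasoning and answer."""
    thinking: bool        # whether step-by-step reasoning is enabled
    thinking_tokens: int  # tokens reserved for reasoning
    answer_tokens: int    # tokens left for the final answer


def plan_generation(max_new_tokens: int, thinking_budget: int) -> GenerationPlan:
    """Reserve up to `thinking_budget` tokens for reasoning.

    A budget of 0 disables thinking. At least one token is always
    kept for the final answer.
    """
    if max_new_tokens <= 0:
        raise ValueError("max_new_tokens must be positive")
    if thinking_budget < 0:
        raise ValueError("thinking_budget must be non-negative")
    thinking_tokens = min(thinking_budget, max_new_tokens - 1)
    return GenerationPlan(
        thinking=thinking_tokens > 0,
        thinking_tokens=thinking_tokens,
        answer_tokens=max_new_tokens - thinking_tokens,
    )


# A quick factual question: skip reasoning entirely.
print(plan_generation(max_new_tokens=1024, thinking_budget=0))
# A harder problem: allow up to 768 tokens of reasoning.
print(plan_generation(max_new_tokens=1024, thinking_budget=768))
```

In practice the budget would be enforced by the serving stack or the model's own chat template; the sketch only captures the trade-off between reasoning length and answer length.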

Training Process:

Pre-training on about 30 trillion tokens of general web data.

Incorporation of knowledge-intensive data (e.g., STEM) with synthetic data.

Supervised fine-tuning and reinforcement learning for varied reasoning abilities.

Testing and Access:

Available for testing at chat.qwen.ai, where users can experiment with different models and settings.

Further updates and features are expected in future releases.

ThirdBrAIn.tech
