MiniMax M2.1

MiniMax M2.1 is MiniMax’s coding-focused language model, released on December 23, 2025.

Key Specifications

  • Release Date: December 23, 2025
  • Architecture: Sparse Mixture-of-Experts (MoE)
  • Total Parameters: 230 billion
  • Active Parameters: 10 billion per token
  • Context Window: 200,000 tokens
  • Quantization: FP8 native
  • Attention: Multi-Head Attention (MHA)
  • Modalities: Text, image, audio, video inputs
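
To make the sparse MoE figures above concrete, the short sketch below computes what share of the model is active for any one token. It uses only the two parameter counts from the list; the function name is illustrative.

```python
# Parameter counts taken from the specification list above.
TOTAL_PARAMS = 230e9
ACTIVE_PARAMS = 10e9


def active_fraction(total: float, active: float) -> float:
    """Share of the model's parameters used to process a single token."""
    return active / total


fraction = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
print(f"Active per token: {fraction:.1%}")  # prints "Active per token: 4.3%"
```

Per-token compute scales with the active share, while memory requirements still follow the full parameter count.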

Pricing

  • Input: $0.30 per million tokens
  • Output: $1.20 per million tokens
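
As a rough illustration of what these rates mean per request, the sketch below converts token counts into a dollar cost. The rates come from the list above; the function name and the example token counts are assumptions made for illustration.

```python
from decimal import Decimal

# Listed rates in USD per million tokens.
INPUT_USD_PER_MTOK = Decimal("0.30")
OUTPUT_USD_PER_MTOK = Decimal("1.20")
TOKENS_PER_MILLION = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Cost in USD of one request at the listed rates."""
    total = (Decimal(input_tokens) * INPUT_USD_PER_MTOK
             + Decimal(output_tokens) * OUTPUT_USD_PER_MTOK)
    return total / TOKENS_PER_MILLION


# Hypothetical request: a 150K-token prompt with a 4K-token reply.
cost = request_cost(150_000, 4_000)
print(f"${cost:.4f}")  # prints "$0.0498"
```

Decimal is used instead of float so that small per-request amounts add up without rounding drift.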

Benchmarks

Benchmark            M2.1 Score   Notes
SWE-Multilingual     72.5%        Outperforms Claude Sonnet 4.5, Gemini 3 Pro
VIBE (Full-Stack)    88.6%        Web: 91.5, Android: 89.7
SWE-bench Verified   74.0%        Up from M2’s 69.4%
Multi-SWE-bench      49.4%        Up from M2’s 36.2%
MMLU                 88           General knowledge

Key Capabilities

Coding Focus

  • Multi-file code editing
  • Code-run-fix loops
  • Multi-language programming (Python, JS, Go, Rust, etc.)
  • Full-stack web and mobile development
  • Native Android and iOS development
  • Godot game engine and C++ graphics
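
The "code-run-fix loop" in the list above can be sketched as follows: run the code, capture any error, hand the source and error to a model, and retry. This is a minimal sketch, not MiniMax's implementation. `propose_fix` is a hypothetical callable standing in for a model call, and `fake_model` is a hard-coded stub so that the example runs offline.

```python
from typing import Callable, Optional


def run_snippet(source: str) -> Optional[str]:
    """Execute source; return None on success or an error message on failure."""
    try:
        exec(compile(source, "<snippet>", "exec"), {})
        return None
    except Exception as exc:
        return f"{type(exc).__name__}: {exc}"


def code_run_fix(source: str,
                 propose_fix: Callable[[str, str], str],
                 max_rounds: int = 3) -> tuple[str, bool]:
    """Repeat run -> fix until the snippet passes or the rounds run out."""
    for _ in range(max_rounds):
        error = run_snippet(source)
        if error is None:
            return source, True
        # Hypothetical model call: receives the source and the error text.
        source = propose_fix(source, error)
    return source, run_snippet(source) is None


def fake_model(source: str, error: str) -> str:
    """Stub standing in for a real model; it only repairs one known typo."""
    return source.replace("totl", "total")


buggy = "total = sum([1, 2, 3])\nassert totl == 6\n"
fixed, ok = code_run_fix(buggy, fake_model)
print(ok)  # prints "True"
```

In a real agent the snippet would normally run in a sandboxed subprocess rather than through `exec`, and the stub would be replaced by an actual model request.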

Efficiency

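  • Only 10B parameters are active per token, which keeps per-token compute and latency low
  • Reportedly runs on a single H100 (a data-center GPU, not consumer hardware) or dual RTX 4090; FP8 weights alone total roughly 230 GB, so such setups likely depend on lower-bit quantization or CPU offloading
  • Concise, high-efficiency responses
  • Low response latency suited to interactive developer workflows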
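
The hardware claim above is easier to judge with a back-of-the-envelope weight-memory estimate. The sketch below counts weights only (no KV cache or activations). The GPU capacities of 80 GB for an H100 and 24 GB per RTX 4090 are commonly published figures rather than values from this page, and the 4-bit row is a hypothetical quantization level.

```python
TOTAL_PARAMS = 230e9  # total parameter count from the specifications


def weight_memory_gb(total_params: float, bytes_per_param: float) -> float:
    """Memory needed to hold the weights alone, in gigabytes (1e9 bytes)."""
    return total_params * bytes_per_param / 1e9


fp8_gb = weight_memory_gb(TOTAL_PARAMS, 1.0)   # FP8: one byte per parameter
int4_gb = weight_memory_gb(TOTAL_PARAMS, 0.5)  # hypothetical 4-bit quantization

# Assumed GPU memory capacities in GB (not stated in this document).
setups = {"1x H100": 80, "2x RTX 4090": 2 * 24}

for name, capacity in setups.items():
    print(f"{name}: {capacity} GB available, "
          f"FP8 needs {fp8_gb:.0f} GB, 4-bit needs {int4_gb:.0f} GB")
```

Under these assumptions neither setup holds the full weights in GPU memory, so running on them would depend on offloading or more aggressive compression.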

Long Context

  • 200K tokens (up from M2’s 128K)
  • Processes entire product specs and design docs in a single prompt
  • Simplifies RAG setups, since more source material fits directly in context
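
One practical use of the context figure is deciding whether a set of documents can be sent whole instead of going through retrieval. The sketch below is a rough check: the 4-characters-per-token ratio is a common rule of thumb for English text and not a property of this model's tokenizer, and the output reservation is an arbitrary illustrative value.

```python
CONTEXT_WINDOW = 200_000  # tokens, from the specifications
CHARS_PER_TOKEN = 4       # assumption: rough heuristic, not the real tokenizer


def estimate_tokens(text: str) -> int:
    """Approximate token count, rounded up."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(documents: list[str], reserved_for_output: int = 8_000) -> bool:
    """True if all documents plus an output budget fit in the window."""
    needed = sum(estimate_tokens(doc) for doc in documents)
    return needed + reserved_for_output <= CONTEXT_WINDOW


# Placeholder documents of about 100K and 50K estimated tokens.
docs = ["x" * 400_000, "y" * 200_000]
print(fits_in_context(docs))  # prints "True"
```

For a real decision, the estimate should be replaced with counts from the model's own tokenizer.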

Improvements Over M2

  • Context: 128K → 200K tokens
  • SWE-Multilingual: 56.5% → 72.5%
  • Multi-SWE-bench: 36.2% → 49.4%
  • Advanced Interleaved Thinking for reasoning
  • Reduced latency and verbosity
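
The before-and-after figures on this page can be turned into absolute and relative gains with a few lines. The sketch below uses only numbers already quoted here (the SWE-bench Verified pair comes from the benchmark table); the helper name is illustrative.

```python
# (M2 score, M2.1 score) in percent, as quoted on this page.
SCORES = {
    "SWE-Multilingual": (56.5, 72.5),
    "Multi-SWE-bench": (36.2, 49.4),
    "SWE-bench Verified": (69.4, 74.0),
}


def gain(before: float, after: float) -> tuple[float, float]:
    """Return (percentage-point gain, relative gain as a fraction)."""
    points = after - before
    return points, points / before


for name, (m2, m21) in SCORES.items():
    points, relative = gain(m2, m21)
    print(f"{name}: +{points:.1f} points ({relative:.0%} relative)")

# Context window growth, in tokens.
context_ratio = 200_000 / 128_000
print(f"Context: {context_ratio:.2f}x")  # prints "Context: 1.56x"
```

Reporting both forms avoids confusing a gain in percentage points with a relative improvement.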

Use Cases

  • Backing model for AI coding assistants and agent tools (Claude Code, Cline, Roo Code, Droid)
  • Long-horizon developer toolchains
  • Complex agentic workflows
  • Full-stack application development
