DeepSeek R1

published

DeepSeek's reasoning model

Open-source reasoning model. Vendor-reported scores: 90.8% on MMLU, 97.3% on MATH-500. Chain-of-thought reasoning, released under the MIT license.

Provider deepseek
Type reasoning
Access open_source
Params 671B MoE
Context 128k
License mit

Benchmarks (2)

Benchmark   Score   Reported By   Source
MMLU        90.8%   vendor        source
MATH-500    97.3%   vendor        source

Why It Matters

First open-source model reported to match OpenAI o1-level reasoning on benchmarks. Its reasoning traces are fully visible and can be inspected.
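
One way to inspect a trace, as a minimal sketch: it assumes the raw completion wraps its reasoning in a single `<think>...</think>` block ahead of the final answer. The helper name `split_reasoning` and the sample string are hypothetical and are not part of any DeepSeek API.

```python
import re

# Assumption: the reasoning trace is wrapped in one <think>...</think> block.
_THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(completion: str) -> tuple[str, str]:
    """Return (reasoning, answer); reasoning is "" when no <think> block is found."""
    match = _THINK_RE.search(completion)
    if match is None:
        return "", completion.strip()
    reasoning = match.group(1).strip()
    # Everything outside the <think> block is treated as the final answer.
    answer = (completion[:match.start()] + completion[match.end():]).strip()
    return reasoning, answer


# Made-up completion, for illustration only.
sample = "<think>17 * 3 = 51, then 51 + 4 = 55.</think>The result is 55."
reasoning, answer = split_reasoning(sample)
print("reasoning:", reasoning)
print("answer:", answer)
```

Keeping the trace and the answer separate makes it possible to log or review the reasoning without showing it to end users.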

Known Limitations

Generates a very large number of reasoning (thinking) tokens before the final answer, so responses are slower than those of non-reasoning models.
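
The cost of thinking tokens can be estimated with simple arithmetic. The sketch below assumes decode time scales linearly with the number of generated tokens; the function name `reasoning_overhead` and all numbers are hypothetical and are not measurements of DeepSeek R1.

```python
def reasoning_overhead(thinking_tokens: int, answer_tokens: int,
                       tokens_per_second: float) -> dict:
    """Estimate the share of output and decode time spent on thinking tokens."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    total = thinking_tokens + answer_tokens
    return {
        "total_tokens": total,
        # Fraction of generated tokens that are reasoning rather than answer.
        "thinking_share": thinking_tokens / total if total else 0.0,
        # Assumes a constant decode rate for every token.
        "decode_seconds": total / tokens_per_second,
        "answer_only_seconds": answer_tokens / tokens_per_second,
    }


# Hypothetical workload: 3000 thinking tokens, 200 answer tokens, 40 tokens/s.
estimate = reasoning_overhead(thinking_tokens=3000, answer_tokens=200,
                              tokens_per_second=40.0)
print(estimate)
```

Under these made-up numbers, almost all of the decode time goes to the reasoning trace rather than the answer, which is the trade-off this limitation describes.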

Released 2025-01-20
Training cutoff 2024-12
Created March 22, 2026
Last reconciled Never