📖 The AI Tool Bible

DeepSeek V3

by DeepSeek · open weights
DeepSeek V3 family
Parameters: 671B
Context window: 64K tokens
Released: 2024-12-26
Input price (per 1M tokens): $0.27
Output price (per 1M tokens): $1.10
Modality: tools
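To make the listed prices concrete, here is a minimal sketch of a per-request cost estimate. The function name `estimate_cost` is hypothetical, the prices are copied from the card, and two things are assumptions: that "64K" means 64,000 tokens, and that input and output tokens share the same context window. Actual billing on any given host may differ.

```python
# Prices copied from the card above (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.27
OUTPUT_PRICE_PER_M = 1.10

# Assumption: "64K" is read as 64,000 tokens, shared by input and output.
CONTEXT_WINDOW = 64_000


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed prices."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the listed 64K context window")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # A 10,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost(10_000, 2_000):.4f}")
```

At these prices, a 10,000-token prompt with a 2,000-token reply comes to about $0.0049.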

About DeepSeek V3

DeepSeek's Mixture-of-Experts (MoE) flagship: 671B total parameters, 37B active per token. Open weights, GPT-4-class benchmarks, and hosted API pricing roughly 50-100x cheaper than Western frontier models. Its release broke the assumption that frontier LLMs need $100M training budgets.
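The MoE efficiency claim comes down to simple arithmetic on the two parameter counts quoted above. A quick sketch, using only those figures (the variable names are illustrative):

```python
# Figures from the card: total vs. active parameters, in billions.
TOTAL_PARAMS_B = 671
ACTIVE_PARAMS_B = 37

# Share of the model's parameters used for any single token.
active_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B

print(f"{active_fraction:.1%} of parameters active per token")
```

So each token touches only about 5.5% of the parameters, which is where the per-token compute savings come from. All 671B parameters still have to be stored and served.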

Strengths

  • Open weights
  • Very cheap on hosted APIs
  • MoE efficiency
  • Strong code + math

Weaknesses

  • Text-only
  • 64K context (small vs peers)
  • US enterprise wariness of Chinese-origin models

Best for

  • Cost-sensitive production
  • Fine-tuning
  • Code generation