DeepSeek V4 takes the top spot on reasoning benchmarks at one-eighth the training cost of GPT-5
DeepSeek’s 671-billion-parameter mixture-of-experts model scored 94.3 on MMLU-Pro and outperformed OpenAI’s o3 on math tasks, yet the Hangzhou lab spent only $8.2 million on its final training run, according to internal documents reviewed by DTW.