Companies

DeepSeek's R1 Model Training Costs Far Exceed Initial Estimates

Published February 3, 2025

A research firm has conducted an extensive analysis revealing that the cost to develop DeepSeek's R1 model was much higher than the company initially claimed.

Market Shock and DeepSeek's Claims

When DeepSeek, a Chinese AI company, announced its R1 model, it stated that the training cost was a mere $6 million. This claim startled the technology sector and led to a significant market decline, wiping out approximately $1 trillion in value, with $600 billion lost in the valuation of NVIDIA alone. In comparison, OpenAI's GPT-4 model is estimated to have a training cost ranging from $100 million to $200 million.

The initial perception was that the costs associated with training new AI models were inflated, but the findings from the research firm SemiAnalysis suggest otherwise. According to their analysis, the real cost of training DeepSeek's R1 model is far from the $5 million reported.

Breakdown of Training Expenses

In their report, SemiAnalysis detailed that DeepSeek invested heavily in GPU resources. They purchased 10,000 units of NVIDIA's A100 GPUs back in 2021. Additionally, they acquired 10,000 NVIDIA H8000 AI GPUs, specifically tailored for the Chinese market, along with another 10,000 NVIDIA H100 GPUs. The total investment in hardware alone amounts to a staggering $1.6 billion, while the ongoing operational expenses are estimated at about $944 million.

This monumental expenditure raises questions about the authenticity of DeepSeek's initial cost claims. More importantly, it draws attention to the competitive landscape of AI technology development, where investments can run into billions rather than millions.

As the implications of this revelation unfold, the market may need to recalibrate its understanding of AI training costs and the value attributed to groundbreaking models like DeepSeek's R1.

DeepSeek, AI, Market