Sun, 28 Jun 2026 · LIVE
Updated Apr 5, 2025 · 18:49
Computer News Updated Apr 5, 2025

Musk's Grok-3 slightly outperforms Chinese DeepSeek AI: Report

Elon Musk's Grok-3 has slightly outperformed the Chinese AI model DeepSeek, highlighting a fascinating technological showdown in artificial intelligence. The breakthrough demonstrates two distinct approaches to AI development: Musk's massive computational strategy versus DeepSeek's efficient, algorithm-driven method. While Grok-3 utilized 200,000 GPUs, DeepSeek achieved comparable results with dramatically fewer resources. This competition underscores the evolving landscape of AI innovation, where computational power and algorithmic creativity are both critical pathways to advancement.

New Delhi, April 5

As the artificial intelligence (AI) turf war escalates, Elon Musk-owned Grok and Chinese DeepSeek models now stand at the forefront of AI capability -- one optimised for accessibility and efficiency and the other for brute-force scale -- despite the vast disparity in training resources, a report showed on Saturday.

Grok-3 represents scale without compromise -- 200,000 NVIDIA H100s chasing frontier gains, while DeepSeek-R1 delivers similar performance using a fraction of the compute, signalling that innovative architecture and curation can rival brute force, according to Counterpoint Research.

Since February, DeepSeek has grabbed global headlines by open-sourcing its flagship reasoning model DeepSeek-R1 to deliver performance on a par with the world’s frontier reasoning models.

“What sets it apart isn’t just its elite capabilities, but the fact that it was trained using only 2,000 NVIDIA H800 GPUs — a scaled-down, export-compliant alternative to the H100, making its achievement a masterclass in efficiency,” said Wei Sun, principal analyst in AI at Counterpoint.

Musk’s xAI has unveiled Grok-3, its most advanced model to date, which slightly outperforms DeepSeek-R1, OpenAI’s GPT-o1 and Google’s Gemini 2.

“Unlike DeepSeek-R1, Grok-3 is proprietary and was trained using a staggering 200,000 H100 GPUs on xAI’s supercomputer Colossus, representing a giant leap in computational scale,” said Sun.

Grok-3 embodies the brute-force strategy — massive compute scale (representing billions of dollars in GPU costs) driving incremental performance gains. It’s a route only the wealthiest tech giants or governments can realistically pursue.

“In contrast, DeepSeek-R1 demonstrates the power of algorithmic ingenuity by leveraging techniques like Mixture-of-Experts (MoE) and reinforcement learning for reasoning, combined with curated and high-quality data, to achieve comparable results with a fraction of the compute,” explained Sun.

Grok-3 proves that throwing 100x more GPUs can yield marginal performance gains rapidly. But it also highlights rapidly diminishing returns on investment (ROI), as most real-world users see minimal benefit from incremental improvements.

In essence, DeepSeek-R1 is about achieving elite performance with minimal hardware overhead, while Grok-3 is about pushing boundaries by any computational means necessary, said the report.

— IANS

Reader Comments

Jamie L.

Fascinating to see how different approaches yield similar results! The efficiency of DeepSeek is impressive, but I wonder if Grok's brute-force method will pull ahead in the long run. 🤔

Rahul K.

DeepSeek open-sourcing their model is a game-changer for AI research! More companies should follow this approach to accelerate innovation.

Maria S.

While the performance comparison is interesting, I think the article could have explored more about the environmental impact of these different approaches. 200,000 GPUs must have a massive carbon footprint!

Trevor W.

Elon's approach reminds me of the early days of SpaceX - throw resources at the problem until it works. Sometimes brute force does win, but at what cost? 💸

Ling C.

As an AI researcher, I'm more excited about DeepSeek's methods. Efficiency matters when you don't have billions to spend on compute. This could democratize AI development!

Alex P.

The diminishing returns point is crucial. Most users won't notice the difference between these models in daily use. Maybe we should focus more on practical applications than benchmark wars?

We welcome thoughtful discussions from our readers. Please keep comments respectful and on-topic.

Reader Voices

Leave a comment

Be kind. Add to the conversation. 0/50
Thank you — your comment has been submitted.
JS blocked