DeepSeek V4 Preview: China AI Upstart Challenges Silicon Valley Again
DeepSeek V4 Preview: China’s AI Upstart Challenges Silicon Valley Again
Exactly 15 months after its V3 model sent shockwaves through Silicon Valley, Chinese AI lab DeepSeek has released a preview of its long-awaited V4 model — and the early numbers suggest the gap between open-source and frontier AI is closing faster than anyone expected.
Announced on April 24, 2026, DeepSeek V4 arrives in the same week as OpenAI’s GPT-5.5 (“Spud”) and Anthropic’s Claude Opus 4.7, making it arguably the most competitive week in AI history. But unlike its American rivals, DeepSeek is doing something radical: releasing its most powerful model yet as open weights under the MIT license, available for anyone to download, modify, and deploy.

Two Models, One Ambition: V4-Pro and V4-Flash
DeepSeek V4 comes in two variants, both built on a Mixture-of-Experts (MoE) architecture:
- DeepSeek-V4-Pro: The flagship model with 1.6 trillion total parameters — making it the largest open-weights model ever released — but only 49 billion active parameters per token. Pre-trained on 33 trillion tokens, it achieves an 80.6% score on SWE-bench Verified, placing it within 0.2 points of Claude Opus 4.6’s verified score.
- DeepSeek-V4-Flash: A leaner variant with 284 billion total parameters and 13 billion active parameters, designed for high-throughput, cost-sensitive applications.
Both models support a native one-million-token context window — a significant leap that enables processing entire codebases, lengthy legal documents, or full books in a single inference pass.
“DeepSeek-V4 is the first open model family built from the ground up to rival proprietary frontier models on both performance and scale,” the company stated in its announcement.
The Huawei Connection: AI Independence in Action
Perhaps the most geopolitically significant aspect of the V4 release is its deep integration with Huawei’s Ascend AI processors. DeepSeek worked closely with Huawei to optimize V4 for Chinese-made chips, a direct response to U.S. export restrictions that have blocked access to Nvidia’s most advanced GPUs.
On the same day as the V4 preview launch, Huawei announced “full support” for DeepSeek’s models on its Ascend platform. This partnership signals China’s growing self-sufficiency in AI infrastructure and demonstrates that competitive models can be trained without relying on Western semiconductor supply chains.
Cost Advantage That Can’t Be Ignored
DeepSeek has built its reputation on extreme cost efficiency, and V4 continues that trajectory. On standard cache-miss pricing, V4-Pro costs approximately one-seventh as much as GPT-5.5 and about one-sixth the cost of Claude Opus 4.7 per million input tokens.
For output tokens, the price gap is even more dramatic: V4-Pro charges roughly $3.48 per million output tokens compared to Claude’s $25 — a 7x price advantage at near-identical benchmark performance on coding tasks.
When cached input discounts are applied, the cost differential widens further. For teams running high-volume inference workloads, V4-Flash’s 13-billion active parameter footprint delivers utility at a fraction of the cost of any Western competitor.
Benchmark Performance: The Numbers Behind the Claims
The V4-Pro’s 80.6% score on SWE-bench Verified is the headline benchmark, but the model reportedly performs competitively across a range of standard evaluations:
- SWE-bench Verified: 80.6% — within 0.2 points of Claude Opus 4.6
- AIME (mathematics): Competitive with GPT-5.5 and Claude Opus 4.7
- GPQA (scientific reasoning): Near frontier-level performance
- Long-context tasks: One-million-token native context enables novel applications in document analysis and code understanding
It’s important to note that DeepSeek describes V4 as a “preview” release, meaning these benchmark scores are preliminary and may improve before the final launch. The company has already delayed the full release three times over the past four months, suggesting a focus on quality over speed.
How V4 Compares to the Competition
The April 2026 AI landscape is unprecedented: three world-class models from three different labs, all released within days of each other. Here’s how they stack up:
- GPT-5.5 (OpenAI): The most capable agentic model, with deep tool-use integration and a growing “super app” ecosystem. Closed-source, premium pricing.
- Claude Opus 4.7 (Anthropic): The verified benchmark leader with strongest long-context reasoning and safety alignment. Closed-source, highest pricing tier.
- DeepSeek V4 (DeepSeek AI): The open-source challenger matching 95%+ of frontier performance at roughly 15% of the cost. MIT-licensed, self-hostable.
The choice between them increasingly comes down to a trade-off between absolute peak performance (Western models) versus cost efficiency and openness (DeepSeek).
What This Means for Developers and Enterprises
For developers, V4’s MIT license is transformative. Unlike proprietary models that require ongoing API subscriptions, V4 can be downloaded from Hugging Face, modified, and deployed on-premises — giving teams full control over their AI infrastructure.
The model is already available for testing through three channels:
- chat.deepseek.com: V4-Pro is accessible as “Expert Mode” in DeepSeek’s chat interface
- DeepSeek API: Both Pro and Flash variants are available for integration
- Hugging Face: Full model weights are downloadable under the MIT license
For enterprises, the pricing differential is compelling. A company processing 100 million input tokens monthly would pay approximately $14 with V4-Pro versus $150+ with Claude or GPT — a savings that compounds rapidly at scale.
The Retirement Timeline: V3 and V3.2 Are Being Phased Out
DeepSeek has announced that V3 and V3.2 will be retired after July 24, 2026, giving developers a three-month migration window. The company is clearly betting that V4’s performance and pricing will make the transition painless — and for most use cases, the upgrade path looks straightforward.
The Road Ahead: Can Open Source Win the AI Race?
DeepSeek’s V4 preview represents more than a product launch — it’s a statement about the future of AI development. By releasing a 1.6-trillion-parameter model under a permissive open-source license, DeepSeek is betting that transparency and accessibility will ultimately outweigh the advantages of closed, proprietary systems.
Whether V4 lives up to its preview claims will depend on the final release. But the early signals are clear: the AI race is no longer a two-horse competition between OpenAI and Anthropic. DeepSeek has firmly established itself as a third pole — and with V4, it’s closer to the top than ever before.
Key Takeaways
- DeepSeek V4-Pro (1.6T parameters, 49B active) and V4-Flash (284B, 13B active) are now available as preview releases
- Both models feature a native 1-million-token context window
- V4-Pro scores 80.6% on SWE-bench Verified, nearly matching Claude Opus 4.6
- API costs are 6-7x lower than GPT-5.5 and Claude Opus 4.7
- Full model weights are open-sourced under the MIT license on Hugging Face
- Optimized for Huawei Ascend AI chips, demonstrating China’s AI self-sufficiency
- V3 and V3.2 will be retired on July 24, 2026
📖 Related: China’s DeepSeek Previews New AI Model a Year After Jolting US Rivals
📖 Related: OpenAI’s Latest Model Evolution: What’s Real, What’s Hype, and Why Coding Keeps Getting Better




