DeepSeek AI has officially released DeepSeek-V3, the first model in its new series, and simultaneously open-sourced it to the AI community. This launch marks a significant milestone in the pursuit of accessible and powerful artificial intelligence.
The DeepSeek-V3 model is now available for interaction on the official website, chat.deepseek.com. The API service has also been updated, requiring no configuration changes for existing users. It's worth noting that the current version does not support multimodal input and output.
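As an illustration, the API remains drop-in compatible with existing integrations; the sketch below assumes an OpenAI-compatible SDK style, the https://api.deepseek.com base URL, and the deepseek-chat model name, so adjust these details to match your own setup.

```python
# Minimal sketch of calling the updated API via an OpenAI-compatible client.
# The base URL, model name, and placeholder key are assumptions; substitute
# the values from your own account and integration.
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_DEEPSEEK_API_KEY",      # placeholder credential
    base_url="https://api.deepseek.com",  # assumed OpenAI-compatible endpoint
)

response = client.chat.completions.create(
    model="deepseek-chat",                # now served by DeepSeek-V3
    messages=[{"role": "user", "content": "Hello, DeepSeek-V3!"}],
)
print(response.choices[0].message.content)
```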
DeepSeek-V3 is a self-developed Mixture-of-Experts (MoE) model with 671 billion total parameters, of which 37 billion are activated per token. It has been pre-trained on a massive 14.8 trillion tokens.
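To make the "671 billion total, 37 billion activated" distinction concrete, here is a toy sketch of top-k expert routing in a MoE layer: every token is scored by a router and dispatched to only a few experts, so most parameters stay idle for any given token. The dimensions, expert count, and k value are purely illustrative and are not DeepSeek-V3's actual configuration.

```python
# Toy top-k Mixture-of-Experts routing: each token only runs through k experts,
# so only a small fraction of the layer's parameters are active per token.
# All sizes here are illustrative, not DeepSeek-V3's real configuration.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyMoELayer(nn.Module):
    def __init__(self, d_model=64, d_ff=256, num_experts=8, k=2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(d_model, num_experts)  # scores each expert per token
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        )

    def forward(self, x):                              # x: (tokens, d_model)
        gate_probs = F.softmax(self.router(x), dim=-1)
        topk_probs, topk_idx = gate_probs.topk(self.k, dim=-1)
        out = torch.zeros_like(x)
        for slot in range(self.k):                     # only k experts run per token
            for e in topk_idx[:, slot].unique():
                mask = topk_idx[:, slot] == e
                out[mask] += topk_probs[mask, slot:slot + 1] * self.experts[int(e)](x[mask])
        return out

layer = ToyMoELayer()
print(layer(torch.randn(4, 64)).shape)  # torch.Size([4, 64])
```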
For detailed technical specifications, you can refer to the research paper available on GitHub.
DeepSeek-V3 has demonstrated impressive performance across various benchmarks, surpassing other open-source models like Qwen2.5-72B and Llama-3.1-405B. In several key areas, it rivals the performance of leading closed-source models such as GPT-4o and Claude-3.5-Sonnet.
Through innovative algorithms and engineering optimizations, DeepSeek-V3 achieves a generation speed of 60 tokens per second (TPS), a threefold increase compared to the V2.5 model.
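Readers who want to sanity-check throughput on their own requests can time a streamed completion; the sketch below reuses the hypothetical client setup from the earlier snippet and counts streamed chunks as a rough proxy for tokens, so treat the resulting number as an approximation rather than an official measurement.

```python
# Rough, illustrative throughput check: stream a completion and estimate
# tokens per second by counting streamed content chunks (a chunk is not
# always exactly one token, so this is only an approximation).
import time
from openai import OpenAI

client = OpenAI(api_key="YOUR_DEEPSEEK_API_KEY", base_url="https://api.deepseek.com")

start = time.time()
chunks = 0
stream = client.chat.completions.create(
    model="deepseek-chat",
    messages=[{"role": "user", "content": "Write a short paragraph about MoE models."}],
    stream=True,
)
for chunk in stream:
    if chunk.choices and chunk.choices[0].delta.content:
        chunks += 1
elapsed = time.time() - start
print(f"~{chunks / elapsed:.1f} chunks/sec (rough proxy for TPS)")
```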
With the introduction of the more powerful and faster DeepSeek-V3, the model API service pricing has been adjusted to:
However, DeepSeek AI is offering a 45-day promotional period where the API service pricing will remain at the previous rates:
This promotional pricing runs until February 8, 2025, and is available to existing users as well as new users who register during the period.
DeepSeek-V3 is trained in FP8, and its native FP8 weights are open-sourced. The open-source community already supports native FP8 inference for the V3 model through SGLang and LMDeploy, while TensorRT-LLM and MindIE provide BF16 inference. DeepSeek AI also offers a script for converting the FP8 weights to BF16.
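The official conversion script in the DeepSeek-V3 repository is the authoritative reference; the snippet below is only a simplified, hypothetical illustration of the underlying idea, assuming block-quantized FP8 weights with one scale per block (the real script's tensor names and block layout may differ).

```python
# Simplified, hypothetical illustration of FP8 -> BF16 dequantization with a
# per-block scale. Block size, tensor names, and layout are assumptions; the
# official conversion script is the authoritative implementation.
import torch

def dequantize_fp8_block(weight_fp8: torch.Tensor,
                         scale: torch.Tensor,
                         block: int = 128) -> torch.Tensor:
    """Upcast a block-quantized FP8 weight to BF16 and re-apply its block scales."""
    w = weight_fp8.to(torch.bfloat16)
    rows, cols = w.shape
    out = torch.empty_like(w)
    for i in range(0, rows, block):
        for j in range(0, cols, block):
            out[i:i + block, j:j + block] = (
                w[i:i + block, j:j + block] * scale[i // block, j // block]
            )
    return out

# Toy usage with random data standing in for real checkpoint tensors.
w_fp8 = torch.randn(256, 256).to(torch.float8_e4m3fn)
scales = torch.rand(2, 2, dtype=torch.bfloat16)
print(dequantize_fp8_block(w_fp8, scales).dtype)  # torch.bfloat16
```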
Model weights and deployment information can be found on Hugging Face. See how DeepSeek compares to other models on the Open Model Initiative Leaderboard.
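For convenience, a minimal sketch for fetching the checkpoint with huggingface_hub is shown below; the deepseek-ai/DeepSeek-V3 repository id is assumed from the announcement, and the model card remains the place to check for the recommended inference stack.

```python
# Minimal sketch for downloading the open-source weights from Hugging Face.
# The repo id is an assumption based on the announcement; consult the model
# card for deployment details (SGLang, LMDeploy, TensorRT-LLM, MindIE).
from huggingface_hub import snapshot_download

local_dir = snapshot_download(
    repo_id="deepseek-ai/DeepSeek-V3",  # assumed official model repository
    local_dir="./DeepSeek-V3",          # destination for the checkpoint files
)
print(f"Weights downloaded to {local_dir}")
```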
DeepSeek AI's commitment to open-source principles continues with the release of DeepSeek-V3, believing that sharing its advancements with the community will help diminish the capability gap between open and closed-source models.
This is just the beginning: DeepSeek AI plans to enhance the DeepSeek-V3 base model with features such as reasoning capabilities and multimodal support.