DeepSeek-V3: A New Era of Open-Source AI Model, Surpassing GPT-4o in Key Benchmarks

DeepSeek has officially launched its groundbreaking DeepSeek-V3 model, marking a significant advancement in open-source artificial intelligence. This new model aims to redefine the landscape of AI capabilities, challenging even leading closed-source models like GPT-4o and Claude-3.5-Sonnet.

What is DeepSeek-V3?

DeepSeek-V3 is a self-developed Mixture-of-Experts (MoE) model featuring 671 billion parameters, with 37 billion parameters activated during use. It was pre-trained on a massive 14.8 trillion tokens. This extensive training has enabled the model to perform exceptionally well across various benchmarks.

MoE Architecture: Utilizing a Mixture-of-Experts architecture allows DeepSeek-V3 to handle complex tasks efficiently by activating only a subset of its vast parameters.

Key Performance Metrics and Benchmarks

DeepSeek-V3 has demonstrated impressive performance across a range of benchmarks, outperforming other open-source models and rivaling top-tier, closed-source models.

Knowledge and Reasoning

MMLU, MMLU-Pro, GPQA, SimpleQA: DeepSeek-V3 shows significant improvements in knowledge-based tasks compared to its predecessor, DeepSeek-V2.5, closely matching the performance of Claude-3.5-Sonnet-1022.

Long Text Handling

DROP, FRAMES, LongBench v2: In long-text evaluations, DeepSeek-V3 surpasses other models, proving its ability to understand and process extensive content effectively.

Coding Prowess

Codeforces: In algorithmic coding scenarios, DeepSeek-V3 significantly outperforms existing non-o1 category models.
SWE-Bench Verified: For engineering-related code tasks, DeepSeek-V3 approaches the performance levels of Claude-3.5-Sonnet-1022.

Mathematical Capabilities

AIME 2024, MATH, CNMO 2024: DeepSeek-V3 demonstrates substantial improvements in mathematical problem-solving, exceeding the performance of both open-source and closed-source models in competitions such as the American Invitational Mathematics Examination (AIME) and the Chinese National Mathematical Olympiad (CNMO).

Chinese Language Understanding

C-Eval: In educational assessments; DeepSeek-V3 performs comparably to Qwen2.5-72B.
C-SimpleQA: Excels particularly in factual knowledge within the Chinese language context.

Enhanced Generation Speed

One of the most notable improvements in DeepSeek-V3 is its generation speed. Innovations in algorithms and engineering have boosted the token generation rate from 20 tokens per second (TPS) to an impressive 60 TPS, providing users with a much faster and smoother experience compared to DeepSeek-V2.5.

API Service Updates and Pricing

With the release of DeepSeek-V3, there have been adjustments to the API service pricing to reflect the enhanced performance and speed. Model API pricing is adjusted to ¥0.5/2 per million input tokens (cached/uncached) and ¥8 per million output tokens. To allow users to experience DeepSeek-V3 there is a promotional period until February 8, 2025. Until Feb 8, 2025, DeepSeek-V3 API service pricing is ¥0.1/1 per million input tokens (cached/uncached) and ¥2 per million output tokens.

Open Source and Local Deployment

DeepSeek-V3 is trained using FP8 and has open-sourced its native FP8 weights. This commitment to open source facilitates broader adoption and customization within the AI community.

SGLang and LMDeploy support native FP8 inference for V3 models, while TensorRT-LLM and MindIE have implemented BF16 inference.

For model weights and local deployment details, refer to the DeepSeek-V3-Base on Hugging Face.

DeepSeek's Commitment

DeepSeek's dedication to open-source principles and long-term vision aims to democratize AGI (Artificial General Intelligence) technology. The introduction of DeepSeek-V3 represents significant progress in narrowing the capability gap between open and closed-source models. DeepSeek plans to continue enhancing the DeepSeek-V3 base model with deeper reasoning and multimodal capabilities, sharing their advancements with the community.

Try DeepSeek-V3

To experience the DeepSeek-V3 model, visit chat.deepseek.com and begin interacting with the latest version. The API service has been updated, and no configuration changes are needed to start using the new model.

The launch of DeepSeek-V3 marks a significant milestone in AI development, showcasing the potential of open-source models to achieve and even surpass the capabilities of proprietary systems.

This article seeks to bring awareness to the launch of the DeepSeek-V3 model. For previous information about the DeepSeek models you can read about the DeepSeek-V2.5 Release.

. . .

Cursive Text Generator (copy and paste) ― LingoJam

I also made another translator which converts your text into all sorts of fancy styles: "fancy text generator". And another one that generates italic text. You' ...

Extensions - Chrome Web Store

26-in-1 Chrome extension to Research, Re-write, and Summarise content on any website. Wiseone - Your AI Search & Reading Copilot.

Norton Password Generator

Create strong passwords with Password Generator. 6#LBR1wR6_esp1druZAf Strong password Use the slider, and select from the options, below, to lengthen your ...

Trump says China's DeepSeek AI 'should be a wake-up call'

Jan 27, 2025 ... “DeepSeek — a new AI model controlled by the Chinese Communist Party — openly erases the CCP's history of atrocities and oppression,” he said, ...

Chrome and Chromium refuse to read chrome-flags.conf or ...

Feb 10, 2021 ... I discovered that neither Chrome or Chromium are reading his config files. He has both ~/.config/chrome-flags.conf and ~/.config/chromium-flags.conf, with the ...