DeepSeek AI has officially launched its latest model, DeepSeek-V3, marking a significant milestone in the advancement of open-source artificial intelligence. This comprehensive article explores the key features, performance benchmarks, and accessibility aspects of DeepSeek-V3, providing valuable insights for AI enthusiasts, developers, and industry professionals.
On December 26, 2024, DeepSeek unveiled the first version of its DeepSeek-V3 model series, immediately making it available to the public under an open-source license. This release reinforces DeepSeek's commitment to democratizing AI technology and fostering collaboration within the AI community. Users can start interacting with the DeepSeek-V3 model via chat.deepseek.com, with API services updated accordingly.
DeepSeek-V3 is a self-developed Mixture-of-Experts (MoE) model with 671B total parameters, of which 37B are activated per token, pre-trained on 14.8T tokens. According to the research paper available on GitHub, DeepSeek-V3 outperforms other open-source models such as Qwen2.5-72B and Llama-3.1-405B on several benchmarks and rivals top-tier closed-source models such as GPT-4o and Claude-3.5-Sonnet.
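The gap between total and active parameters is the defining trait of MoE architectures: a router selects a small subset of expert subnetworks per token, so only a fraction of the model's weights participate in any one forward pass. The toy sketch below illustrates top-k expert routing in miniature; the dimensions, top-k value, and softmax gating shown here are illustrative and do not reflect DeepSeek-V3's actual configuration.

```python
# Toy sketch of Mixture-of-Experts routing: a gating network scores all
# experts, but only the top_k highest-scoring experts actually run.
# Sizes and top_k are illustrative, not DeepSeek-V3's real configuration.
import numpy as np

def moe_forward(x, gate_w, expert_ws, top_k=2):
    """Route input vector x through only the top_k highest-scoring experts."""
    scores = x @ gate_w                              # router logits, one per expert
    top = np.argsort(scores)[-top_k:]                # indices of the chosen experts
    weights = np.exp(scores[top] - scores[top].max())
    weights /= weights.sum()                         # softmax over chosen experts only
    # Only the chosen experts' parameters are touched; the rest stay idle,
    # which is why active parameters can be far fewer than total parameters.
    return sum(w * (x @ expert_ws[i]) for w, i in zip(weights, top))

rng = np.random.default_rng(0)
d, num_experts = 8, 4
x = rng.standard_normal(d)
gate_w = rng.standard_normal((d, num_experts))       # router weights
expert_ws = rng.standard_normal((num_experts, d, d)) # one weight matrix per expert
y = moe_forward(x, gate_w, expert_ws)                # only 2 of the 4 experts ran
```

The same mechanism, applied at scale with many experts per layer, is how a 671B-parameter model can run with only 37B parameters active per token.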
DeepSeek-V3 introduces significant enhancements in generation speed through algorithmic and engineering innovations. The model's text generation speed has increased from 20 to 60 tokens per second (TPS), a threefold improvement over the V2.5 model, resulting in a noticeably smoother and more responsive user experience.
With the launch of DeepSeek-V3, DeepSeek AI has adjusted its API service pricing to reflect the model's enhanced capabilities. The standard pricing is set at:
To encourage adoption, DeepSeek AI offers a promotional pricing period of 45 days, valid until February 8, 2025. During this period, the API service will be available at the previous rate:
This offer applies to both existing and new users registered before the expiration date. For more details on API services provided by DeepSeek, refer to the DeepSeek API documentation.
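For readers getting started with the API, a minimal request can be assembled with the standard library alone. The sketch below assumes DeepSeek's OpenAI-compatible chat-completions format; the endpoint URL and `deepseek-chat` model name match DeepSeek's public documentation at the time of writing, but verify both against the current API docs before use.

```python
# Minimal sketch of a DeepSeek-V3 chat API call (OpenAI-compatible format).
# Endpoint URL and model name are taken from DeepSeek's public docs and
# should be verified against the current API documentation.
import json
import urllib.request

API_URL = "https://api.deepseek.com/chat/completions"

def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Assemble a chat-completions request addressed to the deepseek-chat model."""
    payload = {
        "model": "deepseek-chat",  # routes to DeepSeek-V3
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
    )

# To actually send the request (requires a valid API key and network access):
# req = build_request("Hello, DeepSeek-V3!", "YOUR_API_KEY")
# with urllib.request.urlopen(req) as resp:
#     print(json.loads(resp.read())["choices"][0]["message"]["content"])
```

Because the format is OpenAI-compatible, existing OpenAI client libraries can also be pointed at the DeepSeek endpoint by changing the base URL and API key.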
DeepSeek-V3 is trained in FP8, and its FP8 weights are natively open-sourced. With support from the open-source community, projects such as SGLang and LMDeploy added native FP8 inference for the V3 model from day one, while TensorRT-LLM and MindIE offer BF16 inference. To further aid community adaptation and broaden potential applications, DeepSeek AI also provides FP8-to-BF16 conversion scripts.
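Conceptually, converting low-precision weights back to a higher-precision format is a dequantization step: quantized blocks are stored together with per-block scaling factors, and conversion multiplies each block by its scale before casting to the target type. The toy sketch below illustrates this idea with plain float32 arrays; the block size, function names, and scaling scheme are illustrative and are not DeepSeek's actual conversion script.

```python
# Toy illustration of block-wise dequantization, the idea behind an
# FP8-to-BF16 weight conversion: quantized blocks plus per-block scales
# are expanded back to a higher-precision format. Block size and scaling
# scheme are illustrative, not DeepSeek's actual implementation.
import numpy as np

BLOCK = 4  # toy block size; production schemes use much larger blocks

def quantize(w):
    """Scale each block into [-1, 1], keeping one scale factor per block."""
    q = np.empty_like(w, dtype=np.float32)
    scales = []
    for i in range(0, len(w), BLOCK):
        block = w[i:i + BLOCK]
        s = float(np.abs(block).max()) or 1.0  # avoid division by zero
        scales.append(s)
        q[i:i + BLOCK] = block / s             # stored in low precision in practice
    return q, np.array(scales, dtype=np.float32)

def dequantize(q, scales):
    """Multiply each block by its stored scale to recover full-range weights."""
    out = q.copy()
    for i in range(0, len(q), BLOCK):
        out[i:i + BLOCK] *= scales[i // BLOCK]
    return out  # a real pipeline would cast the result to bfloat16 here

w = np.array([0.5, -2.0, 0.1, 3.0, -0.7, 0.2, 1.5, -1.1], dtype=np.float32)
q, scales = quantize(w)
restored = dequantize(q, scales)  # round-trips back to the original values
```

In the real conversion, the low-precision side is FP8 rather than rescaled float32, and the output is cast to BF16, but the block-plus-scale structure is the same.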
Model weights and additional local deployment details can be found on Hugging Face.
DeepSeek AI continues to demonstrate its dedication to the open-source movement, aiming to make advanced AI technology accessible to everyone. By sharing its latest advancements in model pre-training, DeepSeek AI is actively contributing to narrowing the performance gap between open-source and closed-source models.
The release of DeepSeek-V3 marks a new beginning, with plans to enrich the model with deeper reasoning capabilities and multi-modality features. DeepSeek AI remains committed to sharing its ongoing explorations and advancements with the broader AI community, helping to drive further innovation.
For more information and updates, follow DeepSeek AI through the following channels:
The release of DeepSeek-V3 represents a significant step forward in the field of open-source AI. With its impressive performance, increased generation speed, and accessible pricing, DeepSeek-V3 empowers developers and researchers to explore new frontiers in AI. DeepSeek AI's commitment to openness and collaboration paves the way for continued innovation and democratization within the AI landscape.