DeepSeek-V2.5: A New Open-Source Model Combining General and Coding Capabilities | DeepSeek API Docs

DeepSeek-V2.5: A Powerful Open-Source Model Combining General and Coding Prowess

DeepSeek has officially launched DeepSeek-V2.5, a next-generation open-source model that seamlessly blends general conversational capabilities with robust code processing power. This innovative model is designed to offer a more streamlined, intelligent, and efficient user experience for a wide range of applications, marking a significant leap forward in the field of AI.

What is DeepSeek-V2.5?

DeepSeek-V2.5 represents a fusion of DeepSeek-V2-0628 and DeepSeek-Coder-V2-0724, combining the strengths of both models. This powerful combination results in a model that not only retains the general conversational abilities of the Chat model but also maintains the robust code processing capabilities of the Coder model. Furthermore, DeepSeek-V2.5 is meticulously aligned with human preferences, making it more intuitive and user-friendly.

The model is available on both the web and API, featuring backward-compatible API endpoints accessible through deepseek-coder or deepseek-chat. Key features such as Function Calling, FIM (Fill-In-the-Middle) completion, and JSON output remain unchanged, ensuring a smooth transition for existing users.

Key Improvements and Capabilities

Enhanced Writing and Instruction-Following: Significant improvements have been implemented in DeepSeek-V2.5, specifically targeting writing tasks and instruction comprehension.
General Conversational Abilities: Retains the natural language processing prowess of previous Chat models.
Robust Code Processing: Inherits and enhances the code processing capabilities of the Coder model line, enabling efficient code generation, analysis, and understanding. Learn more about coding with large language models.
Human Preference Alignment: Enhanced to better understand and align with human preferences, delivering more relevant and helpful responses.

The Evolution of DeepSeek-V2.5

DeepSeek's commitment to model refinement has been a driving force behind the development of DeepSeek-V2.5. Here's a look at its evolution:

June Upgrade: DeepSeek-V2-Chat's base model was replaced with Coder-V2-base, greatly enhancing its code generation and reasoning capabilities. This upgrade led to the release of DeepSeek-V2-Chat-0628.
Coder Model Launch: Shortly after, DeepSeek-Coder-V2-0724 was launched with improved general capabilities through alignment optimization.
Model Fusion: Ultimately, the Chat and Coder models were successfully merged to create the new, unified DeepSeek-V2.5, offering the best of both worlds.

It's recommended to adjust system prompts and temperature settings for optimal results due to the significant updates in this version. Understanding and tweaking the temperature parameter can help in achieving desired outputs.

Performance Benchmarks and Evaluations

General Capabilities

DeepSeek-V2.5 has been rigorously evaluated using industry-standard test sets, consistently outperforming both DeepSeek-V2-0628 and DeepSeek-Coder-V2-0724 on most benchmarks.

DeepSeek-V2.5 General Capability Evaluation

Internal Chinese evaluations reveal significant improvements in win rates against GPT-4o mini and ChatGPT-4o-latest, particularly in tasks like content creation and Q&A, enhancing overall user satisfaction.

DeepSeek-V2.5 Internal Evaluation

Safety Evaluation

DeepSeek has prioritized safety and helpfulness throughout the development process. DeepSeek-V2.5 features clearly defined safety boundaries, improving resistance to jailbreak attacks while reducing overgeneralization of safety policies to normal queries.

Model	Overall Safety Score (higher is better)	Safety Spillover Rate (lower is better)
DeepSeek-V2-0628	74.4%	11.3%
DeepSeek-V2.5	82.6%	4.6%

Code Capabilities

DeepSeek-V2.5 retains the robust code capabilities of DeepSeek-Coder-V2-0724, demonstrating notable improvements in the HumanEval Python and LiveCodeBench tests. While DeepSeek-Coder-V2-0724 slightly outperformed in HumanEval Multilingual and Aider tests, both versions showed room for improvement in the SWE-verified test.

Additionally, the DS-FIM-Eval internal test set showcased a 5.1% improvement in the FIM completion task, enhancing the plugin completion experience. DeepSeek-V2.5 has also been optimized for common coding scenarios to improve user experience. In the DS-Arena-Code internal subjective evaluation, DeepSeek-V2.5 achieved a significant win rate increase against competitors.

DeepSeek-V2.5 Code Capability Evaluation

DeepSeek-V2.5 Code Capability Subjective Evaluation

Open-Source Availability

DeepSeek-V2.5 is now available as an open-source model on Hugging Face. You can explore and download it here. Make sure you understand token usage to effectively utilize the model.

Conclusion

DeepSeek-V2.5 marks a significant advancement in AI technology, seamlessly integrating general conversational capabilities with powerful coding functionalities. Its open-source availability promotes collaboration and innovation within the AI community. This model promises a more versatile and efficient user experience, paving the way for new possibilities in both general AI applications and specialized coding tasks. Stay updated with the latest news on DeepSeek and explore related models like DeepSeek-R1 for supercharged reasoning capabilities.

. . .

New Report Analyzes Long History of NASA Support for Commercial ...

Dec 19, 2024 ... Throughout its history, NASA has supported the development of the commercial space sector, not only leading the way in areas such as satellite ...

Internet Speed Test by Speedcheck - Test my internet speed

An internet speed test measures the connection speed and quality of your connected device to the internet. It does so by running multiple consecutive tests that ...

Unlocking the Power of Kimi AI Chatbot in the Chinese Market

Apr 14, 2024 ... Kimi AI Chatbot has emerged as a transformative force, reshaping industry dynamics and redefining standards.

Free AI Detector | GPT-4, GPT-3, & ChatGPT AI Checker

A New Standard of AI Detection by the Leading Writing Assistant. Transparent, responsible AI use without all the guesswork: Grammarly's AI content detector and ...

AI Detector by Copyleaks - Detect AI Text With Confidence

The AI Content Detector is trained to recognize human writing patterns, flagging text as potential generative AI when it detects deviations from known human ...