The large language model (LLM) arena is heating up again, with DeepSeek, known as the "price butcher," initiating another round of price cuts. This move raises the question: will other players follow suit, and what does this mean for the future of LLMs?
DeepSeek recently announced a cut in its API prices to 0.1 yuan per million input tokens and 2 yuan per million output tokens, another order-of-magnitude drop in LLM API pricing.
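To make those figures concrete, here is a minimal cost sketch in Python that uses only the prices quoted above; the token counts in the example are hypothetical.

```python
# Minimal cost sketch based on the prices quoted above:
# 0.1 yuan per million input tokens, 2 yuan per million output tokens.
INPUT_PRICE_PER_M = 0.1   # yuan per 1M input tokens
OUTPUT_PRICE_PER_M = 2.0  # yuan per 1M output tokens

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one API call in yuan at the quoted rates."""
    return (input_tokens / 1_000_000) * INPUT_PRICE_PER_M \
         + (output_tokens / 1_000_000) * OUTPUT_PRICE_PER_M

# Hypothetical chat turn: 20k tokens of context in, 1k tokens out.
print(f"{request_cost(20_000, 1_000):.4f} yuan")  # 0.0040 yuan
```

At these rates, even a fairly long-context request costs well under one fen.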
DeepSeek attributes this price reduction to its innovative use of contextual hard drive caching. The company explains that a large share of the input in typical LLM API usage is repeated across requests; in multi-turn conversations, for example, each new request resends the entire preceding history.
How Contextual Hard Drive Caching Works:
DeepSeek claims to be the first LLM provider globally to apply hard drive caching at scale in its API service. This is made possible by the Multi-head Latent Attention (MLA) architecture introduced in DeepSeek-V2, which improves model performance while sharply compressing the context KV cache, cutting storage and bandwidth requirements enough to make caching on low-cost hard drives practical.
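As a rough illustration of what this looks like from the caller's side, here is a sketch against DeepSeek's OpenAI-compatible API. It assumes the cache is applied automatically to a repeated prompt prefix and that cache usage is reported in fields along the lines of prompt_cache_hit_tokens and prompt_cache_miss_tokens; treat the endpoint details and field names as assumptions to verify against the current documentation.

```python
# Sketch: reuse a long, identical prompt prefix across calls so the server
# can serve it from its on-disk cache. Assumes an OpenAI-compatible endpoint
# and cache-usage fields whose exact names should be checked in the docs.
from openai import OpenAI

client = OpenAI(api_key="YOUR_API_KEY", base_url="https://api.deepseek.com")

# Long, stable prefix (system prompt + reference material) that stays
# byte-identical across requests; this is the cacheable part.
LONG_PREFIX = "You are a support agent. Product manual: ..."  # placeholder

def ask(question: str) -> str:
    resp = client.chat.completions.create(
        model="deepseek-chat",
        messages=[
            {"role": "system", "content": LONG_PREFIX},
            {"role": "user", "content": question},
        ],
    )
    usage = resp.usage
    # On repeat calls, the shared prefix should register as cache hits,
    # which is the portion billed at the lower input price.
    print("cache hit:", getattr(usage, "prompt_cache_hit_tokens", None),
          "cache miss:", getattr(usage, "prompt_cache_miss_tokens", None))
    return resp.choices[0].message.content

ask("How do I reset the device?")      # first call: mostly cache misses
ask("What does error code 42 mean?")   # same prefix: mostly cache hits
```

The design point, as described above, is that the caller does not manage the cache explicitly; keeping the shared prefix identical across requests is enough for the server to reuse it.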
This isn't DeepSeek's first foray into price wars. Since May, the company has been a disruptor in API pricing.
That initial price reduction triggered a wave of responses from industry players such as Zhipu AI, Volcano Engine, Baidu, Tencent, and Alibaba Cloud, all of which announced cuts of their own. Alibaba Cloud's Tongyi Qianwen core model Qwen-Long saw a staggering 97% price decrease, landing at 0.0005 yuan per thousand tokens. Baidu and Tencent went further and made some models free. Internationally, OpenAI released GPT-4o with free access and roughly halved API call prices.
Volcano Engine's Doubao general model pro-32k was priced at only 0.0008 yuan/thousand tokens, a 99.3% reduction compared to the industry average of 0.12 yuan/thousand tokens, pushing the market into the "cent era." Volcano Engine's president, Tan Dai, stated that reducing costs is crucial for accelerating the transition to a "value creation stage."
The driving force behind these price reductions is to lower the barrier to entry for businesses to adopt and innovate with LLMs. A Volcano Engine insider noted that the lack of widespread enterprise application of LLMs necessitates lower prices to encourage adoption.
However, the financial sustainability of relying solely on API sales remains a concern. "No large model company survives by selling APIs alone," said one financial advisor (FA) familiar with the LLM industry. Cheetah Mobile (猎豹移动) Chairman and CEO Fu Sheng believes that steep price cuts will force LLM startups to explore new business models, while large companies with cloud businesses can afford to lower prices in order to attract more customers.
Unlike the previous wave of price cuts, DeepSeek's latest move has not yet drawn responses from other major LLM companies. Still, the continued downward trend in prices suggests that the democratization of LLMs is underway and that the vertical application ecosystem is poised for further growth.
The LLM landscape is evolving rapidly, and these price wars are a testament to the increasing competition and the push for wider adoption. As LLMs become more accessible, we can expect to see even more innovative applications emerge across various sectors.