DeepSeek is a platform offering powerful AI models, including deepseek-chat (now upgraded to DeepSeek-V3) and deepseek-reasoner (now DeepSeek-R1). Understanding the associated pricing is crucial for effective utilization and cost management. This article breaks down the DeepSeek API's pricing structure, token usage, and important deduction rules.
DeepSeek API employs a token-based pricing model, common among many large language models (LLMs). You are billed based on the number of tokens processed by the model, both for input and output.
Here's a breakdown of the pricing for the deepseek-chat and deepseek-reasoner models. Note that prices are listed per 1 million tokens.
USD Pricing:
Model | Context Length | Max CoT Tokens | Max Output Tokens | 1M Tokens Input Price (Cache Hit) | 1M Tokens Input Price (Cache Miss) | 1M Tokens Output Price |
---|---|---|---|---|---|---|
deepseek-chat | 64K | - | 8K | $0.07 | $0.27 | $1.10 |
deepseek-reasoner | 64K | 32K | 8K | $0.14 | $0.55 | $2.19 |
CNY Pricing:
Model | Context Length | Max CoT Tokens | Max Output Tokens | 1M Tokens Input Price (Cache Hit) | 1M Tokens Input Price (Cache Miss) | 1M Tokens Output Price |
---|---|---|---|---|---|---|
deepseek-chat | 64K | - | 8K | ¥0.5 | ¥2 | ¥8 |
deepseek-reasoner | 64K | 32K | 8K | ¥1 | ¥4 | ¥16 |
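To illustrate how much the cache hit and cache miss columns matter, here is a minimal sketch that blends the two USD input prices from the table for a given cache hit ratio. The function name `blended_input_price` and the `hit_ratio` parameter are hypothetical, introduced only for this example; they are not part of the DeepSeek API.

```python
# USD input prices per 1M tokens, copied from the USD pricing table above.
INPUT_PRICE_USD = {
    "deepseek-chat": {"cache_hit": 0.07, "cache_miss": 0.27},
    "deepseek-reasoner": {"cache_hit": 0.14, "cache_miss": 0.55},
}


def blended_input_price(model: str, hit_ratio: float) -> float:
    """Average USD price per 1M input tokens for a cache hit ratio in [0, 1].

    Hypothetical helper for illustration; not a DeepSeek API function.
    """
    if not 0.0 <= hit_ratio <= 1.0:
        raise ValueError("hit_ratio must be between 0 and 1")
    prices = INPUT_PRICE_USD[model]
    return hit_ratio * prices["cache_hit"] + (1.0 - hit_ratio) * prices["cache_miss"]


if __name__ == "__main__":
    for ratio in (0.0, 0.5, 1.0):
        price = blended_input_price("deepseek-chat", ratio)
        print(f"deepseek-chat, {ratio:.0%} cache hits: ${price:.3f} per 1M input tokens")
```

With these table values, a workload where half of the deepseek-chat input tokens hit the cache pays roughly $0.17 per 1M input tokens instead of $0.27.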
Key Considerations:
- The deepseek-chat model has been upgraded to DeepSeek-V3, and deepseek-reasoner now utilizes the DeepSeek-R1 model.
- deepseek-reasoner uses CoT (Chain of Thought), which involves reasoning steps before providing the final answer. See the Reasoning Model documentation for more details.
- If max_tokens is not specified, the default maximum output length is 4K tokens. Adjust max_tokens to allow for longer outputs when needed.
- The output token count for deepseek-reasoner includes both the CoT tokens and the final answer tokens, which are priced the same.

The cost calculation is straightforward: cost = number of tokens × price per token, where the per-token price is the listed price divided by 1 million.
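As a rough illustration of the calculation, the sketch below multiplies token counts by the USD prices from the pricing table and counts CoT tokens as output tokens, as described for deepseek-reasoner. The function `estimate_cost_usd` and its parameter names are assumptions made for this example, not part of any DeepSeek SDK, and the token counts in the usage example are made up.

```python
# USD prices per 1M tokens, copied from the pricing table in this article.
PRICES_USD = {
    "deepseek-chat": {"cache_hit": 0.07, "cache_miss": 0.27, "output": 1.10},
    "deepseek-reasoner": {"cache_hit": 0.14, "cache_miss": 0.55, "output": 2.19},
}
TOKENS_PER_PRICE_UNIT = 1_000_000  # prices are quoted per 1M tokens


def estimate_cost_usd(model, cache_hit_tokens, cache_miss_tokens,
                      answer_tokens, cot_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    CoT tokens are billed at the output price, so they are simply added
    to the final-answer tokens.
    """
    prices = PRICES_USD[model]
    output_tokens = answer_tokens + cot_tokens
    total = (cache_hit_tokens * prices["cache_hit"]
             + cache_miss_tokens * prices["cache_miss"]
             + output_tokens * prices["output"])
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Made-up request: 2,000 uncached input tokens, 1,500 CoT tokens,
    # and a 500-token final answer on deepseek-reasoner.
    cost = estimate_cost_usd("deepseek-reasoner", 0, 2_000, 500, cot_tokens=1_500)
    print(f"Estimated cost: ${cost:.5f}")
```

In this made-up request the 1,500 CoT tokens account for most of the bill, which is why reasoning-heavy prompts can cost noticeably more than the length of the final answer suggests.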
The charges are directly deducted from your topped-up balance. If you have both topped-up and granted balances available, the granted balance is used first.
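The deduction order can be sketched as follows. This is only a model of the rule stated above (granted balance first), not DeepSeek's actual billing code: the `Balance` class and `deduct` function are hypothetical, and both the spill-over into the topped-up balance and the error on insufficient funds are assumptions.

```python
from dataclasses import dataclass


@dataclass
class Balance:
    """Hypothetical account balance, split the way the article describes."""
    granted: float
    topped_up: float


def deduct(balance: Balance, charge: float) -> Balance:
    """Return the balance after a charge, using the granted balance first.

    Assumptions: a charge larger than the granted balance spills over into
    the topped-up balance, and an unaffordable charge raises an error.
    """
    if charge < 0:
        raise ValueError("charge must be non-negative")
    if charge > balance.granted + balance.topped_up:
        raise ValueError("insufficient balance")
    from_granted = min(balance.granted, charge)
    from_topped_up = charge - from_granted
    return Balance(balance.granted - from_granted,
                   balance.topped_up - from_topped_up)


if __name__ == "__main__":
    # A 1.50 charge first exhausts the 1.00 granted balance,
    # then takes the remaining 0.50 from the topped-up balance.
    print(deduct(Balance(granted=1.0, topped_up=5.0), 1.5))
```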
DeepSeek reserves the right to adjust product prices. It’s recommended to regularly check the Models & Pricing page for the most up-to-date pricing information and to top up your account based on your actual usage.
Use the max_tokens parameter to limit the length of the generated text and prevent unnecessary costs. Setting an appropriate temperature parameter is also critical.
By understanding the DeepSeek API's pricing model and deduction rules, you can effectively manage your expenses while leveraging the power of its AI models. Regular monitoring and optimization are key to maximizing your return on investment.