DeepSeek / Deepseek V3 (FP8)
Released: 12/26/2024texttext
Input: $1.25 / Output: $1.25
DeepSeek V3 is a Mixture-of-Experts LLM that excels in efficient processing and advanced reasoning capabilities. It demonstrates strong performance across various tasks, including coding, mathematical computation, and complex problem-solving, while requiring significantly less computational resources compared to models of similar scale.
Some other noteworthy features of DeepSeek V3 include its multi-token prediction capability and its ability to handle extended context lengths.
Metric | Value |
---|---|
Parameter Count | 671 billion |
Mixture of Experts | Yes |
Active Parameter Count | 37 billion |
Context Length | 128,000 tokens |
Multilingual | Yes |
Quantized* | Yes |
Quantization* | FP8 |
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.
DeepSeek models available on Oxen.ai
Modality | Price (1M tokens) | ||||
---|---|---|---|---|---|
Model | Inference provider | Input | Output | Input | Output |
Fireworks AI | text | text | $8.00 | $8.00 | |
Together.ai | text | text | $7.00 | $7.00 | |
Fireworks AI | text | text | $0.90 | $0.90 | |
Together.ai | text | text | $1.25 | $1.25 |