DeepSeek / Deepseek R1
Released: 1/20/2025Deepseek R1 is a 671B parameter LLM designed for advanced reasoning and complex problem-solving.
It excels in performing tasks across multiple domains including mathematics, coding, and real-time decision-making with a focus on delivering reasoning capabilities comparable to OpenAI-o1. The model's massive parameter count enables it to capture intricate patterns and features in data, providing exceptional contextual comprehension for complex inputs.
Some other noteworthy features of Deepseek R1 include its Multi-Layer Attention (MLA) mechanism that effectively processes and analyzes information, and its incorporation of cold-start data before reinforcement learning to enhance performance.
Metric | Value |
---|---|
Parameter Count | 671 billion |
Mixture of Experts | Yes |
Active Parameter Count | Unknown |
Context Length | Unknown |
Multilingual | Unknown |
Quantized* | Unknown |
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.
DeepSeek models available on Oxen.ai
Modality | Price (1M tokens) | ||||
---|---|---|---|---|---|
Model | Inference provider | Input | Output | Input | Output |
![]() | text | text | $3.00 | $8.00 | |
text | text | $0.55 | $2.19 | ||
text | text | $7.00 | $7.00 | ||
![]() | text | text | $0.59 | $0.79 | |
![]() | text | text | $0.75 | $3.00 | |
text | text | $0.27 | $1.10 | ||
text | text | $1.25 | $1.25 |