Run Deepseek R1 on your data

Deepseek R1 is a 671B parameter LLM designed for advanced reasoning and complex problem-solving.

It excels in performing tasks across multiple domains including mathematics, coding, and real-time decision-making with a focus on delivering reasoning capabilities comparable to OpenAI-o1. The model's massive parameter count enables it to capture intricate patterns and features in data, providing exceptional contextual comprehension for complex inputs.

Some other noteworthy features of Deepseek R1 include its Multi-Layer Attention (MLA) mechanism that effectively processes and analyzes information, and its incorporation of cold-start data before reinforcement learning to enhance performance.

Metric	Value
Parameter Count	671 billion
Mixture of Experts	Yes
Active Parameter Count	Unknown
Context Length	Unknown
Multilingual	Unknown
Quantized*	Unknown

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

		Modality		Price (1M tokens)
Model	Inference provider	Input	Output	Input	Output
Deepseek R1	Fireworks AI	text	text	$3.00	$8.00
Deepseek R1	DeepSeek	text	text	$0.55	$2.19
Deepseek R1 (FP8)	Together.ai	text	text	$7.00	$7.00
Deepseek R1 Distill Llama 70B	Groq	text	text	$0.59	$0.79
Deepseek V3	Fireworks AI	text	text	$0.75	$3.00
Deepseek V3	DeepSeek	text	text	$0.27	$1.10
Deepseek V3 (FP8)	Together.ai	text	text	$1.25	$1.25

DeepSeek / Deepseek R1

DeepSeek models available on Oxen.ai