Run Llama 3.1 8B Instruct Turbo on your data

Meta-Llama-3.1-8B-Instruct-Turbo is an 8 billion parameter LLM designed for multilingual dialogue use cases. It excels in providing fast and accurate responses across multiple languages, making it suitable for applications requiring quick turnaround times and multilingual support.

Some other noteworthy features of Meta-Llama-3.1-8B-Instruct-Turbo include its optimized performance for inference and its ability to maintain full accuracy compared to the reference implementation.

Metric	Value
Parameter Count	8 billion
Mixture of Experts	No
Context Length	128,000 tokens
Multilingual	Yes
Quantized*	Yes
Quantization*	Unknown

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

	Modality	Price (1M tokens)
Llama 3.1 405B Instruct	Fireworks AI	text	text	$3.00	$3.00
Llama 3.1 405B Instruct Turbo	Together.ai	text	text	$3.50	$3.50
Llama 3.1 70B Instruct	Fireworks AI	text	text	$0.90	$0.90
Llama 3.1 70B Instruct Turbo	Together.ai	text	text	$0.88	$0.88
Llama 3.1 8B	Groq	text	text	$0.05	$0.08
Llama 3.1 8B Instruct	Fireworks AI	text	text	$0.20	$0.20
Llama 3.1 8B Instruct Turbo	Together.ai	text	text	$0.18	$0.18
Llama 3.1 Nemotron 70B Instruct	Together.ai	text	text	$0.90	$0.90
Llama 3.2 11B Vision	Groq	image	text	$0.18	$0.18
Llama 3.2 1B	Groq	text	text	$0.04	$0.04
Llama 3.2 3B	Groq	text	text	$0.06	$0.06
Llama 3.2 3B Instruct Turbo	Together.ai	text	text	$0.06	$0.06
Llama 3.2 90B Vision (Preview)	Groq	image	text	$0.90	$0.90
Llama 3.3 70B Instruct	Fireworks AI	text	text	$0.90	$0.90
Llama 3.3 70B Instruct Turbo	Together.ai	text	text	$0.88	$0.88
Llama 3.3 70B Speculative Decoding	Groq	text	text	$0.59	$0.59
Llama 3.3 70B Versatile 128k	Groq	text	text	$0.59	$0.79

Modality

Price (1M tokens)

Model

Inference provider

Input

Output

Input

Output

Llama 3.1 405B Instruct

Fireworks AI