Meta / Llama 3.2 3B Instruct Turbo
Released: 9/25/2024texttext
Input: $0.06 / Output: $0.06
Llama-3.2-3B-Instruct-Turbo is a Small Language Model (SLM) designed for fast inference and lightweight applications. It excels in providing quick responses for tasks such as dialogue, summarization, and general text generation while maintaining a balance between performance and efficiency.
Metric | Value |
---|---|
Parameter Count | 3 billion |
Mixture of Experts | No |
Context Length | 128,000 tokens |
Multilingual | Yes |
Quantized* | No |
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.
Meta models available on Oxen.ai
Modality | Price (1M tokens) | ||||
---|---|---|---|---|---|
Model | Inference provider | Input | Output | Input | Output |
Fireworks AI | text | text | $3.00 | $3.00 | |
Together.ai | text | text | $3.50 | $3.50 | |
Fireworks AI | text | text | $0.90 | $0.90 | |
Together.ai | text | text | $0.88 | $0.88 | |
Groq | text | text | $0.05 | $0.08 | |
Fireworks AI | text | text | $0.20 | $0.20 | |
Together.ai | text | text | $0.18 | $0.18 | |
Together.ai | text | text | $0.90 | $0.90 | |
Groq | image | text | $0.18 | $0.18 | |
Groq | text | text | $0.04 | $0.04 | |
Groq | text | text | $0.06 | $0.06 | |
Together.ai | text | text | $0.06 | $0.06 | |
Groq | image | text | $0.90 | $0.90 | |
Fireworks AI | text | text | $0.90 | $0.90 | |
Together.ai | text | text | $0.88 | $0.88 | |
Groq | text | text | $0.59 | $0.59 | |
Groq | text | text | $0.59 | $0.79 |