Meta / Llama 3.2 3B
Released: 9/25/2024texttext
Input: $0.06 / Output: $0.06
Llama 3.2 3B is a Small Language Model (SLM) designed for applications requiring low-latency inferencing and limited computational resources. It excels in text summarization, classification, and language translation tasks, making it suitable for mobile AI-powered writing assistants and customer service applications.
Some other noteworthy use cases of Llama 3.2 3B include personal information management and multilingual knowledge retrieval.
Metric | Value |
---|---|
Parameter Count | 3 billion |
Mixture of Experts | No |
Context Length | 128,000 tokens |
Multilingual | Yes |
Quantized* | Unknown |
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.
Meta models available on Oxen.ai
Modality | Price (1M tokens) | ||||
---|---|---|---|---|---|
Model | Inference provider | Input | Output | Input | Output |
Fireworks AI | text | text | $3.00 | $3.00 | |
Together.ai | text | text | $3.50 | $3.50 | |
Fireworks AI | text | text | $0.90 | $0.90 | |
Together.ai | text | text | $0.88 | $0.88 | |
Groq | text | text | $0.05 | $0.08 | |
Fireworks AI | text | text | $0.20 | $0.20 | |
Together.ai | text | text | $0.18 | $0.18 | |
Together.ai | text | text | $0.90 | $0.90 | |
Groq | image | text | $0.18 | $0.18 | |
Groq | text | text | $0.04 | $0.04 | |
Groq | text | text | $0.06 | $0.06 | |
Together.ai | text | text | $0.06 | $0.06 | |
Groq | image | text | $0.90 | $0.90 | |
Fireworks AI | text | text | $0.90 | $0.90 | |
Together.ai | text | text | $0.88 | $0.88 | |
Groq | text | text | $0.59 | $0.59 | |
Groq | text | text | $0.59 | $0.79 |