Meta / Llama 3.3 70B Speculative Decoding
Released: 12/6/2024texttext
Input: $0.59 / Output: $0.59
Llama 3.3 70B Speculative Decoding is an LLM that excels in a wide variety of tasks, such as question answering, reasoning, and code generation.
Some noteworthy use cases of Llama 3.3 70B Speculative Decoding include synthetic data generation and judging outputs from smaller models.
Metric | Value |
---|---|
Parameter Count | 70 billion |
Mixture of Experts | No |
Context Length | 130,000 tokens |
Multilingual | Yes |
Quantized* | Unknown |
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.
Meta models available on Oxen.ai
Modality | Price (1M tokens) | ||||
---|---|---|---|---|---|
Model | Inference provider | Input | Output | Input | Output |
![]() | text | text | $3.00 | $3.00 | |
text | text | $3.50 | $3.50 | ||
![]() | text | text | $0.90 | $0.90 | |
text | text | $0.88 | $0.88 | ||
![]() | text | text | $0.05 | $0.08 | |
![]() | text | text | $0.20 | $0.20 | |
text | text | $0.18 | $0.18 | ||
text | text | $0.90 | $0.90 | ||
![]() | image | text | $0.18 | $0.18 | |
![]() | text | text | $0.04 | $0.04 | |
![]() | text | text | $0.06 | $0.06 | |
text | text | $0.06 | $0.06 | ||
![]() | image | text | $0.90 | $0.90 | |
![]() | text | text | $0.90 | $0.90 | |
text | text | $0.88 | $0.88 | ||
![]() | text | text | $0.59 | $0.59 | |
![]() | text | text | $0.59 | $0.79 |