Meta / Llama 3.2 90B Vision (Preview)
Released: 9/25/2024imagetext
Input: $0.90 / Output: $0.90
Llama 3.2 90B Vision (Preview) is a Multimodal LLM designed for visual question answering, image captioning, and document visual question answering. It excels in general knowledge, long-form text generation, multilingual translation, coding, math, and advanced reasoning.
Some noteworthy use cases of Llama 3.2 90B Vision (Preview) include image-text retrieval, visual grounding, and visual reasoning.
Metric | Value |
---|---|
Parameter Count | 90 billion |
Mixture of Experts | Unknown |
Context Length | 128,000 tokens |
Multilingual | Yes (text-only) |
Quantized* | Unknown |
*Quantization is specific to the inference provider and may vary by provider.
Meta models available on Oxen.ai
Modality | Price (1M tokens) | ||||
---|---|---|---|---|---|
Model | Inference provider | Input | Output | Input | Output |
![]() | text | text | $3.00 | $3.00 | |
text | text | $3.50 | $3.50 | ||
![]() | text | text | $0.90 | $0.90 | |
text | text | $0.88 | $0.88 | ||
![]() | text | text | $0.05 | $0.08 | |
![]() | text | text | $0.20 | $0.20 | |
text | text | $0.18 | $0.18 | ||
text | text | $0.90 | $0.90 | ||
![]() | image | text | $0.18 | $0.18 | |
![]() | text | text | $0.04 | $0.04 | |
![]() | text | text | $0.06 | $0.06 | |
text | text | $0.06 | $0.06 | ||
![]() | image | text | $0.90 | $0.90 | |
![]() | text | text | $0.90 | $0.90 | |
text | text | $0.88 | $0.88 | ||
![]() | text | text | $0.59 | $0.59 | |
![]() | text | text | $0.59 | $0.79 |