Meta / Llama 3.2 90B Vision (Preview)
Released: 9/25/2024
Modality: image (input), text (output)
Price (1M tokens): Input $0.90 / Output $0.90

Llama 3.2 90B Vision (Preview) is a multimodal LLM designed for visual question answering, image captioning, and document visual question answering. It also performs well on general knowledge, long-form text generation, multilingual translation, coding, math, and advanced reasoning.

Noteworthy use cases for Llama 3.2 90B Vision (Preview) include image-text retrieval, visual grounding, and visual reasoning.

Metric               Value
Parameter Count      90 billion
Mixture of Experts   Unknown
Context Length       128,000 tokens
Multilingual         Yes (text-only)
Quantized*           Unknown

*Quantization is specific to the inference provider and may vary by provider.

Meta models available on Oxen.ai
Inference provider   Input modality   Output modality   Input price (1M tokens)   Output price (1M tokens)
Fireworks AI         text             text              $3.00                     $3.00
Together.ai          text             text              $3.50                     $3.50
Fireworks AI         text             text              $0.90                     $0.90
Together.ai          text             text              $0.88                     $0.88
Groq                 text             text              $0.05                     $0.08
Fireworks AI         text             text              $0.20                     $0.20
Together.ai          text             text              $0.18                     $0.18
Together.ai          text             text              $0.90                     $0.90
Groq                 image            text              $0.18                     $0.18
Groq                 text             text              $0.04                     $0.04
Groq                 text             text              $0.06                     $0.06
Together.ai          text             text              $0.06                     $0.06
Groq                 image            text              $0.90                     $0.90
Fireworks AI         text             text              $0.90                     $0.90
Together.ai          text             text              $0.88                     $0.88
Groq                 text             text              $0.59                     $0.59
Groq                 text             text              $0.59                     $0.79
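
As a rough illustration of how per-token pricing turns into a per-request cost, the sketch below uses the $0.90 input / $0.90 output rate listed for Llama 3.2 90B Vision (Preview) and assumes that rate is quoted per 1M tokens, as in the pricing table. The function name `estimate_cost` and its parameters are hypothetical and not part of any Oxen.ai API. How image inputs are converted to billable tokens varies by provider, so the sketch assumes the token counts are already known.

```python
from decimal import Decimal

# Assumption: prices are quoted in USD per 1,000,000 tokens.
TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens,
                  input_price="0.90", output_price="0.90"):
    """Estimate the USD cost of one request (hypothetical helper).

    Prices are passed as strings so Decimal keeps them exact.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * Decimal(input_price) / TOKENS_PER_PRICE_UNIT
    output_cost = Decimal(output_tokens) * Decimal(output_price) / TOKENS_PER_PRICE_UNIT
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a prompt near the 128,000-token context limit plus a short reply.
    print(estimate_cost(120_000, 2_000))
```

Passing a different `input_price` and `output_price` (for example "0.59" and "0.79") gives the same estimate for any other row in the table.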