Run Models on Your Data

Choose the right model, get to the perfect prompt, kick off the data flywheel.
Oxen makes it easy to improve your use of state of the art AI.

36 models, from 7 providers. New models added every week.


mistral
Mistral AI
Ministral 3B
High quality edge model with only 3B parameters. Test this model through our UI, and then download the weights and run it on the edge on edge devices.
Input: $0.04 / Output: $0.04
texttext
Ministral 8B
Powerful edge model with extremely high performance/price ratio.
Input: $0.10 / Output: $0.10
texttext
Pixtral 12B
A 12B model with image understanding capabilities in addition to text.
Input: $0.15 / Output: $0.15
texttext
Mistral Nemo
Multilingual open source model.
Input: $0.15 / Output: $0.15
texttext
Open Mistral 7B
Decoder-only transformer for chat based purposes.
Input: $0.25 / Output: $0.25
texttext
Mistral Small
Enterprise-grade 22B parameter small model with the lastest version v2 released September 2024.
Input: $0.20 / Output: $0.60
texttext
Codestral
Cutting-edge language model for coding.
Input: $0.20 / Output: $0.60
texttext
Open Mixtral 8x7B
Sparse Mixture-of-Experts (MoE) model with a total of 45 billion parameters. Best model overall regarding cost/performance trade-offs.
Input: $0.70 / Output: $0.70
texttext
Mistral Large
Reasoning model for high-complexity tasks with the lastest version v2 released July 2024.
Input: $2.00 / Output: $6.00
texttext
Open Mixtral 8x22B
Sparse Mixture-of-Experts (SMoE) model that uses only 39B active parameters out of 141B, offering unparalleled cost efficiency for its size.
Input: $2.00 / Output: $6.00
texttext

openai
OpenAI
Text Embedding 3 - Small
Smaller text embeddings with a dimension of
Input: $0.02 / Output: $0.02
textembeddings
Text Embedding 3 - Large
Larger text embeddings
Input: $0.13 / Output: $0.13
textembeddings
GPT-4o mini
Affordable and intelligent small model for fast, lightweight tasks
Input: $0.15 / Output: $0.60
texttextimagetext
GPT-4o
High-intelligence flagship model for complex, multistep tasks.
Input: $2.50 / Output: $10.00
texttextimagetext
o1 mini
More efficient than o1-preview with similar reasoning capabilities.
Input: $3.00 / Output: $12.00
texttext
o1 preview
Language model trained with reinforcement learning to perform complex reasoning.
Input: $15.00 / Output: $60.00
texttext

fireworks
Fireworks AI
Llama v3.1 8B Instruct
The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction tuned generative models with 8B
Input: $0.20 / Output: $0.20
texttext
Llama v3.1 70B Instruct
The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction tuned generative models with 70B
Input: $0.90 / Output: $0.90
texttext
Qwen 2.5 Coder 32B Instruct
Qwen 2.5 Coder is the latest series of code-specific Qwen large language models.
Input: $0.90 / Output: $0.90
texttext
Qwen2.5 72B Instruct
Qwen2.5 are a series of decoder-only language models developed by Qwen team, Alibaba
Input: $0.90 / Output: $0.90
texttext
Qwen2 VL 72B Instruct
The 72B variant of the latest iteration of Qwen-VL model from Alibaba, representing nearly a year of innovation.
Input: $0.90 / Output: $0.90
imagetext
Llama v3.1 405B Instruct
The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction tuned generative models with 405B
Input: $3.00 / Output: $3.00
texttext

togetherai
Together.ai
Meta Llama 3.1 8B Instruct Turbo
8 billion parameter instruct-tuned decoder-only handles complex language tasks with high accuracy and efficiency.
Input: $0.18 / Output: $0.18
texttext
Meta Llama 3.1 70B Instruct Turbo
70 billion parameter instruct-tuned decoder-only handles complex language tasks with high accuracy and efficiency.
Input: $0.88 / Output: $0.88
texttext
Llama 3.3 70B Instruct Turbo
More powerful than Llama 3.1 70B
Input: $0.88 / Output: $0.88
texttext
Meta Llama 3.1 405B Instruct Turbo
405 billion parameter instruct-tuned decoder-only handles complex language tasks with high accuracy and efficiency.
Input: $3.50 / Output: $3.50
texttext

google
Google
Text Embedding 004
Gemini API generates state-of-the-art embeddings for words, phrases, and sentences
Input: $0.02 / Output: $0.02
textembeddings
Gemini 1.5 Flash - 8B
High volume and lower intelligence tasks
Input: $0.04 / Output: $0.15
texttext
Gemini 1.5 Flash
Our fastest multimodal model with great performance for diverse, repetitive tasks and a 1 million context window. Now generally available for production
Input: $0.08 / Output: $0.30
texttext
Gemini 1.5 Pro
Gemini 1.5 Pro is a mid-size multimodal model that is optimized for a wide-range of reasoning tasks.
Input: $1.25 / Output: $5.00
texttext

groq
Groq
Llama 3.2 11B Vision (Preview)
A powerful multimodal model capable of processing both text and image inputs that supports multilingual, multi-turn conversations, tool use, and JSON mode.
Input: $0.18 / Output: $0.18
imagetext
Llama 3.3 70B Speculative Decoding
The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out).
Input: $0.59 / Output: $0.59
texttext
Llama 3.2 90B Vision (Preview)
A powerful multimodal model capable of processing both text and image inputs that supports multilingual, multi-turn conversations, tool use, and JSON mode.
Input: $0.90 / Output: $0.90
imagetext

lambda
Lambda Labs
Hermes 3 8B
Hermes 3 is the latest version of the flagship Hermes series of LLMs by Nous Research
Input: $0.03 / Output: $0.03
texttext
Hermes 3 70B
Hermes 3 is the latest version of the flagship Hermes series of LLMs by Nous Research
Input: $0.20 / Output: $0.20
texttext
Hermes 3 405B
Hermes 3 is the latest version of the flagship Hermes series of LLMs by Nous Research
Input: $0.90 / Output: $0.90
texttext