Run Models on Your Data

Choose the right model, get to the perfect prompt, kick off the data flywheel.
Oxen makes it easy to improve your use of state of the art AI.

26 models, from 5 providers. New models added every week.


mistral
Mistral AI
Ministral 3B
High quality edge model with only 3B parameters. Test this model through our UI, and then download the weights and run it on the edge on edge devices.
Input: $0.04 / Output: $0.04
texttext
Ministral 8B
Powerful edge model with extremely high performance/price ratio.
Input: $0.10 / Output: $0.10
texttext
Pixtral 12B
A 12B model with image understanding capabilities in addition to text.
Input: $0.15 / Output: $0.15
texttext
Mistral Nemo
Multilingual open source model.
Input: $0.15 / Output: $0.15
texttext
Open Mistral 7B
Decoder-only transformer for chat based purposes.
Input: $0.25 / Output: $0.25
texttext
Mistral Small
Enterprise-grade 22B parameter small model with the lastest version v2 released September 2024.
Input: $0.20 / Output: $0.60
texttext
Codestral
Cutting-edge language model for coding.
Input: $0.20 / Output: $0.60
texttext
Open Mixtral 8x7B
Sparse Mixture-of-Experts (MoE) model with a total of 45 billion parameters. Best model overall regarding cost/performance trade-offs.
Input: $0.70 / Output: $0.70
texttext
Mistral Large
Reasoning model for high-complexity tasks with the lastest version v2 released July 2024.
Input: $2.00 / Output: $6.00
texttext
Open Mixtral 8x22B
Sparse Mixture-of-Experts (SMoE) model that uses only 39B active parameters out of 141B, offering unparalleled cost efficiency for its size.
Input: $2.00 / Output: $6.00
texttext

openai
OpenAI
Text Embedding 3 - Small
Smaller text embeddings with a dimension of
Input: $0.02 / Output: $0.02
textembedding
Text Embedding 3 - Large
Larger text embeddings
Input: $0.13 / Output: $0.13
textembedding
GPT-4o mini
Affordable and intelligent small model for fast, lightweight tasks
Input: $0.15 / Output: $0.60
texttext
GPT-4o
High-intelligence flagship model for complex, multistep tasks.
Input: $2.50 / Output: $10.00
texttext
o1 mini
More efficient than o1-preview with similar reasoning capabilities.
Input: $3.00 / Output: $12.00
texttext
o1 preview
Language model trained with reinforcement learning to perform complex reasoning.
Input: $15.00 / Output: $60.00
texttext

google
Google
Text Embedding 004
Gemini API generates state-of-the-art embeddings for words, phrases, and sentences
Input: $0.02 / Output: $0.02
textembedding
Gemini 1.5 Flash - 8B
High volume and lower intelligence tasks
Input: $0.04 / Output: $0.15
texttext
Gemini 1.5 Flash
Our fastest multimodal model with great performance for diverse, repetitive tasks and a 1 million context window. Now generally available for production
Input: $0.08 / Output: $0.30
texttext
Gemini 1.5 Pro
Gemini 1.5 Pro is a mid-size multimodal model that is optimized for a wide-range of reasoning tasks.
Input: $1.25 / Output: $5.00
texttext

fireworks
Fireworks AI
Llama v3.1 8B Instruct
The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction tuned generative models with 8B
Input: $0.20 / Output: $0.20
texttext
Llama v3.1 70B Instruct
The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction tuned generative models with 70B
Input: $0.90 / Output: $0.90
texttext
Llama v3.1 405B Instruct
The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction tuned generative models with 405B
Input: $3.00 / Output: $3.00
texttext

togetherai
Together.ai
Meta Llama 3.1 8B Instruct Turbo
8 billion parameter instruct-tuned decoder-only handles complex language tasks with high accuracy and efficiency.
Input: $0.18 / Output: $0.18
texttext
Meta Llama 3.1 70B Instruct Turbo
70 billion parameter instruct-tuned decoder-only handles complex language tasks with high accuracy and efficiency.
Input: $0.88 / Output: $0.88
texttext
Meta Llama 3.1 405B Instruct Turbo
405 billion parameter instruct-tuned decoder-only handles complex language tasks with high accuracy and efficiency.
Input: $3.50 / Output: $3.50
texttext