Models/Gemini 1.5 Flash
GoogleGoogle / Gemini 1.5 Flash
Released: 5/14/2024
texttext
Input: $0.075 / Output: $0.30

Gemini 1.5 Flash is a Multimodal LLM designed for high-volume, rapid processing tasks. It excels in efficient handling of multimodal inputs (text, images, audio, and video) while maintaining a balance between performance and computational cost.

Some other noteworthy features of Gemini 1.5 Flash include its ability to process up to 1 million tokens in context and its optimization for speed and efficiency in tasks such as summarization, chat, image and video captioning, and data extraction.

MetricValue
Parameter CountUnknown
Mixture of ExpertsYes
Context Length1,000,000 tokens
MultilingualYes
Quantized*Unknown

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

Google models available on Oxen.ai
ModalityPrice (1M tokens)
ModelInference providerInputOutputInputOutput
Google
texttext$0.08$0.30
Google
texttext$0.04$0.15
Google
texttext$1.25$5.00
Groq
texttext$0.20$0.20
Google
textembeddings$0.02$0.02
See all models available on Oxen.ai