Models/Gemini 1.5 Flash - 8B
GoogleGoogle / Gemini 1.5 Flash - 8B
Released: 9/24/2024
texttext
Input: $0.038 / Output: $0.15

Gemini 1.5 Flash - 8B is a Multimodal LLM designed for high-volume, cost-effective applications. It excels in transcription, handling long contexts, and tasks requiring efficient processing of multimodal data. The model is particularly suited for applications where cost-effectiveness and speed are prioritized over complex reasoning.

Some noteworthy use cases of Gemini 1.5 Flash - 8B include handling high-volume multimodal tasks, long-context summarization, and transcription tasks.

MetricValue
Parameter CountUnknown
Mixture of ExpertsNo
Context Length1,048,576 tokens
MultilingualYes
Quantized*Unknown

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

Google models available on Oxen.ai
ModalityPrice (1M tokens)
ModelInference providerInputOutputInputOutput
Google
texttext$0.08$0.30
Google
texttext$0.04$0.15
Google
texttext$1.25$5.00
Google
text, imagetext$0.10$0.40
Google
text, imagetext$0.08$0.30
Groq
texttext$0.20$0.20
Google
textembeddings$0.02$0.02
See all models available on Oxen.ai