Models/Gemini 1.5 Flash - 8B
GoogleGoogle / Gemini 1.5 Flash - 8B
Released: 9/24/2024
texttext
Input: $0.038 / Output: $0.15

Gemini 1.5 Flash-8B is a Small Language Model (SLM) designed for high-volume, cost-effective applications. It excels in tasks that require speed and efficiency, such as chat, transcription, and long context language translation.

Some other noteworthy features of Gemini 1.5 Flash-8B include its ability to process multimodal inputs (text, images, audio, and video) and generate structured outputs like JSON.

MetricValue
Parameter Count8 billion
Mixture of ExpertsNo
Context Length1,000,000 tokens
MultilingualYes
Quantized*Unknown

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

Google models available on Oxen.ai
ModalityPrice (1M tokens)
ModelInference providerInputOutputInputOutput
Google
texttext$0.08$0.30
Google
texttext$0.04$0.15
Google
texttext$1.25$5.00
Groq
texttext$0.20$0.20
Google
textembeddings$0.02$0.02
See all models available on Oxen.ai