Google / Gemini 1.5 Flash - 8B
Released: 9/24/2024texttext
Input: $0.038 / Output: $0.15
Gemini 1.5 Flash - 8B is a Multimodal LLM designed for high-volume, cost-effective applications. It excels in transcription, handling long contexts, and tasks requiring efficient processing of multimodal data. The model is particularly suited for applications where cost-effectiveness and speed are prioritized over complex reasoning.
Some noteworthy use cases of Gemini 1.5 Flash - 8B include handling high-volume multimodal tasks, long-context summarization, and transcription tasks.
Metric | Value |
---|---|
Parameter Count | Unknown |
Mixture of Experts | No |
Context Length | 1,048,576 tokens |
Multilingual | Yes |
Quantized* | Unknown |
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.
Google models available on Oxen.ai
Modality | Price (1M tokens) | ||||
---|---|---|---|---|---|
Model | Inference provider | Input | Output | Input | Output |
![]() | text | text | $0.08 | $0.30 | |
![]() | text | text | $0.04 | $0.15 | |
![]() | text | text | $1.25 | $5.00 | |
![]() | text, image | text | $0.10 | $0.40 | |
![]() | text, image | text | $0.08 | $0.30 | |
![]() | text | text | $0.20 | $0.20 | |
![]() | text | embeddings | $0.02 | $0.02 |