Google / Gemini 1.5 Flash
Released: 5/14/2024texttext
Input: $0.075 / Output: $0.30
Gemini 1.5 Flash is a Multimodal LLM designed for high-volume, rapid processing tasks. It excels in efficient handling of multimodal inputs (text, images, audio, and video) while maintaining a balance between performance and computational cost.
Some other noteworthy features of Gemini 1.5 Flash include its ability to process up to 1 million tokens in context and its optimization for speed and efficiency in tasks such as summarization, chat, image and video captioning, and data extraction.
Metric | Value |
---|---|
Parameter Count | Unknown |
Mixture of Experts | Yes |
Context Length | 1,000,000 tokens |
Multilingual | Yes |
Quantized* | Unknown |
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.
Google models available on Oxen.ai
Modality | Price (1M tokens) | ||||
---|---|---|---|---|---|
Model | Inference provider | Input | Output | Input | Output |
Google | text | text | $0.08 | $0.30 | |
Google | text | text | $0.04 | $0.15 | |
Google | text | text | $1.25 | $5.00 | |
Groq | text | text | $0.20 | $0.20 | |
Google | text | embeddings | $0.02 | $0.02 |