Google / Gemini 2.0 Flash
texttextimagetext
Input: $0.10 / Output: $0.40
Gemini 2.0 Flash is a Multimodal LLM designed for building advanced agentic applications, excelling in multi-step task execution and real-time data integration. It supports multimodal inputs (text, images, audio, video) and outputs (text, images, speech), with enhanced reasoning capabilities through its Thinking Mode that reduces hallucinations and improves accuracy. Key strengths include integration with Google tools (Search, Maps, code execution) and third-party functions via the Multimodal Live API.
Some noteworthy use cases include:
- Real-time media analysis leveraging multimodal inputs and outputs
- Complex query handling with advanced reasoning for context-aware responses
- Dynamic decision-making through live data interaction and tool integration
Metric | Value |
---|---|
Parameter Count | Unknown |
Mixture of Experts | No |
Context Length | 1,048,576 tokens |
Multilingual | Yes |
Quantized* | Unknown |
*Quantization details are provider-specific and not disclosed for this model.
Google models available on Oxen.ai
Modality | Price (1M tokens) | ||||
---|---|---|---|---|---|
Model | Inference provider | Input | Output | Input | Output |
![]() | text | text | $0.08 | $0.30 | |
![]() | text | text | $0.04 | $0.15 | |
![]() | text | text | $1.25 | $5.00 | |
![]() | text, image | text | $0.10 | $0.40 | |
![]() | text, image | text | $0.08 | $0.30 | |
![]() | text | text | $0.20 | $0.20 | |
![]() | text | embeddings | $0.02 | $0.02 |