Google / Gemini 2.0 Flash
Modalities: text → text, image → text
Input: $0.10 / Output: $0.40 (per 1M tokens)

Gemini 2.0 Flash is a multimodal LLM designed for building agentic applications, with particular strengths in multi-step task execution and real-time data integration. It accepts multimodal inputs (text, images, audio, video) and produces multimodal outputs (text, images, speech). Its Thinking Mode is intended to strengthen reasoning, reducing hallucinations and improving accuracy. It also integrates with Google tools (Search, Maps, code execution) and with third-party functions via the Multimodal Live API.

Some noteworthy use cases include:

  • Real-time media analysis leveraging multimodal inputs and outputs
  • Complex query handling with advanced reasoning for context-aware responses
  • Dynamic decision-making through live data interaction and tool integration

Metric              Value
Parameter Count     Unknown
Mixture of Experts  No
Context Length      1,048,576 tokens
Multilingual        Yes
Quantized*          Unknown

*Quantization details are provider-specific and not disclosed for this model.
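
As a rough illustration of how the listed prices translate into per-request cost, here is a minimal sketch. It assumes the $0.10 / $0.40 figures are USD per 1M tokens (as the pricing table header suggests) and that the context length bounds prompt and response tokens together, which the listing does not state. The function names `estimate_cost` and `fits_context` are hypothetical helpers, not part of any Oxen.ai or Google API.

```python
# Figures copied from the listing; the per-1M-token unit is an assumption
# based on the "Price (1M tokens)" header in the pricing table.
INPUT_PRICE_PER_M = 0.10    # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 0.40   # USD per 1M output tokens
CONTEXT_LENGTH = 1_048_576  # tokens, from the metrics table


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a single request."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def fits_context(input_tokens: int, output_tokens: int) -> bool:
    """Assumption: prompt and response share one context window."""
    return input_tokens + output_tokens <= CONTEXT_LENGTH


if __name__ == "__main__":
    # Hypothetical request: a 200k-token prompt with a 2k-token answer.
    print(f"${estimate_cost(200_000, 2_000):.4f}")
    print(fits_context(200_000, 2_000))
```

Actual billing may differ, for example if image inputs are metered differently from text, so treat the result as an estimate only.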

Google models available on Oxen.ai
Inference provider  Modality (input)  Modality (output)  Price per 1M tokens (input)  Price per 1M tokens (output)
Google              text              text               $0.08                        $0.30
Google              text              text               $0.04                        $0.15
Google              text              text               $1.25                        $5.00
Google              text, image       text               $0.10                        $0.40
Google              text, image       text               $0.08                        $0.30
Groq                text              text               $0.20                        $0.20
Google              text              embeddings         $0.02                        $0.02