Google / Gemini 2.0 Flash Lite
Modalities: text → text, image → text
Input: $0.075 / Output: $0.30 (per 1M tokens)

Gemini 2.0 Flash Lite is a multimodal LLM designed for cost-efficient, high-speed applications. It is suited to real-time tasks and budget-sensitive multimodal interactions, offering improved quality over Gemini 1.5 Flash at similar speed and cost.

Noteworthy features include:

  • Multimodal input with text output (multimodal output in private preview)
  • 1M token input context window and 8k token output window
  • Public preview availability with a 60 queries-per-minute (QPM) rate limit
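
To stay under the 60 QPM preview limit noted above, a client can pace its own requests. The following is a minimal sketch, not an official client: the `QpmThrottle` class and its parameters are hypothetical names introduced here, and it assumes that spacing calls evenly (one per second at 60 QPM) is acceptable, which is stricter than the limit itself requires.

```python
import time


class QpmThrottle:
    """Hypothetical helper: spaces calls so at most `qpm` start per minute."""

    def __init__(self, qpm=60, clock=time.monotonic, sleep=time.sleep):
        # Minimum gap between two consecutive calls, in seconds.
        self.min_interval = 60.0 / qpm
        self._clock = clock
        self._sleep = sleep
        self._next_allowed = None  # earliest time the next call may start

    def wait(self) -> float:
        """Block until the next call is allowed; return seconds slept."""
        now = self._clock()
        delay = 0.0
        if self._next_allowed is not None and now < self._next_allowed:
            delay = self._next_allowed - now
            self._sleep(delay)
            now = self._next_allowed
        self._next_allowed = now + self.min_interval
        return delay
```

Calling `throttle.wait()` immediately before each request keeps a single-threaded client at or below the configured rate. The clock and sleep functions are injectable so the pacing logic can be checked without real delays.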

Metric                Value
Parameter Count       Unknown
Mixture of Experts    No
Context Length        1M tokens (input)
Multilingual          Yes
Quantized*            Unknown

*Quantization details are provider-specific and not disclosed for this model.

Google models available on Oxen.ai
Prices are per 1M tokens.

Inference provider    Input modality    Output modality    Input price    Output price
Google                text              text               $0.08          $0.30
Google                text              text               $0.04          $0.15
Google                text              text               $1.25          $5.00
Google                text, image       text               $0.10          $0.40
Google                text, image       text               $0.08          $0.30
Groq                  text              text               $0.20          $0.20
Google                text              embeddings         $0.02          $0.02
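
As a rough illustration of how per-1M-token pricing turns into a request cost, the sketch below uses the rates listed at the top of this page for Gemini 2.0 Flash Lite ($0.075 input, $0.30 output). The function name and the example token counts are hypothetical, and actual billing may round or meter tokens differently.

```python
# Listed rates for Gemini 2.0 Flash Lite, in USD per 1M tokens.
INPUT_USD_PER_M = 0.075
OUTPUT_USD_PER_M = 0.30


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Example: a 200,000-token prompt with an 8,000-token reply.
print(f"${estimate_cost(200_000, 8_000):.4f}")  # prints $0.0174
```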