Run Gemini 1.5 Flash - 8B on your data

Gemini 1.5 Flash - 8B is a multimodal LLM designed for high-volume, cost-effective applications. It excels at transcription, long-context handling, and efficient processing of multimodal data. It is best suited to applications that prioritize cost and speed over complex reasoning.

Noteworthy use cases for Gemini 1.5 Flash - 8B include high-volume multimodal tasks, long-context summarization, and transcription.

Metric	Value
Parameter Count	Unknown
Mixture of Experts	No
Context Length	1,048,576 tokens
Multilingual	Yes
Quantized*	Unknown

*Quantization is specific to the inference provider, and other providers may offer the model at different quantization levels.

Model	Inference provider	Input modality	Output modality	Input price (per 1M tokens)	Output price (per 1M tokens)
Gemini 1.5 Flash	Google	text	text	$0.08	$0.30
Gemini 1.5 Flash - 8B	Google	text	text	$0.04	$0.15
Gemini 1.5 Pro	Google	text	text	$1.25	$5.00
Gemini 2.0 Flash	Google	text, image	text	$0.10	$0.40
Gemini 2.0 Flash Lite	Google	text, image	text	$0.08	$0.30
Gemini 2.0 Pro	Google	text	text	$1.25	$5.00
Gemini 2.5 Pro Experimental	Google	text, image	text	$2.50	$5.00
Gemma 2 9B Instruct	Groq	text	text	$0.20	$0.20
Gemma 3 27B	Google	text	text	$0.20	$0.40
Text Embedding 004	Google	text	embeddings	$0.02	$0.02
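
Because pricing is quoted per 1M tokens, estimating the cost of a workload is simple arithmetic. The sketch below is illustrative only: it uses the Gemini 1.5 Flash - 8B prices listed on this page ($0.04 input and $0.15 output per 1M tokens), and the function name `estimate_cost` and the example token counts are assumptions made for the illustration, not part of any provider API.

```python
from decimal import Decimal

# USD per 1M tokens for Gemini 1.5 Flash - 8B, copied from the pricing on this page.
INPUT_PRICE_PER_M = Decimal("0.04")
OUTPUT_PRICE_PER_M = Decimal("0.15")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of a workload (hypothetical helper, not a provider API)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Assumed workload: 10,000 requests, each with 2,000 input and 200 output tokens.
    requests = 10_000
    cost = estimate_cost(requests * 2_000, requests * 200)
    print(f"Estimated cost: ${cost:.2f}")
```

For the assumed workload, 20M input tokens cost $0.80 and 2M output tokens cost $0.30, for a total of about $1.10. Actual billing may differ, for example if the provider counts tokens differently from your own estimate.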