Gemini 2.5 Flash Lite

Release Date: Jul 22, 2025     Creator: Google

An efficiency-focused model from the Gemini family, engineered for high-speed processing and low-latency responses.

Model Specifications
Knowledge Cutoff
January 2025
Context (VividLLM)
64,000 Tokens
With a context window of 64,000 tokens, this model can 'remember' and analyze up to 96 pages of data at once. This makes it a top choice for processing technical manuals and multi-chapter reports.
Context (Native)
1,048,576 Tokens
Input ModalitiesTextImagePDF
Output TypeText
Max Output8192
Input Weight
0.50 x (Casual)
Output Weight
0.50 x (Casual)
Industry Benchmarks
Intelligence Index (Artificial Analysis) N/A
Coding Index (Artificial Analysis)N/A
Math Index (Artificial Analysis)N/A
Response Speed
Output Tokens per second N/A
Median Time to First Token SecondsN/A
Launch VividLLM Now => :

Benchmarks accessed via : Artificial Analysis

Disclaimer: We are providing Image Upload options to only limit models as of now, so some models which have image upload options might now have that feature on our site.