Gemini 3.1 Flash Lite Preview

Release Date: Mar 3, 2026     Creator: Google

A cost-efficient multimodal model from the Gemini, suitable for general-purpose tasks.

Model Specifications
Knowledge Cutoff
January 2025
Context (VividLLM)
64,000 Tokens
With a context window of 64,000 tokens, this model can 'remember' and analyze up to 96 pages of data at once. This makes it a top choice for processing technical manuals and multi-chapter reports.
Context (Native)
1,048,576 Tokens
Input ModalitiesTextImagePDF
Output TypeText
Max Output8192
Input Weight
1.00 x (Casual)
Output Weight
1.00 x (Casual)
Industry Benchmarks
Intelligence Index (Artificial Analysis) N/A
Coding Index (Artificial Analysis)N/A
Math Index (Artificial Analysis)N/A
Response Speed
Output Tokens per second N/A
Median Time to First Token SecondsN/A
Launch VividLLM Now => :

Benchmarks accessed via : Artificial Analysis

Disclaimer: We are providing Image Upload options to only limit models as of now, so some models which have image upload options might now have that feature on our site.