Google: Gemini 2.5 Flash Lite

google/gemini-2.5-flash-lite

Released Jul 22, 2025Knowledge cutoff Jan 31, 20251,048,576 context$0.10/M input tokens$0.40/M output tokens

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.

OpenRouter

Product

Chat
Rankings
Apps
Models
Providers
Pricing
Enterprise
Labs

Company

About
Announcements
CareersHiring
Privacy
Terms of Service
Support
State of AI
Works With OR
Data

Developer

Documentation
API Reference
SDK
Status

Connect

Discord
GitHub
LinkedIn
X
YouTube

Google: Gemini 2.5 Flash Lite

google/gemini-2.5-flash-lite

Released Jul 22, 2025Knowledge cutoff Jan 31, 20251,048,576 context$0.10/M input tokens$0.40/M output tokens

Recent activity on Gemini 2.5 Flash Lite

Total usage per day on OpenRouter

Prompt

71.3B

Completion

6.85B

Reasoning

444M

Prompt tokens measure input size. Reasoning tokens show internal thinking before a response. Completion tokens reflect total output length.