google · Vision
Google: Gemini 2.5 Flash Lite
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...
Context: 1M tokens
Input: $0.10 per 1M tokens
Output: $0.40 per 1M tokens
Benchmark: none listed
Pricing
Input tokens: $0.10 per 1M tokens
Output tokens: $0.40 per 1M tokens
Cache read: $0.01 per 1M tokens
Cache write: $0.08 per 1M tokens
Image: $0.10 per image
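To turn the rates above into a per-request estimate, a minimal sketch might look like the following. The `Usage` shape and the `estimateCostUSD` helper are hypothetical names for illustration, not part of any MegaBrain API. The sketch also assumes each category is billed independently at its listed rate; how cached tokens interact with regular input billing is not stated on this page.

```typescript
// Rates copied from the pricing list above (USD).
const RATES = {
  inputPerMTok: 0.10,
  outputPerMTok: 0.40,
  cacheReadPerMTok: 0.01,
  cacheWritePerMTok: 0.08,
  perImage: 0.10,
} as const;

// Hypothetical usage shape for illustration only.
interface Usage {
  inputTokens: number;
  outputTokens: number;
  cacheReadTokens?: number;
  cacheWriteTokens?: number;
  images?: number;
}

// Assumption: every category is billed separately at its listed rate.
function estimateCostUSD(u: Usage): number {
  const perMillion = (tokens: number, rate: number): number =>
    (tokens / 1e6) * rate;
  return (
    perMillion(u.inputTokens, RATES.inputPerMTok) +
    perMillion(u.outputTokens, RATES.outputPerMTok) +
    perMillion(u.cacheReadTokens ?? 0, RATES.cacheReadPerMTok) +
    perMillion(u.cacheWriteTokens ?? 0, RATES.cacheWritePerMTok) +
    (u.images ?? 0) * RATES.perImage
  );
}

// Format to cents-and-fractions for display.
const example = estimateCostUSD({ inputTokens: 1e6, outputTokens: 500000 });
console.log(`Estimated cost: $${example.toFixed(4)}`);
```

Under these assumptions, 1M input tokens plus 500K output tokens comes to about $0.30.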
Technical details
Model ID: google/gemini-2.5-flash-lite
Context window: 1M tokens
Input modalities: text, image, file, audio, video
Output modalities: text
Tokenizer: Gemini
Max output tokens: 65,535
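The image input modality and the output limit listed above could be combined into a request body along these lines. This is a sketch only: it assumes the gateway accepts the OpenAI-style `image_url` content part and the `max_tokens` parameter, which this page does not confirm. `buildImageRequest` and the types are hypothetical helpers, and the image URL is a placeholder.

```typescript
const MODEL_ID = 'google/gemini-2.5-flash-lite';
const MAX_OUTPUT_TOKENS = 65535; // from the technical details above

// OpenAI-style content parts; gateway support is an assumption.
type ContentPart =
  | { type: 'text'; text: string }
  | { type: 'image_url'; image_url: { url: string } };

interface ChatRequest {
  model: string;
  max_tokens: number;
  messages: { role: 'user'; content: ContentPart[] }[];
}

// Hypothetical helper: builds a text + image request and clamps
// max_tokens to the model's listed output limit.
function buildImageRequest(
  prompt: string,
  imageUrl: string,
  maxTokens: number = 1024,
): ChatRequest {
  const clamped = Math.max(
    1,
    Math.min(Math.floor(maxTokens), MAX_OUTPUT_TOKENS),
  );
  return {
    model: MODEL_ID,
    max_tokens: clamped,
    messages: [
      {
        role: 'user',
        content: [
          { type: 'text', text: prompt },
          { type: 'image_url', image_url: { url: imageUrl } },
        ],
      },
    ],
  };
}

// Placeholder URL for illustration.
const request = buildImageRequest(
  'Describe this image.',
  'https://example.com/cat.png',
  100000,
);
console.log(JSON.stringify(request, null, 2));
```

The resulting object could then be passed to `client.chat.completions.create(...)`; a requested limit above 65,535 is clamped down rather than sent as-is.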
Use with MegaBrain
import OpenAI from 'openai'

// Point the OpenAI SDK at the MegaBrain gateway
const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY,
})

const response = await client.chat.completions.create({
  model: 'google/gemini-2.5-flash-lite',
  messages: [{ role: 'user', content: 'Hello!' }],
})

// Print the assistant's reply
console.log(response.choices[0].message.content)

Ready to use Google: Gemini 2.5 Flash Lite?
Get an API key and start making requests in minutes.