googleVision

Google: Gemini 2.5 Flash Lite Preview 09-2025

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

Context

1M tokens

Input / 1M tokens

$0.10

Output / 1M tokens

$0.40

Benchmark

—

Pricing

Input tokens$0.10 per 1M tokens

Output tokens$0.40 per 1M tokens

Cache read$0.01 per 1M tokens

Cache write$0.08 per 1M tokens

Image$0.10 per image

Technical details

Model IDgoogle/gemini-2.5-flash-lite-preview-09-2025

Context window1M tokens

Input modalitiestext, image, file, audio, video

Output modalitiestext

TokenizerGemini

Max output tokens65,535

Use with MegaBrain

import OpenAI from 'openai'

const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY,
})

const response = await client.chat.completions.create({
  model: 'google/gemini-2.5-flash-lite-preview-09-2025',
  messages: [{ role: 'user', content: 'Hello!' }],
})

Ready to use Google: Gemini 2.5 Flash Lite Preview 09-2025?

Get an API key and start making requests in minutes.

Get an API key

← All models