googleVision

Google: Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

Context
1M tokens
Input / 1M tokens
$0.25
Output / 1M tokens
$1.50
Benchmark

Pricing

Input tokens$0.25 per 1M tokens
Output tokens$1.50 per 1M tokens
Cache read$0.02 per 1M tokens
Cache write$0.08 per 1M tokens
Image$0.25 per image

Technical details

Model IDgoogle/gemini-3.1-flash-lite
Context window1M tokens
Input modalitiestext, image, video, file, audio
Output modalitiestext
TokenizerGemini
Max output tokens65,536

Use with MegaBrain

import OpenAI from 'openai'

const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY,
})

const response = await client.chat.completions.create({
  model: 'google/gemini-3.1-flash-lite',
  messages: [{ role: 'user', content: 'Hello!' }],
})

Ready to use Google: Gemini 3.1 Flash Lite?

Get an API key and start making requests in minutes.

Get an API key