qwenVision

Qwen: Qwen3 VL 32B Instruct

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

Context

262k tokens

Input / 1M tokens

$0.10

Output / 1M tokens

$0.42

Benchmark

—

Pricing

Input tokens$0.10 per 1M tokens

Output tokens$0.42 per 1M tokens

Technical details

Model IDqwen/qwen3-vl-32b-instruct

Context window262k tokens

Input modalitiestext, image

Output modalitiestext

TokenizerQwen

Max output tokens32,768

Use with MegaBrain

import OpenAI from 'openai'

const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY,
})

const response = await client.chat.completions.create({
  model: 'qwen/qwen3-vl-32b-instruct',
  messages: [{ role: 'user', content: 'Hello!' }],
})

Ready to use Qwen: Qwen3 VL 32B Instruct?

Get an API key and start making requests in minutes.

Get an API key

← All models