qwenVision

Qwen: Qwen3.5-Flash

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...

Context

1M tokens

Input / 1M tokens

$0.07

Output / 1M tokens

$0.26

Benchmark

—

Pricing

Input tokens$0.07 per 1M tokens

Output tokens$0.26 per 1M tokens

Technical details

Model IDqwen/qwen3.5-flash-02-23

Context window1M tokens

Input modalitiestext, image, video

Output modalitiestext

TokenizerQwen3

Max output tokens65,536

Use with MegaBrain

import OpenAI from 'openai'

const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY,
})

const response = await client.chat.completions.create({
  model: 'qwen/qwen3.5-flash-02-23',
  messages: [{ role: 'user', content: 'Hello!' }],
})

Ready to use Qwen: Qwen3.5-Flash?

Get an API key and start making requests in minutes.

Get an API key

← All models