DeepSeek: DeepSeek V4 Flash

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

Pricing

Input tokens: $0.09 per 1M tokens

Output tokens: $0.18 per 1M tokens

Cache read: $0.02 per 1M tokens
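
To make the per-million rates concrete, here is a minimal sketch of a cost estimate. The three rates are taken from the pricing listed here. The `TokenUsage` shape and the `estimateCostUSD` helper are hypothetical names introduced for illustration, and the sketch assumes that cached prompt tokens are billed at the cache-read rate instead of the input rate, which the page does not state explicitly.

```typescript
// Rates in USD per 1M tokens, copied from the pricing listed for this model.
const PRICE_PER_MILLION = {
  input: 0.09,
  output: 0.18,
  cacheRead: 0.02,
} as const

// Hypothetical usage record; field names are illustrative, not a gateway API.
interface TokenUsage {
  inputTokens: number // prompt tokens not served from cache
  outputTokens: number // generated tokens
  cachedInputTokens?: number // prompt tokens served from cache (assumed billed at cacheRead)
}

// Estimate the cost of one request in USD.
function estimateCostUSD(usage: TokenUsage): number {
  const cached = usage.cachedInputTokens ?? 0
  const totalPerMillion =
    usage.inputTokens * PRICE_PER_MILLION.input +
    usage.outputTokens * PRICE_PER_MILLION.output +
    cached * PRICE_PER_MILLION.cacheRead
  return totalPerMillion / 1_000_000
}

// Example: 200,000 prompt tokens and 4,000 generated tokens.
const estimate = estimateCostUSD({ inputTokens: 200_000, outputTokens: 4_000 })
console.log(`Estimated cost: $${estimate.toFixed(4)}`)
```

Actual billing may round or count tokens differently, so treat the result as an estimate only.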

Technical details

Model ID: deepseek/deepseek-v4-flash

Context window: 1M tokens

Input modalities: text

Output modalities: text

Tokenizer: DeepSeek

Max output tokens: 65,536
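
The limits above can be checked before sending a request. The following is a rough sketch, not gateway behavior: `planMaxTokens` is a hypothetical helper, the context window is assumed to be exactly 1,000,000 tokens (the page only says "1M"), and prompt and output tokens are assumed to share that window.

```typescript
// Limits from the technical details listed for this model.
const MAX_OUTPUT_TOKENS = 65_536
// Assumption: "1M tokens" is taken as exactly 1,000,000.
const CONTEXT_WINDOW_TOKENS = 1_000_000

// Return the largest output budget that respects both limits.
// Assumes prompt and output tokens count against the same context window.
function planMaxTokens(promptTokens: number, requestedOutput: number): number {
  const remaining = CONTEXT_WINDOW_TOKENS - promptTokens
  if (remaining <= 0) {
    throw new RangeError('Prompt alone fills or exceeds the assumed context window')
  }
  return Math.min(requestedOutput, MAX_OUTPUT_TOKENS, remaining)
}

// A short prompt asking for more output than the model allows is capped.
console.log(planMaxTokens(1_000, 100_000))
```

The returned value could be passed as `max_tokens` in a chat completion request, assuming the gateway honors that OpenAI-style parameter.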

Use with MegaBrain

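// Requires the `openai` npm package; top-level await needs an ES module context.
import OpenAI from 'openai'

// Point the OpenAI SDK at the MegaBrain gateway.
const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY,
})

const response = await client.chat.completions.create({
  model: 'deepseek/deepseek-v4-flash',
  messages: [{ role: 'user', content: 'Hello!' }],
})

// Print the assistant's reply.
console.log(response.choices[0].message.content)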
