deepseek
DeepSeek: DeepSeek V4 Flash
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...
Context
1M tokens
Input / 1M tokens
$0.09
Output / 1M tokens
$0.18
Benchmark
—
Pricing
Input tokens$0.09 per 1M tokens
Output tokens$0.18 per 1M tokens
Cache read$0.02 per 1M tokens
Technical details
Model IDdeepseek/deepseek-v4-flash
Context window1M tokens
Input modalitiestext
Output modalitiestext
TokenizerDeepSeek
Max output tokens65,536
Use with MegaBrain
import OpenAI from 'openai'
const client = new OpenAI({
baseURL: 'https://getmegabrain.com/api/gateway/v1',
apiKey: process.env.MEGABRAIN_API_KEY,
})
const response = await client.chat.completions.create({
model: 'deepseek/deepseek-v4-flash',
messages: [{ role: 'user', content: 'Hello!' }],
})Ready to use DeepSeek: DeepSeek V4 Flash?
Get an API key and start making requests in minutes.
Get an API key