Inception: Mercury 2
Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving...
Context: 128k tokens
Input: $0.25 / 1M tokens
Output: $0.75 / 1M tokens
Pricing
Input tokens: $0.25 per 1M tokens
Output tokens: $0.75 per 1M tokens
Cache read: $0.02 per 1M tokens
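To make the per-token rates concrete, here is a minimal sketch of a cost estimate built from the three prices listed above. The `Usage` shape and the `estimateCostUsd` helper are hypothetical names for illustration. The sketch also assumes that cached prompt tokens are billed at the cache-read rate instead of the input rate, which this page does not state explicitly.

```typescript
// Prices in USD per 1M tokens, copied from the pricing list above.
const PRICE_PER_MILLION = {
  input: 0.25,
  output: 0.75,
  cacheRead: 0.02,
} as const

// Hypothetical usage record; field names are illustrative, not an API shape.
interface Usage {
  inputTokens: number // prompt tokens NOT served from cache
  outputTokens: number // generated tokens
  cachedInputTokens?: number // prompt tokens served from cache (assumption)
}

// Estimate the cost of one request in USD.
// Assumption: cached tokens are charged at the cache-read rate only.
function estimateCostUsd(usage: Usage): number {
  const cached = usage.cachedInputTokens ?? 0
  const totalPerMillion =
    usage.inputTokens * PRICE_PER_MILLION.input +
    usage.outputTokens * PRICE_PER_MILLION.output +
    cached * PRICE_PER_MILLION.cacheRead
  return totalPerMillion / 1_000_000
}

// Example: a 10,000-token prompt with a 2,000-token reply.
const cost = estimateCostUsd({ inputTokens: 10_000, outputTokens: 2_000 })
console.log(cost.toFixed(4)) // 0.0040
```

At these rates, one million input tokens plus one million output tokens comes to $1.00.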
Technical details
Model ID: inception/mercury-2
Context window: 128k tokens
Input modalities: text
Output modalities: text
Tokenizer: Other
Max output tokens: 50,000
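The two limits above interact when sizing a request. The sketch below picks an output budget that respects both, under two assumptions this page does not confirm: that "128k" means 128,000 tokens (not 131,072), and that prompt and output tokens share the same context window. `clampMaxTokens` is a hypothetical helper, and the prompt token count has to come from your own estimate, since the tokenizer is listed only as "Other".

```typescript
// Limits taken from the technical details above.
const CONTEXT_WINDOW = 128_000 // assumption: "128k" read as 128,000
const MAX_OUTPUT_TOKENS = 50_000

// Return the largest output budget that fits both limits.
// Assumption: prompt and output tokens count against one shared window.
function clampMaxTokens(promptTokens: number, requested: number): number {
  if (promptTokens < 0 || requested < 1) {
    throw new RangeError('promptTokens must be >= 0 and requested >= 1')
  }
  const room = CONTEXT_WINDOW - promptTokens
  if (room < 1) {
    throw new RangeError('prompt already fills the context window')
  }
  return Math.min(requested, MAX_OUTPUT_TOKENS, room)
}

console.log(clampMaxTokens(1_000, 60_000)) // 50000 (capped by max output)
console.log(clampMaxTokens(120_000, 50_000)) // 8000 (capped by remaining window)
```

The result could then be passed as `max_tokens` in a chat completion request, provided the gateway honors that parameter.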
Use with MegaBrain
import OpenAI from 'openai'

// Point the OpenAI SDK at the MegaBrain gateway.
const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY,
})

const response = await client.chat.completions.create({
  model: 'inception/mercury-2',
  messages: [{ role: 'user', content: 'Hello!' }],
})

// Print the text of the first completion choice.
console.log(response.choices[0].message.content)

Ready to use Inception: Mercury 2?
Get an API key and start making requests in minutes.