inception

Inception: Mercury 2

Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving...

Context

128k tokens

Input / 1M tokens

$0.25

Output / 1M tokens

$0.75

Benchmark

—

Pricing

Input tokens$0.25 per 1M tokens

Output tokens$0.75 per 1M tokens

Cache read$0.02 per 1M tokens

Technical details

Model IDinception/mercury-2

Context window128k tokens

Input modalitiestext

Output modalitiestext

TokenizerOther

Max output tokens50,000

Use with MegaBrain

import OpenAI from 'openai'

const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY,
})

const response = await client.chat.completions.create({
  model: 'inception/mercury-2',
  messages: [{ role: 'user', content: 'Hello!' }],
})

Ready to use Inception: Mercury 2?

Get an API key and start making requests in minutes.

Get an API key

← All models