inclusionAI: Ling-2.6-flash
Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....
Context: 262k tokens
Input / 1M tokens: $0.01
Output / 1M tokens: $0.03
Benchmark: not listed
Pricing
Input tokens: $0.01 per 1M tokens
Output tokens: $0.03 per 1M tokens
Cache read: $0.00 per 1M tokens
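To make the per-token arithmetic concrete, here is a minimal sketch of a cost estimator built from the three prices listed above. The helper name `estimateCostUSD` is hypothetical, and the way cached tokens are treated (as a subset of input tokens billed at the cache-read rate) is an assumption, not something the page states.

```typescript
// Per-1M-token prices copied from the pricing list (USD).
const PRICE_PER_MILLION = { input: 0.01, output: 0.03, cacheRead: 0.0 }

// Hypothetical helper, not part of any SDK.
// Assumption: cachedInputTokens is the portion of inputTokens served from
// cache, billed at the cache-read rate instead of the input rate.
function estimateCostUSD(
  inputTokens: number,
  outputTokens: number,
  cachedInputTokens = 0,
): number {
  // Input tokens that were not served from cache.
  const freshInput = Math.max(inputTokens - cachedInputTokens, 0)
  const totalPerMillionUnits =
    freshInput * PRICE_PER_MILLION.input +
    cachedInputTokens * PRICE_PER_MILLION.cacheRead +
    outputTokens * PRICE_PER_MILLION.output
  // Prices are quoted per 1M tokens, so scale down at the end.
  return totalPerMillionUnits / 1_000_000
}
```

Under these assumptions, a request with 200,000 input tokens and 2,000 output tokens comes to roughly $0.00206.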
Technical details
Model ID: inclusionai/ling-2.6-flash
Context window: 262k tokens
Input modalities: text
Output modalities: text
Tokenizer: Other
Max output tokens: 32,768
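The two limits above interact: a long prompt leaves less room for output. The sketch below shows one way to pick a safe output budget. It assumes the context window is shared between prompt and output, and it treats "262k" as 262,000 tokens because the exact figure is not given. `clampMaxTokens` is a hypothetical helper.

```typescript
// Max output tokens as listed in the technical details.
const MAX_OUTPUT_TOKENS = 32_768
// Assumption: "262k" is read as 262,000; the exact limit is not stated.
const CONTEXT_WINDOW_TOKENS = 262_000

// Hypothetical helper: returns the largest output budget that respects both
// the per-response cap and whatever room the prompt leaves in the window.
function clampMaxTokens(promptTokens: number, requestedOutputTokens: number): number {
  const remaining = Math.max(CONTEXT_WINDOW_TOKENS - promptTokens, 0)
  return Math.max(Math.min(requestedOutputTokens, MAX_OUTPUT_TOKENS, remaining), 0)
}
```

The result could be passed as `max_tokens` in a chat completion request, assuming the gateway honors that parameter.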
Use with MegaBrain
import OpenAI from 'openai'

// Point the OpenAI SDK at the MegaBrain gateway.
const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY,
})

// Send a single-turn chat request to Ling-2.6-flash.
const response = await client.chat.completions.create({
  model: 'inclusionai/ling-2.6-flash',
  messages: [{ role: 'user', content: 'Hello!' }],
})

Ready to use inclusionAI: Ling-2.6-flash?
Get an API key and start making requests in minutes.