inclusionai

inclusionAI: Ling-2.6-flash

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....

Context

262k tokens

Input / 1M tokens

$0.01

Output / 1M tokens

$0.03

Benchmark

—

Pricing

Input tokens$0.01 per 1M tokens

Output tokens$0.03 per 1M tokens

Cache read$0.00 per 1M tokens

Technical details

Model IDinclusionai/ling-2.6-flash

Context window262k tokens

Input modalitiestext

Output modalitiestext

TokenizerOther

Max output tokens32,768

Use with MegaBrain

import OpenAI from 'openai'

const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY,
})

const response = await client.chat.completions.create({
  model: 'inclusionai/ling-2.6-flash',
  messages: [{ role: 'user', content: 'Hello!' }],
})

Ready to use inclusionAI: Ling-2.6-flash?

Get an API key and start making requests in minutes.

Get an API key

← All models