z-ai
Z.ai: GLM 4.7 Flash
As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...
Context
203k tokens
Input / 1M tokens
$0.06
Output / 1M tokens
$0.40
Benchmark
—
Pricing
Input tokens$0.06 per 1M tokens
Output tokens$0.40 per 1M tokens
Cache read$0.01 per 1M tokens
Technical details
Model IDz-ai/glm-4.7-flash
Context window203k tokens
Input modalitiestext
Output modalitiestext
TokenizerOther
Max output tokens16,384
Use with MegaBrain
import OpenAI from 'openai'
const client = new OpenAI({
baseURL: 'https://getmegabrain.com/api/gateway/v1',
apiKey: process.env.MEGABRAIN_API_KEY,
})
const response = await client.chat.completions.create({
model: 'z-ai/glm-4.7-flash',
messages: [{ role: 'user', content: 'Hello!' }],
})