z-ai

Z.ai: GLM 4.7 Flash

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

Context

203k tokens

Input / 1M tokens

$0.06

Output / 1M tokens

$0.40

Benchmark

—

Pricing

Input tokens$0.06 per 1M tokens

Output tokens$0.40 per 1M tokens

Cache read$0.01 per 1M tokens

Technical details

Model IDz-ai/glm-4.7-flash

Context window203k tokens

Input modalitiestext

Output modalitiestext

TokenizerOther

Max output tokens16,384

Use with MegaBrain

import OpenAI from 'openai'

const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY,
})

const response = await client.chat.completions.create({
  model: 'z-ai/glm-4.7-flash',
  messages: [{ role: 'user', content: 'Hello!' }],
})

Ready to use Z.ai: GLM 4.7 Flash?

Get an API key and start making requests in minutes.

Get an API key

← All models