z-ai

Z.ai: GLM 4.5 Air

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...

Context
131k tokens
Input / 1M tokens
$0.13
Output / 1M tokens
$0.85
Benchmark

Pricing

Input tokens$0.13 per 1M tokens
Output tokens$0.85 per 1M tokens
Cache read$0.02 per 1M tokens

Technical details

Model IDz-ai/glm-4.5-air
Context window131k tokens
Input modalitiestext
Output modalitiestext
TokenizerOther
Max output tokens98,304

Use with MegaBrain

import OpenAI from 'openai'

const client = new OpenAI({
  baseURL: 'https://getmegabrain.com/api/gateway/v1',
  apiKey: process.env.MEGABRAIN_API_KEY,
})

const response = await client.chat.completions.create({
  model: 'z-ai/glm-4.5-air',
  messages: [{ role: 'user', content: 'Hello!' }],
})

Ready to use Z.ai: GLM 4.5 Air?

Get an API key and start making requests in minutes.

Get an API key