z-ai
Z.ai: GLM 4.5 (retires Dec 31)
GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...
Context
131k tokens
Input / 1M tokens
$0.60
Output / 1M tokens
$2.20
Benchmark
—
Pricing
Input tokens$0.60 per 1M tokens
Output tokens$2.20 per 1M tokens
Cache read$0.11 per 1M tokens
Technical details
Model IDz-ai/glm-4.5
Context window131k tokens
Input modalitiestext
Output modalitiestext
TokenizerOther
Max output tokens98,304
Use with MegaBrain
import OpenAI from 'openai'
const client = new OpenAI({
baseURL: 'https://getmegabrain.com/api/gateway/v1',
apiKey: process.env.MEGABRAIN_API_KEY,
})
const response = await client.chat.completions.create({
model: 'z-ai/glm-4.5',
messages: [{ role: 'user', content: 'Hello!' }],
})Ready to use Z.ai: GLM 4.5 (retires Dec 31)?
Get an API key and start making requests in minutes.
Get an API key