Zhipu AI·GLM family

GLM-5.2

Open WeightsMoEmitReleased Jun 2026

Z.ai's (formerly Zhipu AI) flagship open-weight coding model with a 1M-token context window. Mixture-of-Experts architecture with 753B total parameters and ~40B active per request, featuring two cost-balancing reasoning modes. Tops several coding benchmarks while remaining a fraction of the cost of comparable proprietary frontier models. MIT-licensed weights.

Capabilities

chatcompletionfunction-callingcode-generationreasoning

Modalities

textcode

Context Window

1.0M tokens

Providers

available

Available from 2 providers

Cheapest

OpenRouter

$2.60/1M tokens

OpenRouter, Zhipu AI

Providers (2)

Sorted by total cost (input + output per 1M tokens). Click a row to view provider details.

Provider	Pricing (per 1M)	Rate Limits	Regions	Health	Latency
OpenRouter	In: $0.60Out: $2.00	200 RPM / 300K TPM	us-east-1	Healthy	0ms
Zhipu AI	In: $0.60Out: $2.00	300 RPM / 500K TPM	us-east-1global	Healthy	0ms

Quick Start

Use this model via OpenRouter with an OpenAI-compatible SDK.

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const response = await client.chat.completions.create({
  model: "z-ai/glm-5.2",
  messages: [
    { role: "user", content: "Hello!" }
  ],
});

console.log(response.choices[0].message.content);

Using OpenRouter API • OpenAI-compatible SDK