How much does Z.ai: GLM 4.6 cost?

Z.ai: GLM 4.6 costs $0.43 per million input tokens when used through Kilo Code via OpenRouter.

What are Z.ai: GLM 4.6's coding benchmark scores?

Z.ai: GLM 4.6 scores 29.5 on the Artificial Analysis Coding Index. It outputs approximately 44 tokens per second.

How do I use Z.ai: GLM 4.6 in Kilo Code?

Install Kilo Code, open the model selector in the chat panel, search for Z.ai: GLM 4.6, and select it to start coding immediately. No additional setup is required.

All models

Z.ai: GLM 4.6 Coding Benchmark

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...

Context202,752tokens

Max Output131,072tokens

Inputmodality

Price$0.43/1M input

Ranks from Kilo Code Leaderboard Pricing via OpenRouter

Coding Performance

Coding benchmarks and performance metrics for development tasks

Benchmark

Score

Description

AA Coding Index

29.5%

Overall coding capability score

LiveCodeBench

69.5%

Real-world coding task performance

SciCode

38.4%

Scientific computing & algorithms

TerminalBench Hard

25.0%

CLI & terminal command generation

LCR

54.3%

Long context reasoning

IFBench

43.4%

Instruction following accuracy

Speed & Efficiency

Metric

Value

Description

Output Speed

44 tok/s

Median output tokens per second

Performance metrics from Artificial Analysis

Real-World Usage

Real-world usage statistics from the Kilo Code community

Weekly Token Usage

Mode Rankings (Last Week)

Where this model ranks for each built-in mode

Code

Write, modify, and refactor code

No data

Ask

Get answers and explanations

No data

Debug

Diagnose and fix software issues

#94

Orchestrator

Coordinate tasks across multiple modes

No data

Real-world metrics from the Kilo Code Leaderboard

Pricing

Cost per 1 million tokens

Input Tokens

$0.43

per 1M tokens

Output Tokens

$1.74

per 1M tokens

Example Cost

Analyzing a 10,000 line codebase (≈40k input tokens, 10k output tokens) costs approximately $0.0346

Coding Capabilities

Features and parameters relevant to coding tasks

Coding Features

Function Calling

Can call external functions/APIs

Tool Choice

Control over function selection

Structured Outputs

JSON schema validation

Reasoning Tokens

Extended thinking for complex problems

Pricing details from OpenRouter

Technical Details

Architecture and implementation specifications

Model ID: z-ai/glm-4.6
Artificial Analysis Slug: glm-4-6-reasoning
Created: September 30, 2025
Tokenizer: Other
Input Modalities: Text
Context Window: 202,752 tokens
Max Completion Tokens: 131,072 tokens
Input Price: $0.43 per 1M tokens
Output Price: $1.74 per 1M tokens
Cache Read Price: $0.08 per 1M tokens
Content Moderation: Disabled

Ready to try Z.ai: GLM 4.6?

Install Kilo Code and start using Z.ai: GLM 4.6 for your coding projects today. Choose from 500+ AI models with complete freedom.

Install Kilo Code
Get the extension from VS Code Marketplace, JetBrains Plugin Repository, or the CLI.
Open the model selector
Click the model name in the Kilo Code chat panel to open the selector.
Choose your model
Search or browse to find and select your preferred model.
Start coding
Use Code, Ask, Debug, or Plan mode — the model is ready immediately.

Get Started with Kilo

Z.ai: GLM 4.6 Coding Benchmark

Try Z.ai: GLM 4.6 in Kilo Code

Coding Performance

Speed & Efficiency

Real-World Usage

Weekly Token Usage

Mode Rankings (Last Week)

Code

Ask

Debug

Orchestrator

Pricing

Example Cost

Coding Capabilities

Coding Features

Technical Details

Ready to try Z.ai: GLM 4.6?