Z.AI: GLM 4.6
by z-ai
Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Superior coding performance: The model achieves higher scores on code benchmarks and...
Coding Performance
Coding benchmarks and performance metrics for development tasks
Coding Benchmarks
Speed & Efficiency
Output Speed
Median output tokens per second
Performance metrics from Artificial Analysis
Real-World Usage
Real-world usage statistics from the Kilo Code community
Weekly Token Usage
Mode Rankings (Last Week)
Where this model ranks for each built-in mode
Architect
Plan and design before implementation
Code
Write, modify, and refactor code
Ask
Get answers and explanations
Debug
Diagnose and fix software issues
Orchestrator
Coordinate tasks across multiple modes
Real-world metrics from the Kilo Code Leaderboard
Pricing
Cost per 1 million tokens
Example Cost
Analyzing a 10,000 line codebase (≈40k input tokens, 10k output tokens) costs approximately $0.0335
Coding Capabilities
Features and parameters relevant to coding tasks
Coding Features
Pricing details from OpenRouter
Technical Details
Architecture and implementation specifications
Ready to try Z.AI: GLM 4.6?
Install Kilo Code and start using Z.AI: GLM 4.6 for your coding projects today. Choose from 400+ AI models with complete freedom.