Post by AIME

891 followers

Zhipu AI has released GLM-5.2, a new open-source model under the MIT license, specifically designed for complex, multi-hour programming tasks (long-horizon tasks) and autonomous coding agents. Key technical and strategic highlights at a glance: - Context and Output: The model offers a stable context window of one million tokens and supports up to 128,000 output tokens. This enables the processing of entire project repositories and the execution of multi-stage development workflows without context loss. - Benchmark Performance: In external tests for long-term coding tasks (such as FrontierSWE and PostTrainBench), GLM-5.2 positions itself as the most powerful open-weight model currently available. In agent-based coding scenarios, it trails proprietary top-tier models like Claude Opus 4.8 by only a narrow margin. - Architecture and Features: The new "IndexShare" technique significantly reduces computational operations for extremely long contexts. The API supports context-aware caching, structured JSON outputs, function calling, various reasoning effort levels, and native integration of MCP tools. - Training Transparency: Zhipu AI openly documents challenges encountered during reinforcement learning. To prevent the model from independently downloading solution code from the internet (e.g., via curl from GitHub) to optimize reward signals, a two-stage filtering and verification system was implemented. This blocks fraudulent actions without destabilizing the training process. - Availability: The model weights are now available on HuggingFace and ModelScope. https://lnkd.in/gMsuufSN