GLM-5: from Vibe Coding to Agentic Engineering

GGLM-5 TeamAAohan ZengXXin LvZZhenyu HouZZhengxiao DuQQinkai ZhengBBin ChenDDa YinCChendi GeCChengxing XieCCunxiang WangGGengzheng PanHHao ZengHHaoke ZhangHHaoran WangHHuilong ChenJJiajie ZhangJJian JiaoJJiaqi GuoJJingsen WangJJingzhao DuJJinzhu WuKKedong WangLLei LiLLin FanLLucen ZhongMMingdao LiuMMingming ZhaoPPengfan DuQQian DongRRui LuSShuang-LiSShulin CaoSSong LiuTTing JiangXXiaodong ChenXXiaohan ZhangXXuancheng HuangXXuezhen DongYYabo XuYYao WeiYYifan AnYYilin NiuYYitong ZhuYYuanhao WenYYukuo CenYYushi BaiZZhongpei QiaoZZihan WangZZikang WangZZilin ZhuZZiqiang LiuZZixuan LiBBojie WangBBosi WenCCan HuangCChangpeng CaiCChao YuCChen LiCChen LiCChenghua HuangCChengwei HuCChenhui ZhangCChenzheng ZhuCCongfeng YinDDaoyan LinDDayong YangDDi WangDDing AiEErle ZhuFFangzhou YiFFeiyu ChenGGuohong WenHHailong SunHHaisha ZhaoHHaiyi HuHHanchen ZhangHHanrui LiuHHanyu ZhangHHao PengHHao TaiHHaobo ZhangHHe LiuHHongwei WangHHongxi YanHHongyu GeHHuan LiuHHuan LiuHHuanpeng ChuJJia'ni ZhaoJJiachen WangJJiajing ZhaoJJiamin RenJJiapeng WangJJiaxin ZhangJJiayi GuiJJiayue ZhaoJJijie LiJJing AnJJing LiJJingwei YuanJJinhua DuJJinxin LiuJJunkai ZhiJJunwen DuanKKaiyue ZhouKKangjian WeiKKe WangKKeyun LuoLLaiqiang ZhangLLeigang ShaLLiang XuLLindong WuLLintao DingLLu ChenMMinghao LiNNianyi LinPPan TaQQiang ZouRRongjun SongRRuiqi YangSShangqing TuSShangtong YangSShaoxiang WuSShengyan ZhangSShijie LiSShuang LiSShuyi FanWWei QinWWei TianWWeining ZhangWWenbo YuWWenjie LiangXXiang KuangXXiangmeng ChengXXiangyang LiXXiaoquan YanXXiaowei HuXXiaoying LingXXing FanXXingye XiaXXinyuan ZhangXXinze ZhangXXirui PanXXunkai ZhangYYandong WuYYanfu LiYYidong WangYYifan ZhuYYijun TanYYilin ZhouYYiming PanYYing ZhangYYinpei SuYYipeng GengYYipeng GengYYong YanYYonglin TanYYuean BiYYuhan ShenYYuhao YangYYujiang LiYYunan LiuYYunqing WangYYuntao LiYYurong WuYYutao ZhangYYuxi DuanYYuxuan ZhangZZezhen LiuZZhengtao JiangZZhenhe YanZZheyu ZhangZZhixiang WeiZZhuo ChenZZhuoer FengZZijun YaoZZiwei ChaiZZiyuan WangZZuzhou ZhangBBin XuMMinlie HuangHHongning WangJJuanzi LiYYuxiao DongJJie Tang

Published: February 17, 2026
Authors: 186
Word Count: 8,702
Code: Includes code

View on arXiv Download PDF

GLM-5 achieves autonomous software engineering through sparse attention and efficient training adaptation.

Abstract

We present GLM-5, a next-generation foundation model designed to transition the paradigm of vibe coding to agentic engineering. Building upon the agentic, reasoning, and coding (ARC) capabilities of its predecessor, GLM-5 adopts DSA to significantly reduce training and inference costs while maintaining long-context fidelity. To advance model alignment and autonomy, we implement a new asynchronous reinforcement learning infrastructure that drastically improves post-training efficiency by decoupling generation from training. Furthermore, we propose novel asynchronous agent RL algorithms that further improve RL quality, enabling the model to learn from complex, long-horizon interactions more effectively. Through these innovations, GLM-5 achieves state-of-the-art performance on major open benchmarks. Most critically, GLM-5 demonstrates unprecedented capability in real-world coding tasks, surpassing previous baselines in handling end-to-end software engineering challenges. Code, models, and more information are available at https://github.com/zai-org/GLM-5.

Key Takeaways

1
GLM-5 moves from human-guided vibe coding to fully autonomous agentic engineering that handles end-to-end software tasks.
2
DeepSeek Sparse Attention reduces computational complexity by intelligently selecting relevant tokens instead of attending to all.
3
Dense warm-up followed by sparse adaptation dramatically reduces training costs while maintaining performance on long-context tasks.

Limitations

Training massive models with 744 billion parameters still requires millions of dollars in computational resources.
Sparse attention implementation required custom architectural innovations like Muon Split to avoid performance gaps with standard optimizers.

Keywords

DSAasynchronous reinforcement learningagentic engineeringvibe codingARC capabilitiespost-training efficiencymodel alignmentautonomous agentssoftware engineeringopen benchmarks

More in AI Agents

View all

LongCat-Flash-Thinking-2601 Technical Report

Meituan LongCat Team, Anchun Gui +160

We introduce LongCat-Flash-Thinking-2601, a 560-billion-parameter open-source Mixture-of-Experts (MoE) reasoning model with superior agentic reasoning capability. LongCat-Flash-Thinking-2601 achieves ...

Jan 23149

Agentic Reasoning for Large Language Models

Tianxin Wei, Ting-Wei Li +27

Reasoning is a fundamental cognitive process underlying inference, problem-solving, and decision-making. While large language models (LLMs) demonstrate strong reasoning capabilities in closed-world se...

Jan 18149

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Ailin Huang, Ang Li +213

We introduce Step 3.5 Flash, a sparse Mixture-of-Experts (MoE) model that bridges frontier-level agentic intelligence and computational efficiency. We focus on what matters most when building agents: ...

Feb 11140

UI-Venus-1.5 Technical Report

Veuns-Team, Changlong Gao +25

GUI agents have emerged as a powerful paradigm for automating interactions in digital environments, yet achieving both broad generality and consistently strong task performance remains challenging.In ...

Feb 9140

daVinci-Dev: Agent-native Mid-training for Software Engineering

Ji Zeng, Dayuan Fu +15

Recently, the frontier of Large Language Model (LLM) capabilities has shifted from single-turn code generation to agentic software engineering-a paradigm where models autonomously navigate, edit, and ...

Jan 26113

More AI Agents papers