AI Agents

GLM-5: from Vibe Coding to Agentic Engineering

GGLM-5 TeamAAohan ZengXXin LvZZhenyu HouZZhengxiao DuQQinkai ZhengBBin ChenDDa YinCChendi GeCChengxing XieCCunxiang WangGGengzheng PanHHao ZengHHaoke ZhangHHaoran WangHHuilong ChenJJiajie ZhangJJian JiaoJJiaqi GuoJJingsen WangJJingzhao DuJJinzhu WuKKedong WangLLei LiLLin FanLLucen ZhongMMingdao LiuMMingming ZhaoPPengfan DuQQian DongRRui LuSShuang-LiSShulin CaoSSong LiuTTing JiangXXiaodong ChenXXiaohan ZhangXXuancheng HuangXXuezhen DongYYabo XuYYao WeiYYifan AnYYilin NiuYYitong ZhuYYuanhao WenYYukuo CenYYushi BaiZZhongpei QiaoZZihan WangZZikang WangZZilin ZhuZZiqiang LiuZZixuan LiBBojie WangBBosi WenCCan HuangCChangpeng CaiCChao YuCChen LiCChen LiCChenghua HuangCChengwei HuCChenhui ZhangCChenzheng ZhuCCongfeng YinDDaoyan LinDDayong YangDDi WangDDing AiEErle ZhuFFangzhou YiFFeiyu ChenGGuohong WenHHailong SunHHaisha ZhaoHHaiyi HuHHanchen ZhangHHanrui LiuHHanyu ZhangHHao PengHHao TaiHHaobo ZhangHHe LiuHHongwei WangHHongxi YanHHongyu GeHHuan LiuHHuan LiuHHuanpeng ChuJJia'ni ZhaoJJiachen WangJJiajing ZhaoJJiamin RenJJiapeng WangJJiaxin ZhangJJiayi GuiJJiayue ZhaoJJijie LiJJing AnJJing LiJJingwei YuanJJinhua DuJJinxin LiuJJunkai ZhiJJunwen DuanKKaiyue ZhouKKangjian WeiKKe WangKKeyun LuoLLaiqiang ZhangLLeigang ShaLLiang XuLLindong WuLLintao DingLLu ChenMMinghao LiNNianyi LinPPan TaQQiang ZouRRongjun SongRRuiqi YangSShangqing TuSShangtong YangSShaoxiang WuSShengyan ZhangSShijie LiSShuang LiSShuyi FanWWei QinWWei TianWWeining ZhangWWenbo YuWWenjie LiangXXiang KuangXXiangmeng ChengXXiangyang LiXXiaoquan YanXXiaowei HuXXiaoying LingXXing FanXXingye XiaXXinyuan ZhangXXinze ZhangXXirui PanXXunkai ZhangYYandong WuYYanfu LiYYidong WangYYifan ZhuYYijun TanYYilin ZhouYYiming PanYYing ZhangYYinpei SuYYipeng GengYYipeng GengYYong YanYYonglin TanYYuean BiYYuhan ShenYYuhao YangYYujiang LiYYunan LiuYYunqing WangYYuntao LiYYurong WuYYutao ZhangYYuxi DuanYYuxuan ZhangZZezhen LiuZZhengtao JiangZZhenhe YanZZheyu ZhangZZhixiang WeiZZhuo ChenZZhuoer FengZZijun YaoZZiwei ChaiZZiyuan WangZZuzhou ZhangBBin XuMMinlie HuangHHongning WangJJuanzi LiYYuxiao DongJJie Tang
Published
February 17, 2026
Authors
186
Word Count
8,702
Code
Includes code

GLM-5 achieves autonomous software engineering through sparse attention and efficient training adaptation.

Abstract

We present GLM-5, a next-generation foundation model designed to transition the paradigm of vibe coding to agentic engineering. Building upon the agentic, reasoning, and coding (ARC) capabilities of its predecessor, GLM-5 adopts DSA to significantly reduce training and inference costs while maintaining long-context fidelity. To advance model alignment and autonomy, we implement a new asynchronous reinforcement learning infrastructure that drastically improves post-training efficiency by decoupling generation from training. Furthermore, we propose novel asynchronous agent RL algorithms that further improve RL quality, enabling the model to learn from complex, long-horizon interactions more effectively. Through these innovations, GLM-5 achieves state-of-the-art performance on major open benchmarks. Most critically, GLM-5 demonstrates unprecedented capability in real-world coding tasks, surpassing previous baselines in handling end-to-end software engineering challenges. Code, models, and more information are available at https://github.com/zai-org/GLM-5.

Key Takeaways

  • 1

    GLM-5 moves from human-guided vibe coding to fully autonomous agentic engineering that handles end-to-end software tasks.

  • 2

    DeepSeek Sparse Attention reduces computational complexity by intelligently selecting relevant tokens instead of attending to all.

  • 3

    Dense warm-up followed by sparse adaptation dramatically reduces training costs while maintaining performance on long-context tasks.

Limitations

  • Training massive models with 744 billion parameters still requires millions of dollars in computational resources.

  • Sparse attention implementation required custom architectural innovations like Muon Split to avoid performance gaps with standard optimizers.

Keywords

DSAasynchronous reinforcement learningagentic engineeringvibe codingARC capabilitiespost-training efficiencymodel alignmentautonomous agentssoftware engineeringopen benchmarks

More in AI Agents

View all
GLM-5: from Vibe Coding to Agentic Engineering | Paperchime