AI Agents

Agentic Reasoning for Large Language Models

TTianxin WeiTTing-Wei LiZZhining LiuXXuying NingZZe YangJJiaru ZouZZhichen ZengRRuizhong QiuXXiao LinDDongqi FuZZihao LiMMengting AiDDuo ZhouWWenxuan BaoYYunzhe LiGGaotang LiCCheng QianYYu WangXXiangru TangYYin XiaoLLiri FangHHui LiuXXianfeng TangYYuji ZhangCChi WangJJiaxuan YouHHeng JiHHanghang TongJJingrui He
arXiv ID
2601.12538
Published
January 18, 2026
Authors
29
Hugging Face Likes
149
Comments
4

Abstract

Reasoning is a fundamental cognitive process underlying inference, problem-solving, and decision-making. While large language models (LLMs) demonstrate strong reasoning capabilities in closed-world settings, they struggle in open-ended and dynamic environments. Agentic reasoning marks a paradigm shift by reframing LLMs as autonomous agents that plan, act, and learn through continual interaction. In this survey, we organize agentic reasoning along three complementary dimensions. First, we characterize environmental dynamics through three layers: foundational agentic reasoning, which establishes core single-agent capabilities including planning, tool use, and search in stable environments; self-evolving agentic reasoning, which studies how agents refine these capabilities through feedback, memory, and adaptation; and collective multi-agent reasoning, which extends intelligence to collaborative settings involving coordination, knowledge sharing, and shared goals. Across these layers, we distinguish in-context reasoning, which scales test-time interaction through structured orchestration, from post-training reasoning, which optimizes behaviors via reinforcement learning and supervised fine-tuning. We further review representative agentic reasoning frameworks across real-world applications and benchmarks, including science, robotics, healthcare, autonomous research, and mathematics. This survey synthesizes agentic reasoning methods into a unified roadmap bridging thought and action, and outlines open challenges and future directions, including personalization, long-horizon interaction, world modeling, scalable multi-agent training, and governance for real-world deployment.

Keywords

large language modelsagentic reasoningautonomous agentsplanningtool usesearchfeedbackmemoryadaptationcollaborative settingscoordinationknowledge sharingreinforcement learningsupervised fine-tuningin-context reasoningpost-training reasoningreal-world applicationsbenchmarksthought and actionworld modelingscalable multi-agent traininggovernance

More in AI Agents

View all
Agentic Reasoning for Large Language Models | Paperchime