AI Agents

ASA: Training-Free Representation Engineering for Tool-Calling Agents

YYoujin WangRRun ZhouRRong FuSShuaishuai CaoHHongwei ZengJJiaxuan LuSSicheng FanJJiaqiao ZhaoLLiangming Pan
Published
February 4, 2026
Authors
9
Word Count
9,530
Code
Includes code

Training-free activation steering fixes LLM tool-calling failures without model retraining.

Abstract

Adapting LLM agents to domain-specific tool calling remains notably brittle under evolving interfaces. Prompt and schema engineering is easy to deploy but often fragile under distribution shift and strict parsers, while continual parameter-efficient fine-tuning improves reliability at the cost of training, maintenance, and potential forgetting. We identify a critical Lazy Agent failure mode where tool necessity is nearly perfectly decodable from mid-layer activations, yet the model remains conservative in entering tool mode, revealing a representation-behavior gap. We propose Activation Steering Adapter (ASA), a training-free, inference-time controller that performs a single-shot mid-layer intervention and targets tool domains via a router-conditioned mixture of steering vectors with a probe-guided signed gate to amplify true intent while suppressing spurious triggers. On MTU-Bench with Qwen2.5-1.5B, ASA improves strict tool-use F1 from 0.18 to 0.50 while reducing the false positive rate from 0.15 to 0.05, using only about 20KB of portable assets and no weight updates.

Key Takeaways

  • 1

    ASA uses activation steering at inference time to fix tool-calling failures without retraining models.

  • 2

    Domain-specific steering vectors guide models toward correct tool use across different problem types.

  • 3

    Training-free representation engineering bridges the gap between what models know and what they do.

Limitations

  • Requires computing separate steering vectors for each domain, adding complexity to deployment.

  • Effectiveness depends on accurate domain classification and may not generalize to novel tool types.

Keywords

tool callingactivation steering adaptermid-layer activationsrepresentation-behavior gapsteering vectorsprobe-guided signed gatecontinual parameter-efficient fine-tuningdistribution shiftstrict parsersfalse positive rateQwen2.5-1.5B

More in AI Agents

View all
ASA: Training-Free Representation Engineering for Tool-Calling Agents | Paperchime