Generative AI

Skin Tokens: A Learned Compact Representation for Unified Autoregressive Rigging

JJia-peng ZhangCCheng-Feng PuMMeng-Hao GuoYYan-Pei CaoSShi-Min Hu
Published
February 4, 2026
Authors
5
Word Count
9,716

Automated rigging with SkinTokens for 3D animation.

Abstract

The rapid proliferation of generative 3D models has created a critical bottleneck in animation pipelines: rigging. Existing automated methods are fundamentally limited by their approach to skinning, treating it as an ill-posed, high-dimensional regression task that is inefficient to optimize and is typically decoupled from skeleton generation. We posit this is a representation problem and introduce SkinTokens: a learned, compact, and discrete representation for skinning weights. By leveraging an FSQ-CVAE to capture the intrinsic sparsity of skinning, we reframe the task from continuous regression to a more tractable token sequence prediction problem. This representation enables TokenRig, a unified autoregressive framework that models the entire rig as a single sequence of skeletal parameters and SkinTokens, learning the complicated dependencies between skeletons and skin deformations. The unified model is then amenable to a reinforcement learning stage, where tailored geometric and semantic rewards improve generalization to complex, out-of-distribution assets. Quantitatively, the SkinTokens representation leads to a 98%-133% percents improvement in skinning accuracy over state-of-the-art methods, while the full TokenRig framework, refined with RL, enhances bone prediction by 17%-22%. Our work presents a unified, generative approach to rigging that yields higher fidelity and robustness, offering a scalable solution to a long-standing challenge in 3D content creation.

Key Takeaways

  • 1

    Introduces SkinTokens for efficient skinning weight representation.

  • 2

    Unified autoregressive model for joint skeleton and skinning.

  • 3

    Improves rigging automation with reinforcement learning.

Limitations

  • Requires extensive training data for optimal performance.

  • May struggle with extremely complex or unconventional geometries.

Keywords

FSQ-CVAEskinning weightstoken sequence predictionautoregressive frameworkreinforcement learninggeometric rewardssemantic rewards

More in Generative AI

View all
Skin Tokens: A Learned Compact Representation for Unified Autoregressive Rigging | Paperchime