Generative AI

Pathwise Test-Time Correction for Autoregressive Long Video Generation

XXunzhi XiangZZixuan DuanGGuiyu ZhangHHaiyu ZhangZZhe GaoJJunta WuSShaofeng ZhangTTengfei WangQQi FanCChunchao Guo
Published
February 5, 2026
Authors
10
Word Count
8,653
Code
Includes code

Enhances long video generation with path-wise correction.

Abstract

Distilled autoregressive diffusion models facilitate real-time short video synthesis but suffer from severe error accumulation during long-sequence generation. While existing Test-Time Optimization (TTO) methods prove effective for images or short clips, we identify that they fail to mitigate drift in extended sequences due to unstable reward landscapes and the hypersensitivity of distilled parameters. To overcome these limitations, we introduce Test-Time Correction (TTC), a training-free alternative. Specifically, TTC utilizes the initial frame as a stable reference anchor to calibrate intermediate stochastic states along the sampling trajectory. Extensive experiments demonstrate that our method seamlessly integrates with various distilled models, extending generation lengths with negligible overhead while matching the quality of resource-intensive training-based methods on 30-second benchmarks.

Key Takeaways

  • 1

    Introduces Test-Time Correction for autoregressive video generation.

  • 2

    Enhances temporal coherence and visual fidelity.

  • 3

    No need for retraining, only trajectory-aware interventions.

Limitations

  • Focuses on appearance refinement, not global structure.

  • Relies on the quality of the initial frame.

Keywords

distilled autoregressive diffusion modelsTest-Time OptimizationTest-Time Correctionerror accumulationlong-sequence generationreward landscapeshypersensitivitystochastic statessampling trajectory

More in Generative AI

View all
Pathwise Test-Time Correction for Autoregressive Long Video Generation | Paperchime