Qwen3-TTS Technical Report
Hangrui Hu, Xinfa Zhu, Ting He, and 13 others
In this report, we present the Qwen3-TTS series, a family of advanced multilingual, controllable, robust, and streaming text-to-speech models. Qwen3-TTS supports state-of-the-art 3-second voice cloning and description-based control, allowing both the creation of entirely novel voices and fine-graine...