CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation
Haodong Li, Chunmei Qing, Huanyu Zhang +10 more
Recent progress in Unified Multimodal Models (UMMs) has significantly advanced text-to-image (T2I) generation, particularly through the integration of Chain-of-Thought (CoT) reasoning. However, existing CoT-based T2I methods largely rely on abstract natural-language planning, which lacks the pr...