We propose TeRA, the first latent diffusion model specifically designed for text-guided 3D avatar generation. TeRA achieves superior inference speed, text-to-3D alignment, and visual quality, while naturally supporting text-guided structure-aware editing.
ICCV 2025