CtrlAvatar

Abstract

As virtual experiences grow in popularity, the demand for realistic, personalized, and animatable human avatars increases. Traditional methods, relying on fixed templates, often produce costly avatars that lack expressiveness and realism. To overcome these challenges, we introduce Controllable Avatars generation via disentangled invertible networks (CtrlAvatar), a real-time framework for generating lifelike and customizable avatars. CtrlAvatar uses disentangled invertible networks to separate the deformation process into implicit body geometry and explicit texture components. This approach eliminates the need for repeated occupancy reconstruction, enabling detailed and coherent animations. The body geometry component ensures anatomical accuracy, while the texture component allows for complex, artifact-free clothing customization. This architecture ensures smooth integration between body movements and surface details. By optimizing transformations with position-varying offsets from the avatar’s initial Linear Blend Skinning vertices, CtrlAvatar achieves flexible, natural deformations that adapt to various scenarios. Extensive experiments show that CtrlAvatar outperforms other methods in quality, diversity, controllability, and cost-efficiency, marking a significant advancement in avatar generation.

Pipeline

We propose the CtrlAvatar with two key parts: (1) Disentangled Invertible Networks, using an Invertible Delta Network to improve the avatar's implicit geometry for more realistic results; (2) Controllable Avatar Generation, employing explicit texture avatar to generate the realistic appearance.

Results

Acknowledgements

This paper is supported by Beijing Natural Science Foundation (L232102), National Natural Science Foundation of China (62441201, 62272021), Beijing Science and Technology Plan Project Z231100005923039, National Key R&D Program of China (No. 2023YFF1203803), Basic Research Project of ISCAS (ISCAS-JCMS-202303), Major Research Project of ISCAS (ISCAS-ZD-202401).

BibTeX

@inproceedings{song2025ctrlavatar,
  title={CtrlAvatar: Controllable Avatars Generation via Disentangled Invertible Networks},
  author={Song, Wenfeng and Ding, Yang and Hou, Fei and Li, Shuai and Hao, Aimin and Hou, Xia},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={39},
  number={7},
  pages={6959--6967},
  year={2025}
}

CtrlAvatar: Controllable Avatars Generation via Disentangled Invertible Networks

Abstract

Pipeline

Results

More Results

Acknowledgements

BibTeX