CtrlAvatar: Controllable Avatars Generation via Disentangled Invertible Networks

[AAAI 2025]
Wenfeng Song1, Yang Ding1, Fei Hou2,3, Shuai Li4,5*, Aimin Hao4, Xia Hou1
1College of Computer Science, Beijing Information Science and Technology University
2Key Laboratory of System Software (CAS), State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, China
3University of Chinese Academy of Sciences, China
4State Key Laboratory of Virtual Reality Technology and Systems, Beihang University
5Zhongguancun Laboratory, China
* Corresponding Author

Abstract

As virtual experiences grow in popularity, the demand for realistic, personalized, and animatable human avatars increases. Traditional methods, relying on fixed templates, often produce costly avatars that lack expressiveness and realism. To overcome these challenges, we introduce Controllable Avatars generation via disentangled invertible networks (CtrlAvatar), a real-time framework for generating lifelike and customizable avatars. CtrlAvatar uses disentangled invertible networks to separate the deformation process into implicit body geometry and explicit texture components. This approach eliminates the need for repeated occupancy reconstruction, enabling detailed and coherent animations. The body geometry component ensures anatomical accuracy, while the texture component allows for complex, artifact-free clothing customization. This architecture ensures smooth integration between body movements and surface details. By optimizing transformations with position-varying offsets from the avatar’s initial Linear Blend Skinning vertices, CtrlAvatar achieves flexible, natural deformations that adapt to various scenarios. Extensive experiments show that CtrlAvatar outperforms other methods in quality, diversity, controllability, and cost-efficiency, marking a significant advancement in avatar generation.
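
To make the idea of "position-varying offsets from the avatar's initial Linear Blend Skinning vertices" concrete, the following is a minimal NumPy sketch, not the authors' implementation. It poses vertices with standard Linear Blend Skinning and then adds a per-vertex offset. The function names, the `offset_fn` callable, and the choice to evaluate the offset at the skinned positions are assumptions made for illustration; in CtrlAvatar the offsets would come from a learned network.

```python
import numpy as np

def linear_blend_skinning(vertices, weights, transforms):
    """Pose vertices with standard Linear Blend Skinning.

    vertices:   (V, 3) rest-pose positions
    weights:    (V, J) skinning weights, each row summing to 1
    transforms: (J, 4, 4) homogeneous bone transforms
    """
    ones = np.ones((vertices.shape[0], 1))
    homogeneous = np.concatenate([vertices, ones], axis=1)        # (V, 4)
    # Blend the bone transforms per vertex, then apply them.
    blended = np.einsum("vj,jab->vab", weights, transforms)       # (V, 4, 4)
    posed = np.einsum("vab,vb->va", blended, homogeneous)         # (V, 4)
    return posed[:, :3]

def deform_with_offsets(vertices, weights, transforms, offset_fn):
    """Hypothetical helper: LBS result plus a position-dependent offset.

    offset_fn is an assumed stand-in for a learned offset predictor;
    it maps (V, 3) positions to (V, 3) displacements.
    """
    posed = linear_blend_skinning(vertices, weights, transforms)
    return posed + offset_fn(posed)
```

For example, with two bones (an identity transform and a translation by 2 along y) and a vertex weighted equally between them, the skinned vertex moves by 1 along y before any offset is applied.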

Pipeline

CtrlAvatar has two key parts: (1) Disentangled Invertible Networks, which use an Invertible Delta Network to refine the avatar's implicit geometry for more realistic results; and (2) Controllable Avatar Generation, which uses an explicit texture avatar to generate a realistic appearance.
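
This page does not describe the internals of the Invertible Delta Network. As a generic illustration of how a network can be invertible by construction, the sketch below implements a single additive coupling layer on 3D points in NumPy. The class name, the tiny random-weight MLP, and the choice of which coordinate is held fixed are all assumptions for illustration, and this is not claimed to be the authors' architecture.

```python
import numpy as np

class AdditiveCoupling:
    """Illustrative additive coupling layer on 3D points (hypothetical).

    The first coordinate is passed through unchanged and conditions a
    shift applied to the other two, so the inverse is exact: subtract
    the same shift.
    """

    def __init__(self, rng, hidden=16):
        # Random weights stand in for trained parameters.
        self.w1 = rng.normal(size=(1, hidden))
        self.w2 = rng.normal(size=(hidden, 2))

    def _shift(self, keep):
        # Small MLP: (N, 1) -> (N, 2)
        return np.tanh(keep @ self.w1) @ self.w2

    def forward(self, points):
        keep, move = points[:, :1], points[:, 1:]
        return np.concatenate([keep, move + self._shift(keep)], axis=1)

    def inverse(self, points):
        keep, moved = points[:, :1], points[:, 1:]
        return np.concatenate([keep, moved - self._shift(keep)], axis=1)
```

Because the shift depends only on the coordinate that is left untouched, `inverse(forward(p))` recovers `p` up to floating-point error. Stacking several such layers with different held-out coordinates lets every coordinate be transformed while the whole map stays invertible.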

Results


More Results

Acknowledgements

This work was supported by the Beijing Natural Science Foundation (L232102), the National Natural Science Foundation of China (62441201, 62272021), the Beijing Science and Technology Plan Project (Z231100005923039), the National Key R&D Program of China (No. 2023YFF1203803), the Basic Research Project of ISCAS (ISCAS-JCMS-202303), and the Major Research Project of ISCAS (ISCAS-ZD-202401).

BibTeX

@inproceedings{song2025ctrlavatar,
  title={CtrlAvatar: Controllable Avatars Generation via Disentangled Invertible Networks},
  author={Song, Wenfeng and Ding, Yang and Hou, Fei and Li, Shuai and Hao, Aimin and Hou, Xia},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={39},
  number={7},
  pages={6959--6967},
  year={2025}
}