DFIE3D: 3D-Aware Disentangled Face Inversion and Editing via Facial-Contrastive Learning
Authors: Zhu, X., Zhou, J., You, L., Yang, X., Chang, J., Zhang, J.J. and Zeng, D.
Journal: IEEE Transactions on Circuits and Systems for Video Technology
Volume: 34
Issue: 9
Pages: 8310-8326
eISSN: 1558-2205
ISSN: 1051-8215
DOI: 10.1109/TCSVT.2024.3377121
Abstract:Recent advances in NeRF-based 3D-aware GANs have achieved outstanding performance, especially in the realm of human facial representations, making projection of facial images back into their latent space superior and preferable compared to 2D GAN inversion. However, the direct application of 2DGAN inversion techniques to 3DGAN raises challenges due to potential appearance distortions and geometric inconsistences. To tackle these issues, this work presents a novel integrated framework that combines a composite inversion pipeline in both the SS and W+ spaces and integrates a contrastive-based training strategy, ensuring proficient disentanglement within the module. Moreover, we design a facial semantic manipulation technique based on dimensional analysis of the latent code, which is fully compatible with the proposed 3DGAN inversion pipeline. Comprehensive experimental validations substantiate the effectiveness of the proposed approach in executing 3d-aware face inversion and semantic editing tasks, presenting a robust technological solution for a diverse array of digital human modeling applications in the downstream.
https://eprints.bournemouth.ac.uk/40106/
Source: Scopus
DFIE3D: 3D-Aware Disentangled Face Inversion and Editing Via Facial-contrastive Learning
Authors: Zhu, X., Zhou, J., You, L., Yang, X., Chang, J., Zhang, J.J. and Zeng, D.
Journal: IEEE Transactions on Circuits and Systems for Video Technology
ISSN: 1051-8215
Abstract:Recent advances in NeRF-based 3D-aware GANs have achieved outstanding performance, especially in the realm of human facial representations, making projection of facial images back into their latent space superior and preferable compared to 2D GAN inversion. However, the direct application of 2DGAN inversion techniques to 3DGAN raises challenges due to potential appearance distortions and geometric inconsistences. To tackle these issues, this work presents a novel integrated framework that combines a composite inversion pipeline in both the SS and W+ spaces and integrates a contrastive-based training strategy, ensuring proficient disentanglement within the module. Moreover, we design a facial semantic manipulation technique based on dimensional analysis of the latent code, which is fully compatible with the proposed 3DGAN inversion pipeline. Comprehensive experimental validations substantiate the effectiveness of the proposed approach in executing 3d-aware face inversion and semantic editing tasks, presenting a robust technological solution for a diverse array of digital human modeling applications in the downstream.
https://eprints.bournemouth.ac.uk/40106/
Source: BURO EPrints