V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation

Cong Wang1*, Kuan Tian2*, Jun Zhang2†, Yonghang Guan2, Feng Luo2, Fei Shen2,
Zhiwei Jiang1†, Qing Gu1, Xiao Han2, Wei Yang2
1 Nanjing University, 2 Tencent AI Lab
* Equal Contribution, Corresponding Authors

Abstract

In the field of portrait video generation, the use of single images to generate portrait videos has become increasingly prevalent. A common approach involves leveraging generative models to enhance adapters for controlled generation. However, control signals can vary in strength, including text, audio, image reference, pose, depth map, etc. Among these, weaker conditions often struggle to be effective due to interference from stronger conditions, posing a challenge in balancing these conditions. In our work on portrait video generation, we identified audio signals as particularly weak, often overshadowed by stronger signals such as pose and original image. However, direct training with weak signals often leads to difficulties in convergence. To address this, we propose V-Express, a simple method that balances different control signals through a series of progressive drop operations. Our method gradually enables effective control by weak conditions, thereby achieving generation capabilities that simultaneously take into account pose, input image, and audio. The experimental results demonstrate that our method can effectively generate portrait videos controlled by audio. Furthermore, our method provides a potential solution for the simultaneous and effective use of conditions of varying strengths.

MY ALT TEXT

Results Presentation of V-Express

Naive Retargeting




Offset Retargeting




Fix Face


Cases from EMO

Cases from VASA-1

Effect of Reference & Audio Attention Weights

No Retargeting



Fix Face

BibTeX

@article{wang2024v,
  title={V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation},
  author={Wang, Cong and Tian, Kuan and Zhang, Jun and Guan, Yonghang and Luo, Feng and Shen, Fei and Jiang, Zhiwei and Gu, Qing and Han, Xiao and Yang, Wei},
  journal={arXiv preprint arXiv:2406.02511},
  year={2024}
}