VASA-1: Lifelike Audio-Driven Talking Faces
Single portrait photo + speech audio = hyper-realistic talking face video with precise lip-audio sync, lifelike facial behavior, and naturalistic head movements, generated in real time.
Welcome to Incremental Social! Learn more about this project here!
Check out lemmyverse to find more communities to join from here!
Single portrait photo + speech audio = hyper-realistic talking face video with precise lip-audio sync, lifelike facial behavior, and naturalistic head movements, generated in real time.