no code implementations • 19 Feb 2021 • Neeraj Kumar, Srishti Goel, Ankur Narang, Brejesh lall, Mujtaba Hasan, Pranshu Agarwal, Dipankar Sarkar
We propose a novel method OneShotAu2AV to generate an animated video of arbitrary length using an audio clip and a single unseen image of a person as an input.
no code implementations • 14 Dec 2020 • Neeraj Kumar, Srishti Goel, Ankur Narang, Brejesh lall
High quality multi-speaker speech synthesis while considering prosody and in a few shot manner is an area of active research with many real-world applications.
no code implementations • 14 Dec 2020 • Neeraj Kumar, Srishti Goel, Ankur Narang, Brejesh lall
The multi-modal adaptive normalization uses the various features of audio and video such as Mel spectrogram, pitch, energy from audio signals and predicted keypoint heatmap/optical flow and a single image to learn the respective affine parameters to generate highly expressive video.
no code implementations • 14 Dec 2020 • Neeraj Kumar, Srishti Goel, Ankur Narang, Mujtaba Hasan
High-quality video generation with expressive facial movements is a challenging problem that involves complex learning steps for generative adversarial networks.