no code implementations • 21 Mar 2023 • Akash Gupta, Rohun Tripathi, WonDong Jang
Lack of audio-video synchronization is a common problem during television broadcasts and video conferencing, leading to an unsatisfactory viewing experience.
no code implementations • ICCV 2023 • Urwa Muaz, WonDong Jang, Rohun Tripathi, Santhosh Mani, Wenbin Ouyang, Ravi Teja Gadde, Baris Gecer, Sergio Elizondo, Reza Madad, Naveen Nair
Dubbed video generation aims to accurately synchronize mouth movements of a given facial video with driving audio while preserving identity and scene-specific visual dynamics, such as head pose and lighting.
1 code implementation • 19 Jul 2020 • Rohun Tripathi, Vasu Singla, Mahyar Najibi, Bharat Singh, Abhishek Sharma, Larry Davis
The widely adopted sequential variant of Non-Maximum Suppression (Greedy-NMS) is a crucial module in object-detection pipelines.
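For readers unfamiliar with the sequential procedure named here, the following is a minimal sketch of standard Greedy-NMS in plain Python. It is not taken from the listed paper; the function names `iou` and `greedy_nms`, the `(x1, y1, x2, y2)` box layout, and the 0.5 threshold are illustrative assumptions.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes, scores, iou_threshold=0.5):
    """Return indices of kept boxes, visiting boxes in descending score order."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        # Keep a box only if it does not overlap too much with any box kept so far.
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


# Hypothetical example: two heavily overlapping boxes and one separate box.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
kept = greedy_nms(boxes, scores)
```

Each decision depends on the boxes already kept, which is why this variant is sequential and hard to parallelize.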
no code implementations • 12 May 2020 • Rohun Tripathi, Bharat Singh
RSO adds a perturbation to a weight in a deep neural network and tests whether it reduces the loss on a mini-batch.
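The perturb-and-test step described here can be sketched roughly as follows. This is a simplified illustration on a two-weight linear model rather than a deep network, and it is not the authors' implementation; the names `mse_loss` and `perturb_step`, the Gaussian perturbation, and the value of `sigma` are all assumptions made for the example.

```python
import random


def mse_loss(weights, batch):
    """Mean squared error of a linear model y = w0 * x + w1 on a mini-batch."""
    return sum((weights[0] * x + weights[1] - y) ** 2 for x, y in batch) / len(batch)


def perturb_step(weights, batch, loss_fn, sigma=0.1, rng=random):
    """Visit each weight once; keep a random perturbation only if it lowers the loss."""
    best = loss_fn(weights, batch)
    for i in range(len(weights)):
        old = weights[i]
        weights[i] = old + rng.gauss(0.0, sigma)  # assumed Gaussian perturbation
        trial = loss_fn(weights, batch)
        if trial < best:
            best = trial      # perturbation helped on this mini-batch: keep it
        else:
            weights[i] = old  # otherwise revert the weight
    return best


# Hypothetical toy data drawn from y = 2x + 1.
rng = random.Random(0)
data = [(x / 10.0, 2.0 * (x / 10.0) + 1.0) for x in range(-10, 11)]
weights = [0.0, 0.0]
initial_loss = mse_loss(weights, data)
for _ in range(500):
    batch = rng.sample(data, 8)
    perturb_step(weights, batch, mse_loss, sigma=0.1, rng=rng)
final_loss = mse_loss(weights, data)
```

No gradients are computed anywhere: the only signal used is whether the mini-batch loss went down after the change.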
no code implementations • 2 Jul 2018 • Isay Katsman, Rohun Tripathi, Andreas Veit, Serge Belongie
Semantic segmentation is a challenging vision problem that usually necessitates the collection of large amounts of finely annotated data, which is often quite expensive to obtain.
no code implementations • 11 Sep 2017 • Rohun Tripathi, Aman Gill, Riccha Tripati
Raw offline data lacks the temporal information that is available in online data and that models trained on online data require for prediction.