no code implementations • ICCV 2023 • David Fan, Jue Wang, Shuai Liao, Yi Zhu, Vimal Bhat, Hector Santos-Villalobos, Rohith MV, Xinyu Li
This suggests that the random masking strategy that is inherited from the image MAE is less effective for video MAE.
no code implementations • ICCV 2023 • Najmeh Sadoughi, Xinyu Li, Avijit Vajpayee, David Fan, Bing Shuai, Hector Santos-Villalobos, Vimal Bhat, Rohith MV
Previous research has studied the task of segmenting cinematic videos into scenes and into narrative acts.
no code implementations • 13 Mar 2023 • David Fan, Deyu Yang, Xinyu Li, Vimal Bhat, Rohith MV
Contrastive learning has recently narrowed the gap between self-supervised and supervised methods in the image and video domains.
no code implementations • CVPR 2021 • Shixing Chen, Xiaohan Nie, David Fan, Dongqing Zhang, Vimal Bhat, Raffay Hamid
To assess the effectiveness of ShotCoL on novel applications of scene boundary detection, we take on the problem of finding timestamps in movies and TV episodes where video-ads can be inserted while offering a minimally disruptive viewing experience.
no code implementations • CVPR 2020 • Weifeng Chen, Shengyi Qian, David Fan, Noriyuki Kojima, Max Hamilton, Jia Deng
Single-view 3D is the task of recovering 3D properties such as depth and surface normals from a single image.