no code implementations • 19 May 2024 • Aditya Kumar Singh, Dhruv Srivastava, Makarand Tapaswi
We introduce multimodal story summarization by leveraging TV episode recaps - short video sequences interweaving key story moments from previous episodes to bring viewers up to speed.
1 code implementation • CVPR 2023 • Dhruv Srivastava, Aditya Kumar Singh, Makarand Tapaswi
Towards this goal, we formulate emotion understanding as predicting a diverse and multi-label set of emotions at the level of a movie scene and for each character.
no code implementations • 6 Jan 2022 • Aditya Kumar Singh, B. Uma Shankar
This report aims to label the satellite image chips of the Amazon rainforest with atmospheric and various classes of land cover or land use through different machine learning and superior deep learning models.