no code implementations • 3 Jan 2024 • Haopeng Li, Qiuhong Ke, Mingming Gong, Tom Drummond
While significant advancements have been made in video question answering (VideoQA), the potential benefits of enhancing model generalization through tailored difficulty scheduling have been largely overlooked in existing research.
1 code implementation • 3 Jan 2024 • Haopeng Li, Andong Deng, Qiuhong Ke, Jun Liu, Hossein Rahmani, Yulan Guo, Bernt Schiele, Chen Chen
Reasoning over sports videos for question answering is an important task with numerous applications, such as player training and information retrieval.
1 code implementation • 30 Jul 2022 • Zhitong Xiong, Haopeng Li, Xiao Xiang Zhu
To address this problem, we propose to aggregate the learnable covariance matrices with a deformable 4D Transformer to effectively predict the segmentation map.
Ranked #1 on Few-Shot Semantic Segmentation on FSS-1000 (5-shot)
1 code implementation • 19 Jul 2021 • Haopeng Li, Lingbo Liu, Kunlin Yang, Shinan Liu, Junyu Gao, Bin Zhao, Rui Zhang, Jun Hou
Video crowd localization is a crucial yet challenging task that aims to estimate the exact locations of human heads in crowded videos.
no code implementations • 10 May 2021 • Bin Zhao, Haopeng Li, Xiaoqiang Lu, Xuelong Li
Then, the videos are summarized by exploiting both the local and global dependencies among shots.
2 code implementations • 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020 • Haopeng Li, Yuan Yuan, Qi Wang
Video frame interpolation achieves temporal super-resolution by generating smooth transitions between frames.
Ranked #14 on Video Frame Interpolation on Vimeo90K
no code implementations • 4 Feb 2019 • Guo-Qiang Zhang, Haopeng Li, Fabian Wenger
This paper considers object detection and 3D estimation using an FMCW radar.