1 code implementation • ICCV 2023 • Yisheng Zhu, Hu Han, Zhengtao Yu, Guangcan Liu
Specifically, we design a Relative Visual Tempo Learning (RVTL) task to explore the motion information in intra-video clips, and an Appearance-Consistency (AC) task to learn appearance information simultaneously, resulting in more representative spatiotemporal features.