no code implementations • 22 Jul 2023 • Yao Liu, Gangfeng Cui, Jiahui Luo, Xiaojun Chang, Lina Yao
Subsequently, a frame features learning module and a two-stream multi-level feature aggregation module extract global and partial features from the sampled frames, effectively representing the local-region spatial information, appearance information, and motion information related to the interactions.