1 code implementation • ICCV 2021 • Keyu Wen, Jin Xia, Yuanyuan Huang, Linyang Li, Jiayan Xu, Jie Shao
There are two key designs in it, one is the weight-sharing transformer on top of the visual and textual encoders to align text and image semantically, the other is three kinds of contrastive learning designed for sharing knowledge between different modalities.
2 code implementations • ECCV 2020 • Jiajun Tang, Jin Xia, Xinzhi Mu, Bo Pang, Cewu Lu
We propose the Asynchronous Interaction Aggregation network (AIA) that leverages different interactions to boost action detection.
no code implementations • 13 Aug 2019 • Jin Xia, Jiajun Tang, Cewu Lu
We present our three branch solutions for International Challenge on Activity Recognition at CVPR2019.