1 code implementation • 7 Nov 2023 • Neng Dong, Shuanglin Yan, Hao Tang, Jinhui Tang, Liyan Zhang
Moreover, as multiple images with the same identity are not accessible in the testing stage, we devise an Information Propagation (IP) mechanism to distill knowledge from the comprehensive representation to that of a single occluded image.
no code implementations • 17 Oct 2023 • Shuanglin Yan, Neng Dong, Jun Liu, Liyan Zhang, Jinhui Tang
Since the support set is unavailable during inference, we propose to distill the knowledge learned by the "richer" model into a lightweight model for inference with a single image/text as input.
no code implementations • 6 Aug 2023 • Hao Tang, Jun Liu, Shuanglin Yan, Rui Yan, Zechao Li, Jinhui Tang
Due to the scarcity of manually annotated data required for fine-grained video understanding, few-shot fine-grained (FS-FG) action recognition has gained significant attention, with the aim of classifying novel fine-grained action categories with only a few labeled instances.
1 code implementation • 14 Jul 2023 • Neng Dong, Liyan Zhang, Shuanglin Yan, Hao Tang, Jinhui Tang
Occlusion perturbation presents a significant challenge in person re-identification (re-ID), and existing methods that rely on external visual cues require additional computational resources and only consider the issue of missing information caused by occlusion.
1 code implementation • 19 Oct 2022 • Shuanglin Yan, Neng Dong, Liyan Zhang, Jinhui Tang
Secondly, cross-grained feature refinement (CFR) and fine-grained correspondence discovery (FCD) modules are proposed to establish the cross-grained and fine-grained interactions between modalities, which can filter out non-modality-shared image patches/words and mine cross-modal correspondences from coarse to fine.
no code implementations • 30 Aug 2022 • Shuanglin Yan, Hao Tang, Liyan Zhang, Jinhui Tang
Moreover, existing methods seldom consider the information inequality problem between modalities caused by image-specific information.