no code implementations • 18 Apr 2024 • Han Fang, Xianghao Zang, Chao Ban, Zerun Feng, Lanxiang Zhou, Zhongjiang He, Yongxiang Li, Hao Sun
Text-video retrieval aims to find the most relevant cross-modal samples for a given query.
no code implementations • 13 May 2023 • Han Fang, Zhifei Yang, Xianghao Zang, Chao Ban, Hao Sun
Specifically, after applying attention-based video masking to generate high-informed and low-informed masks, we propose Informed Semantics Completion to recover masked semantics information.
1 code implementation • 12 Feb 2022 • Xianghao Zang, Ge Li, Wei Gao
To fuse multi-scale feature representation, this paper presents a pyramid structure containing global-level information and many pieces of local-level information from different scales.
1 code implementation • 10 Nov 2021 • Xianghao Zang, Ge Li, Wei Gao, Xiujun Shu
In this way, the complex scenes in the ReID task are effectively disentangled, and the burden of each branch is relieved.
Ranked #2 on Person Re-Identification on P-DukeMTMC-reID
1 code implementation • 9 Nov 2021 • Xianghao Zang, Ge Li, Wei Gao, Xiujun Shu
A local-aware module is employed to explore the poentials of local-level feature for unsupervised learning.
Ranked #1 on Unsupervised Person Re-Identification on PRID2011
Unsupervised Person Re-Identification Video-Based Person Re-Identification
2 code implementations • 31 May 2021 • Xiujun Shu, Xiao Wang, Xianghao Zang, Shiliang Zhang, Yuanqi Chen, Ge Li, Qi Tian
We also verified that models pre-trained on LaST can generalize well on existing datasets with short-term and cloth-changing scenarios.