Search Results for author: Xianghao Zang

Found 6 papers, 4 papers with code

ProTA: Probabilistic Token Aggregation for Text-Video Retrieval

no code implementations • 18 Apr 2024 • Han Fang, Xianghao Zang, Chao Ban, Zerun Feng, Lanxiang Zhou, Zhongjiang He, Yongxiang Li, Hao Sun

Text-video retrieval aims to find the most relevant cross-modal samples for a given query.

Retrieval Video Retrieval

Paper
Add Code

Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval

no code implementations • 13 May 2023 • Han Fang, Zhifei Yang, Xianghao Zang, Chao Ban, Hao Sun

Specifically, after applying attention-based video masking to generate high-informed and low-informed masks, we propose Informed Semantics Completion to recover masked semantics information.

Retrieval Text Retrieval +1

Paper
Add Code

Multi-direction and Multi-scale Pyramid in Transformer for Video-based Pedestrian Retrieval

1 code implementation • 12 Feb 2022 • Xianghao Zang, Ge Li, Wei Gao

To fuse multi-scale feature representation, this paper presents a pyramid structure containing global-level information and many pieces of local-level information from different scales.

Person Re-Identification Retrieval