no code implementations • TMM 2022 • Bin Wang, Chunsheng Liu, Faliang Chang, Wenqian Wang and Nanjun Li
Action recognition in video understanding is a challenging task, largely because of the complexity and difficulty in temporal modeling, making it suffer from motion information loss and misalignment of temporal attention in spatial dimensions.
Ranked #28 on Action Recognition on Something-Something V1
no code implementations • 20 Dec 2020 • Xiao Zhang, Chunsheng Liu, Faliang Chang
In this paper, we present a novel architecture which introduces a guidance module to encourage the encoder-decoder model to generate words related to the past and future words in a caption.
no code implementations • 27 Jun 2018 • Youmei Zhang, Chunluan Zhou, Faliang Chang, Alex C. Kot
Occlusions, complex backgrounds, scale variations and non-uniform distributions present great challenges for crowd counting in practical applications.