1 code implementation • 19 Dec 2023 • Zhihang Liu, Jun Li, Hongtao Xie, Pandeng Li, Jiannan Ge, Sun-Ao Liu, Guoqing Jin
In this paper, we introduce Modal-Enhanced Semantic Modeling (MESM), a novel framework for more balanced alignment through enhancing features at two levels.
2 code implementations • ICMR 2021 • Zilong Fu, Guoqing Jin, Hongtao Xie, Junbo Guo
To tackle this issue, in this paper, we propose a dual parallel attention network (DPAN), in which a newly designed parallel context attention module (PCAM) is cascaded with the original PPAM, using linguistic contextual information to compensate for the information inconsistency between queries and keys.
Ranked #13 on Scene Text Recognition on ICDAR2013
3 code implementations • 18 Jul 2017 • Shiwei Shen, Guoqing Jin, Ke Gao, Yongdong Zhang
Although neural networks could achieve state-of-the-art performance while recongnizing images, they often suffer a tremendous defeat from adversarial examples--inputs generated by utilizing imperceptible but intentional perturbation to clean samples from the datasets.