no code implementations • 11 Oct 2023 • Rashid Khan, Bingding Huang, Haseeb Hassan, Asim Zaman, Zhongfu Ye
Image captioning is a challenging task involving generating a textual description for an image using computer vision and natural language processing techniques.
no code implementations • 7 Dec 2022 • Genshun Wan, Tan Liu, Hang Chen, Jia Pan, Cong Liu, Zhongfu Ye
Self-supervised learning (SSL) models have achieved considerable improvements in automatic speech recognition (ASR).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 3 Mar 2022 • Rashid Khan, M Shujah Islam, Khadija Kanwal, Mansoor Iqbal, Md. Imran Hossain, Zhongfu Ye
Image captioning is a fast-growing research field of computer vision and natural language processing that involves creating text explanations for images.