1 code implementation • 10 Jan 2024 • Mayug Maniparambil, Raiymbek Akshulakov, Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Mohamed El Amine Seddik, Karttikeya Mangalam, Noel E. O'Connor
In the absence of statistical similarity in aligned encoders like CLIP, we show that a possible matching of unaligned encoders exists without any training.
1 code implementation • 23 Nov 2023 • Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Eustache Le Bihan, Haithem Boussaid, Ebtessam Almazrouei, Merouane Debbah
The Lip Reading Sentences-3 (LRS3) benchmark has primarily been the focus of intense research in visual speech recognition (VSR) during the last few years.
no code implementations • 23 Nov 2023 • Yasser Abdelaziz Dahou Djilali, Kevin McGuiness, Noel O'Connor
We present a novel approach for saliency prediction in images, leveraging parallel decoding in transformers to learn saliency solely from fixation maps.
no code implementations • ICCV 2023 • Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Haithem Boussaid, Ebtessam Almazrouei, Merouane Debbah
Visual Speech Recognition (VSR) differs from the common perception tasks as it requires deeper reasoning over the video sequence, even by human experts.
no code implementations • 27 Apr 2023 • Mastane Achab, REDA ALAMI, Yasser Abdelaziz Dahou Djilali, Kirill Fedyanin, Eric Moulines
Reinforcement learning (RL) allows an agent interacting sequentially with an environment to maximize its long-term expected return.
Distributional Reinforcement Learning reinforcement-learning +1
1 code implementation • ICCV 2021 • Yasser Abdelaziz Dahou Djilali, Tarun Krishna, Kevin McGuinness, Noel E. O'Connor
This performance is achieved using an encoder that is trained in a completely unsupervised way and a relatively lightweight supervised decoder (3. 8 X fewer parameters in the case of the ResNet50 encoder).