Search Results for author: Yasser Abdelaziz Dahou Djilali

Found 6 papers, 3 papers with code

Do Vision and Language Encoders Represent the World Similarly?

1 code implementation • 10 Jan 2024 • Mayug Maniparambil, Raiymbek Akshulakov, Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Mohamed El Amine Seddik, Karttikeya Mangalam, Noel E. O'Connor

In the absence of statistical similarity in aligned encoders like CLIP, we show that a possible matching of unaligned encoders exists without any training.

Graph Matching Image Classification +3

Do VSR Models Generalize Beyond LRS3?

1 code implementation • 23 Nov 2023 • Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Eustache Le Bihan, Haithem Boussaid, Ebtessam Almazrouei, Merouane Debbah

The Lip Reading Sentences-3 (LRS3) benchmark has primarily been the focus of intense research in visual speech recognition (VSR) during the last few years.

Lip Reading speech-recognition +1

Learning Saliency From Fixations

no code implementations • 23 Nov 2023 • Yasser Abdelaziz Dahou Djilali, Kevin McGuinness, Noel O'Connor

We present a novel approach for saliency prediction in images, leveraging parallel decoding in transformers to learn saliency solely from fixation maps.

Decoder Saliency Prediction

Lip2Vec: Efficient and Robust Visual Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping

no code implementations • ICCV 2023 • Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Haithem Boussaid, Ebtessam Almazrouei, Merouane Debbah

Visual Speech Recognition (VSR) differs from the common perception tasks as it requires deeper reasoning over the video sequence, even by human experts.

Lip Reading speech-recognition +1

One-Step Distributional Reinforcement Learning

no code implementations • 27 Apr 2023 • Mastane Achab, Reda Alami, Yasser Abdelaziz Dahou Djilali, Kirill Fedyanin, Eric Moulines

Reinforcement learning (RL) allows an agent interacting sequentially with an environment to maximize its long-term expected return.

Distributional Reinforcement Learning reinforcement-learning +1

Rethinking 360° Image Visual Attention Modelling With Unsupervised Learning

1 code implementation • ICCV 2021 • Yasser Abdelaziz Dahou Djilali, Tarun Krishna, Kevin McGuinness, Noel E. O'Connor

This performance is achieved using an encoder that is trained in a completely unsupervised way and a relatively lightweight supervised decoder (3.8× fewer parameters in the case of the ResNet50 encoder).

Contrastive Learning Decoder +2
