Search Results for author: Maxime Burchi

Found 6 papers, 4 papers with code

MuDreamer: Learning Predictive World Models without Reconstruction

no code implementations23 May 2024 Maxime Burchi, Radu Timofte

In this paper, we present MuDreamer, a robust reinforcement learning agent that builds upon the DreamerV3 algorithm by learning a predictive world model without the need for reconstructing input signals.

Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer

no code implementations14 Mar 2024 Maxime Burchi, Krishna C. Puvvada, Jagadeesh Balam, Boris Ginsburg, Radu Timofte

Humans are adept at leveraging visual cues from lip movements for recognizing speech in adverse listening conditions.

Audio-Visual Efficient Conformer for Robust Speech Recognition

1 code implementation4 Jan 2023 Maxime Burchi, Radu Timofte

We improve previous lip reading methods using an Efficient Conformer back-end on top of a ResNet-18 visual front-end and by adding intermediate CTC losses between blocks.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration

2 code implementations22 Sep 2022 Marcos V. Conde, Ui-Jin Choi, Maxime Burchi, Radu Timofte

Using this method we can tackle the major issues in training transformer vision models, such as training instability, resolution gaps between pre-training and fine-tuning, and hunger on data.

Compressed Image Super-resolution Image Super-Resolution +1

Conformer and Blind Noisy Students for Improved Image Quality Assessment

1 code implementation27 Apr 2022 Marcos V. Conde, Maxime Burchi, Radu Timofte

Learning-based approaches for perceptual image quality assessment (IQA) usually require both the distorted and reference image for measuring the perceptual quality accurately.

Blind Image Quality Assessment Image Restoration +3

Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition

1 code implementation31 Aug 2021 Maxime Burchi, Valentin Vielzeuf

The recently proposed Conformer architecture has shown state-of-the-art performances in Automatic Speech Recognition by combining convolution with attention to model both local and global dependencies.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Cannot find the paper you are looking for? You can Submit a new open access paper.