Search Results for author: Giulia Comini

Found 7 papers, 0 papers with code

Del Visual al Auditivo: Sonorización de Escenas Guiada por Imagen

no code implementations • 2 Feb 2024 • María Sánchez, Laura Fernández, Julián Arias, Mateo Cámara, Giulia Comini, Adam Gabrys, José Luis Blanco, Juan Ignacio Godino, Luis Alfonso Hernández

We present a processing pipeline that, starting from images extracted from videos, generates sound for them.

Contrastive Learning

Multilingual context-based pronunciation learning for Text-to-Speech

no code implementations • 31 Jul 2023 • Giulia Comini, Manuel Sam Ribeiro, Fan Yang, Heereen Shim, Jaime Lorenzo-Trueba

Phonetic information and linguistic knowledge are essential components of a text-to-speech (TTS) front-end.

Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings

no code implementations • 31 Jul 2023 • Manuel Sam Ribeiro, Giulia Comini, Jaime Lorenzo-Trueba

The G2P model is used to train a multilingual phone recognition system, which then decodes speech recordings with a phonetic representation.

Speech Recognition

Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation

no code implementations • 29 Jul 2022 • Giulia Comini, Goeric Huybrechts, Manuel Sam Ribeiro, Adam Gabrys, Jaime Lorenzo-Trueba

The availability of data in expressive styles across languages is limited, and recording sessions are costly and time consuming.

Data Augmentation • Voice Conversion

Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module

no code implementations • 16 Feb 2022 • Adam Gabryś, Goeric Huybrechts, Manuel Sam Ribeiro, Chung-Ming Chien, Julian Roth, Giulia Comini, Roberto Barra-Chicote, Bartek Perz, Jaime Lorenzo-Trueba

It uses voice conversion (VC) as a post-processing module appended to a pre-existing high-quality TTS system and marks a conceptual shift in the existing TTS paradigm, framing the few-shot TTS problem as a VC task.

Speech Synthesis • Voice Conversion

Cross-speaker style transfer for text-to-speech using data augmentation

no code implementations • 10 Feb 2022 • Manuel Sam Ribeiro, Julian Roth, Giulia Comini, Goeric Huybrechts, Adam Gabrys, Jaime Lorenzo-Trueba

The proposed approach relies on voice conversion to first generate high-quality data from the set of supporting expressive speakers.

Data Augmentation • Style Transfer +1

Low-resource expressive text-to-speech using data augmentation

no code implementations • 11 Nov 2020 • Goeric Huybrechts, Thomas Merritt, Giulia Comini, Bartek Perz, Raahil Shah, Jaime Lorenzo-Trueba

While recent neural text-to-speech (TTS) systems perform remarkably well, they typically require a substantial amount of recordings from the target speaker reading in the desired speaking style.

Data Augmentation • Voice Conversion
