Search Results for author: Teemu Vahtola

Found 5 papers, 1 papers with code

Coping with Noisy Training Data Labels in Paraphrase Detection

no code implementations • WNUT (ACL) 2021 • Teemu Vahtola, Mathias Creutz, Eetu Sjöblom, Sami Itkonen

We present new state-of-the-art benchmarks for paraphrase detection on all six languages in the Opusparcus sentential paraphrase corpus: English, Finnish, French, German, Russian, and Swedish.

Translation

Paper
Add Code

Modeling Noise in Paraphrase Detection

no code implementations • LREC 2022 • Teemu Vahtola, Eetu Sjöblom, Jörg Tiedemann, Mathias Creutz

Noisy labels in training data present a challenging issue in classification tasks, misleading a model towards incorrect decisions during training.

Paper
Add Code

SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes

no code implementations • 12 Mar 2024 • Timothee Mickus, Elaine Zosa, Raúl Vázquez, Teemu Vahtola, Jörg Tiedemann, Vincent Segonne, Alessandro Raganato, Marianna Apidianaki

This paper presents the results of the SHROOM, a shared task focused on detecting hallucinations: outputs from natural language generation (NLG) systems that are fluent, yet inaccurate.

Machine Translation Paraphrase Generation

Paper
Add Code

Semantic Search as Extractive Paraphrase Span Detection

1 code implementation • 9 Dec 2021 • Jenna Kanerva, Hanna Kitti, Li-Hsin Chang, Teemu Vahtola, Mathias Creutz, Filip Ginter

In this paper, we approach the problem of semantic search by framing the search task as paraphrase span detection, i. e. given a segment of text as a query phrase, the task is to identify its paraphrase in a given document, the same modelling setup as typically used in extractive question answering.

Extractive Question-Answering Question Answering +5

Paper
Code

Grammatical Error Generation Based on Translated Fragments

no code implementations • NoDaLiDa 2021 • Eetu Sjöblom, Mathias Creutz, Teemu Vahtola

We perform neural machine translation of sentence fragments in order to create large amounts of training data for English grammatical error correction.

Grammatical Error Correction Machine Translation +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.