RANLP 2021 • Khetam Al Sharou, Zhenhao Li, Lucia Specia
In this paper, we propose a definition and taxonomy of various types of non-standard textual content – generally referred to as “noise” – in Natural Language Processing (NLP).
EAMT 2022 • Khetam Al Sharou, Lucia Specia
We also study how the source text contributes to critical errors in the translation and, based on this, propose a set of recommendations on aspects of machine translation (MT) that need further scrutiny, especially for user-generated content, in order to avoid such errors and hence improve online communication.