1 code implementation • 18 Feb 2024 • Yanran Chen, Wei Zhao, Anne Breitbarth, Manuel Stoeckel, Alexander Mehler, Steffen Eger
Even though we have evidence that recent parsers trained on modern treebanks are not heavily affected by data 'noise' such as spelling changes and OCR errors in our historic data, we find that results of syntactic language change are sensitive to the parsers involved, which is a caution against using a single parser for evaluating syntactic language change as done in previous work.
1 code implementation • LREC 2022 • Andy Lücking, Manuel Stoeckel, Giuseppe Abrami, Alexander Mehler
HeidelTime is one of the most widespread and successful tools for detecting temporal expressions in texts.
no code implementations • LREC 2020 • Manuel Stoeckel, Alex Henlein, Wahed Hemati, Alex Mehler, er
Since most of the available Latin word embeddings were trained on either few or inaccurate data, we trained several embeddings on better data in the first step.
no code implementations • LREC 2020 • Giuseppe Abrami, Manuel Stoeckel, Alex Mehler, er
The annotation of texts and other material in the field of digital humanities and Natural Language Processing (NLP) is a common task of research projects.
no code implementations • CONLL 2019 • Sajawel Ahmed, Manuel Stoeckel, Christine Driller, Adrian Pachzelt, Alex Mehler, er
The Specialized Information Service Biodiversity Research (BIOfid) has been launched to mobilize valuable biological data from printed literature hidden in German libraries for over the past 250 years.
no code implementations • WS 2019 • Manuel Stoeckel, Wahed Hemati, Alexander Mehler
The recognition of pharmacological substances, compounds and proteins is an essential preliminary work for the recognition of relations between chemicals and other biomedically relevant units.
no code implementations • 30 Jul 2019 • Manuel Stoeckel, Sajawel Ahmed, Alexander Mehler
We outperform knowledge-based WSD methods by up to 25% F1-score and produce a new state-of-the-art on the German sense-annotated dataset WebCAGe.