1 code implementation • sdp (COLING) 2022 • Óscar E. Mendoza, Wojciech Kusa, Alaa El-Ebshihy, Ronin Wu, David Pride, Petr Knoth, Drahomira Herrmannova, Florina Piroi, Gabriella Pasi, Allan Hanbury
We present a new gold-standard dataset and a benchmark for the Research Theme Identification task, a sub-task of the Scholarly Knowledge Graph Generation shared task, at the 3rd Workshop on Scholarly Document Processing.
1 code implementation • LREC 2022 • Suchetha Nambanoor Kunnath, Valentin Stauber, Ronin Wu, David Pride, Viktor Botev, Petr Knoth
This modified corpus has annotations for both citation function and importance classes, newly enriched with supplementary contextual and non-contextual feature sets, the selection of which follows from the lists of features used by the more successful teams in these shared tasks.
no code implementations • 20 Dec 2023 • Emily Groves, Minhong Wang, Yusuf Abdulle, Holger Kunz, Jason Hoelscher-Obermaier, Ronin Wu, Honghan Wu
Five setups were designed to assess ML and FT model performance across different data availability scenarios. Datasets for curation tasks included: task 1 (620, 386), task 2 (611, 430), and task 3 (617, 381), maintaining a 50:50 positive versus negative ratio.
no code implementations • 15 Nov 2022 • Corentin van den Broek d'Obrenan, Frédéric Galliano, Jeremy Minton, Viktor Botev, Ronin Wu
As a practical example, we have used NLP to conduct an interdisciplinary search for compounds that could be carriers for Diffuse Interstellar Bands (DIBs), a long-standing open question in astrophysics.
no code implementations • 27 Oct 2022 • Jason Hoelscher-Obermaier, Edward Stevinson, Valentin Stauber, Ivaylo Zhelev, Victor Botev, Ronin Wu, Jeremy Minton
The most interesting words in scientific texts will often be novel or rare.
no code implementations • 1 Nov 2021 • Mihalis Gongolidis, Jeremy Minton, Ronin Wu, Valentin Stauber, Jason Hoelscher-Obermaier, Viktor Botev
Two new document classification datasets are collated from general and chemistry scientific journals to compare the proposed update training strategies with benchmark models.