no code implementations • NoDaLiDa 2021 • Hinrik Hafsteinsson, Anton Karl Ingason
We describe the process of conversion between the PoS tagging schemes of two languages, the Icelandic MIM-GOLD tagging scheme and the Faroese Sosialurin tagging scheme.
no code implementations • UDW (COLING) 2020 • Þórunn Arnardóttir, Hinrik Hafsteinsson, Einar Freyr Sigurðsson, Kristín Bjarnadóttir, Anton Karl Ingason, Hildur Jónsdóttir, Steinþór Steingrímsson
The topic of this paper is a rule-based pipeline for converting constituency treebanks based on the Penn Treebank format to Universal Dependencies (UD).
no code implementations • ICON 2020 • Hinrik Hafsteinsson, Anton Karl Ingason
We describe the development of a dedicated, high-accuracy part-of-speech (PoS) tagging solution for Faroese, a North Germanic language with about 50, 000 speakers.
no code implementations • LREC 2020 • Hildur J{\'o}nsd{\'o}ttir, Anton Karl Ingason
Two Icelandic treebanks based on phrase-structure grammar have been built and ongoing work aims to convert them to UD.
no code implementations • LREC 2020 • Steinunn Rut Fri{\dh}riksd{\'o}ttir, Anton Karl Ingason
Most spell checkers automatically detect spelling mistakes by looking up individual words and seeing if they exist in the vocabulary.
1 code implementation • LREC 2020 • Anna Björk Nikulásdóttir, Jón Guðnason, Anton Karl Ingason, Hrafn Loftsson, Eiríkur Rögnvaldsson, Einar Freyr Sigurðsson, Steinþór Steingrímsson
In this paper, we describe a new national language technology programme for Icelandic.
no code implementations • LREC 2014 • Anton Karl Ingason, Hrafn Loftsson, Eir{\'\i}kur R{\"o}gnvaldsson, Einar Freyr Sigur{\dh}sson, Joel C. Wallenberg
This paper presents ongoing work that aims to improve machine parsing of Faroese using a combination of Faroese and Icelandic training data.
1 code implementation • LREC 2012 • Eir{\'\i}kur R{\"o}gnvaldsson, Anton Karl Ingason, Einar Freyr Sigur{\dh}sson, Joel Wallenberg
We describe the background for and building of IcePaHC, a one million word parsed historical corpus of Icelandic which has just been finished.