no code implementations • NAACL (AmericasNLP) 2021 • Atticus Harrigan, Antti Arppe
This paper details a semi-automatic method of word clustering for the Algonquian language, Nêhiyawêwin (Plains Cree).
no code implementations • NAACL (AmericasNLP) 2021 • Daniel Dacanay, Atticus Harrigan, Arok Wolvengrey, Antti Arppe
One problem in the task of automatic semantic classification is the problem of determining the level on which to group lexical items.
no code implementations • ACL 2022 • William Lane, Atticus Harrigan, Antti Arppe
Accordingly, Lane and Bird (2020) proposed a finite state approach which maps prefixes in a language to a set of possible completions up to the next morpheme boundary, for the incremental building of complex words.
no code implementations • LREC 2022 • Joshua Holden, Christopher Cox, Antti Arppe
This paper describes the expansion of a finite state transducer (FST) for the transitive verb system of Tsuut’ina (ISO 639-3: srs), a Dene (Athabaskan) language spoken in Alberta, Canada.
no code implementations • LREC 2020 • Aleksi Sahala, Miikka Silfverberg, Antti Arppe, Krister Lind{\'e}n
Akkadian is a fairly well resourced extinct language that does not yet have a comprehensive morphological analyzer available.
no code implementations • LREC 2020 • Aleksi Sahala, Miikka Silfverberg, Antti Arppe, Krister Lind{\'e}n
Several Akkadian text corpora contain only the transliterated text.
no code implementations • COLING 2018 • Patrick Littell, Anna Kazantseva, Rol Kuhn, , Aidan Pine, Antti Arppe, Christopher Cox, Marie-Odile Junker
In this article, we discuss which text, speech, and image technologies have been developed, and would be feasible to develop, for the approximately 60 Indigenous languages spoken in Canada.
Optical Character Recognition Optical Character Recognition (OCR) +7
no code implementations • LREC 2016 • Isabell Hubert, Antti Arppe, Jordan Lachler, Eddie Antonio Santos
We are presenting our work on the creation of the first optical character recognition (OCR) model for Northern Haida, also known as Masset or Xaad Kil, a nearly extinct First Nations language spoken in the Haida Gwaii archipelago in British Columbia, Canada.