Tom Kenter


pdf bib
Frugal Paradigm Completion
Alexander Erdmann | Tom Kenter | Markus Becker | Christian Schallhart
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics

Lexica distinguishing all morphologically related forms of each lexeme are crucial to many language technologies, yet building them is expensive. We propose a frugal paradigm completion approach that predicts all related forms in a morphological paradigm from as few manually provided forms as possible. It induces typological information during training which it uses to determine the best sources at test time. We evaluate our language-agnostic approach on 7 diverse languages. Compared to popular alternative approaches, ours reduces manual labor by 16-63% and is the most robust to typological variation.


pdf bib
Siamese CBOW: Optimizing Word Embeddings for Sentence Representations
Tom Kenter | Alexey Borisov | Maarten de Rijke
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)


pdf bib
Lexicon Construction and Corpus Annotation of Historical Language with the CoBaLT Editor
Tom Kenter | Tomaž Erjavec | Maja Žorga Dulmin | Darja Fišer
Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities