Øystein Reigem
2017
Quote Extraction and Attribution from Norwegian Newspapers
Andrew Salway | Paul Meurer | Knut Hofland | Øystein Reigem
Proceedings of the 21st Nordic Conference on Computational Linguistics
Andrew Salway | Paul Meurer | Knut Hofland | Øystein Reigem
Proceedings of the 21st Nordic Conference on Computational Linguistics
2016
Topically-focused Blog Corpora for Multiple Languages
Andrew Salway | Dag Elgesem | Knut Hofland | Øystein Reigem | Lubos Steskal
Proceedings of the 10th Web as Corpus Workshop
Andrew Salway | Dag Elgesem | Knut Hofland | Øystein Reigem | Lubos Steskal
Proceedings of the 10th Web as Corpus Workshop
2006
Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
Anders Nøklestad | Øystein Reigem | Christer Johansson
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
Anders Nøklestad | Øystein Reigem | Christer Johansson
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
Automatic markup and editing of anaphora and coreference is performed within one system. The processing is trained using memory based learning, and representations derive from various lexical resources. The current model reaches an expected combined precision and recall of F=62. The further improvement of the coreference detection is work in progress. Editing of coreference is separated into a module working on an xml-file. The editing mechanism can thus be reused in other projects. The editor is designed to store a copy on the server of all files that are edited over the internet using our demonstrator. This might help us to expand our database of texts annotated for anaphora and coreference. Further research includes creating high coverage lexical resources, and modules for other languages. The current system is trained on Norwegian bokm°al, but we hope to extend this to other languages with available tools (e.g. POS-taggers).