René Witte


The LODeXporter: Flexible Generation of Linked Open Data Triples from NLP Frameworks for Automatic Knowledge Base Construction
René Witte | Bahar Sateli
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)


Combining Off-the-shelf Grammar and Spelling Tools for the Automatic Evaluation of Scientific Writing (AESW) Shared Task 2016
René Witte | Bahar Sateli
Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications


Flexible Ontology Population from Text: The OwlExporter
René Witte | Ninus Khamis | Juergen Rilling
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

Ontology population from text is becoming increasingly important for NLP applications. Ontologies in OWL format provide a standardized means of modeling, querying, and reasoning over large knowledge bases. Populated from natural language texts, they offer significant advantages over traditional export formats, such as plain XML. The development of text analysis systems has been greatly facilitated by modern NLP frameworks, such as the General Architecture for Text Engineering (GATE). However, ontology population is not currently supported by a standard component. We developed a GATE resource called the OwlExporter that makes it easy to map existing NLP analysis pipelines to OWL ontologies, thereby allowing language engineers to create ontology population systems without requiring extensive knowledge of ontology APIs. A particular feature of our approach is the concurrent population and linking of a domain- and NLP-ontology, including NLP-specific features such as safe reasoning over coreference chains.

Ontology-Based Extraction and Summarization of Protein Mutation Impact Information
Nona Naderi | René Witte
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing


Minding the Source: Automatic Tagging of Reported Speech in Newspaper Articles
Ralf Krestel | Sabine Bergler | René Witte
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

Reported speech, both direct and indirect, is an important indicator of evidentiality in traditional newspaper texts, and increasingly also in new media that rely heavily on citation and quotation of previous postings, for instance blogs or newsgroups. This paper details the basic processing steps for reported speech analysis and reports on the performance of an implementation in the form of a GATE resource.


A Self-Learning Context-Aware Lemmatizer for German
Praharshana Perera | René Witte
Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing