Holmer Hemsen


pdf bib
Streaming Text Analytics for Real-Time Event Recognition
Philippe Thomas | Johannes Kirschnick | Leonhard Hennig | Renlong Ai | Sven Schmeier | Holmer Hemsen | Feiyu Xu | Hans Uszkoreit
Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017

A huge body of continuously growing written knowledge is available on the web in the form of social media posts, RSS feeds, and news articles. Real-time information extraction from such high velocity, high volume text streams requires scalable, distributed natural language processing pipelines. We introduce such a system for fine-grained event recognition within the big data framework Flink, and demonstrate its capabilities for extracting and geo-locating mobility- and industry-related events from heterogeneous text sources. Performance analyses conducted on several large datasets show that our system achieves high throughput and maintains low latency, which is crucial when events need to be detected and acted upon in real-time. We also present promising experimental results for the event extraction component of our system, which recognizes a novel set of event types. The demo system is available at http://dfki.de/sd4m-sta-demo/.


pdf bib
JEDI: Joint Entity and Relation Detection using Type Inference
Johannes Kirschnick | Holmer Hemsen | Volker Markl
Proceedings of ACL-2016 System Demonstrations


pdf bib
Freepal: A Large Collection of Deep Lexico-Syntactic Patterns for Relation Extraction
Johannes Kirschnick | Alan Akbik | Holmer Hemsen
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

The increasing availability and maturity of both scalable computing architectures and deep syntactic parsers is opening up new possibilities for Relation Extraction (RE) on large corpora of natural language text. In this paper, we present Freepal, a resource designed to assist with the creation of relation extractors for more than 5,000 relations defined in the Freebase knowledge base (KB). The resource consists of over 10 million distinct lexico-syntactic patterns extracted from dependency trees, each of which is assigned to one or more Freebase relations with different confidence strengths. We generate the resource by executing a large-scale distant supervision approach on the ClueWeb09 corpus to extract and parse over 260 million sentences labeled with Freebase entities and relations. We make Freepal freely available to the research community, and present a web demonstrator to the dataset, accessible from free-pal.appspot.com.

pdf bib
A Marketplace for Web Scale Analytics and Text Annotation Services
Johannes Kirschnick | Torsten Kilias | Holmer Hemsen | Alexander Löser | Peter Adolphs | Heiko Ehrig | Holger Düwiger
Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: System Demonstrations


pdf bib
Unsupervised Discovery of Relations and Discriminative Extraction Patterns
Alan Akbik | Larysa Visengeriyeva | Priska Herger | Holmer Hemsen | Alexander Löser
Proceedings of COLING 2012


pdf bib
Unsupervised Relation Extraction From Web Documents
Kathrin Eichler | Holmer Hemsen | Günter Neumann
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

The IDEX system is a prototype of an interactive dynamic Information Extraction (IE) system. A user of the system expresses an information request in the form of a topic description, which is used for an initial search in order to retrieve a relevant set of documents. On basis of this set of documents, unsupervised relation extraction and clustering is done by the system. The results of these operations can then be interactively inspected by the user. In this paper we describe the relation extraction and clustering components of the IDEX system. Preliminary evaluation results of these components are presented and an overview is given of possible enhancements to improve the relation extraction and clustering components.


pdf bib
Evaluation of a Multimodal Dialogue System for Small-screen Devices
Holmer Hemsen
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)


pdf bib
A XML-based tool for evaluation of SLDS
Marcela Charfuelán | Luis Hernández Gómez | Cristina Esteban López | Holmer Hemsen
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)