Iraklis Varlamis


2017

pdf bib
A Graph-based Text Similarity Measure That Employs Named Entity Information
Leonidas Tsekouras | Iraklis Varlamis | George Giannakopoulos
Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017

Text comparison is an interesting though hard task, with many applications in Natural Language Processing. This work introduces a new text-similarity measure, which employs named-entities’ information extracted from the texts and the n-gram graphs’ model for representing documents. Using OpenCalais as a named-entity recognition service and the JINSECT toolkit for constructing and managing n-gram graphs, the text similarity measure is embedded in a text clustering algorithm (k-Means). The evaluation of the produced clusters with various clustering validity metrics shows that the extraction of named entities at a first step can be profitable for the time-performance of similarity measures that are based on the n-gram graph representation without affecting the overall performance of the NLP task.

2010

pdf bib
SemanticRank: Ranking Keywords and Sentences Using Semantic Graphs
George Tsatsaronis | Iraklis Varlamis | Kjetil Nørvåg
Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)