Evaluating text coherence based on semantic similarity graph

Jan Wira Gotama Putra, Takenobu Tokunaga


Abstract
Coherence is a crucial feature of text because it is indispensable for conveying its communication purpose and meaning to its readers. In this paper, we propose an unsupervised text coherence scoring based on graph construction in which edges are established between semantically similar sentences represented by vertices. The sentence similarity is calculated based on the cosine similarity of semantic vectors representing sentences. We provide three graph construction methods establishing an edge from a given vertex to a preceding adjacent vertex, to a single similar vertex, or to multiple similar vertices. We evaluated our methods in the document discrimination task and the insertion task by comparing our proposed methods to the supervised (Entity Grid) and unsupervised (Entity Graph) baselines. In the document discrimination task, our method outperformed the unsupervised baseline but could not do the supervised baseline, while in the insertion task, our method outperformed both baselines.
Anthology ID:
W17-2410
Volume:
Proceedings of TextGraphs-11: the Workshop on Graph-based Methods for Natural Language Processing
Month:
August
Year:
2017
Address:
Vancouver, Canada
Editors:
Martin Riedl, Swapna Somasundaran, Goran Glavaš, Eduard Hovy
Venue:
TextGraphs
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
76–85
Language:
URL:
https://aclanthology.org/W17-2410
DOI:
10.18653/v1/W17-2410
Bibkey:
Cite (ACL):
Jan Wira Gotama Putra and Takenobu Tokunaga. 2017. Evaluating text coherence based on semantic similarity graph. In Proceedings of TextGraphs-11: the Workshop on Graph-based Methods for Natural Language Processing, pages 76–85, Vancouver, Canada. Association for Computational Linguistics.
Cite (Informal):
Evaluating text coherence based on semantic similarity graph (Putra & Tokunaga, TextGraphs 2017)
Copy Citation:
PDF:
https://aclanthology.org/W17-2410.pdf
Presentation:
 W17-2410.Presentation.pdf