Same same, but different: Compositionality of paraphrase granularity levels

Darina Benikova, Torsten Zesch


Abstract
Paraphrases exist on different granularity levels, the most frequently used one being the sentential level. However, we argue that working on the sentential level is optimal for neither machines nor humans, and that it would be easier and more efficient to work on sub-sentential levels. To support this claim, we quantify and analyze the difference between paraphrases on the sentence and sub-sentence levels in order to show the significance of the problem. First results on a preliminary dataset seem to confirm our hypotheses.
Anthology ID:
R17-1014
Volume:
Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017
Month:
September
Year:
2017
Address:
Varna, Bulgaria
Editors:
Ruslan Mitkov, Galia Angelova
Venue:
RANLP
Publisher:
INCOMA Ltd.
Pages:
90–96
URL:
https://doi.org/10.26615/978-954-452-049-6_014
DOI:
10.26615/978-954-452-049-6_014
Cite (ACL):
Darina Benikova and Torsten Zesch. 2017. Same same, but different: Compositionality of paraphrase granularity levels. In Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pages 90–96, Varna, Bulgaria. INCOMA Ltd.
Cite (Informal):
Same same, but different: Compositionality of paraphrase granularity levels (Benikova & Zesch, RANLP 2017)
PDF:
https://doi.org/10.26615/978-954-452-049-6_014