Dataset Creation for Ranking Constructive News Comments

Soichiro Fujita, Hayato Kobayashi, Manabu Okumura


Abstract
Ranking comments on an online news service is a practically important task for the service provider, and thus there have been many studies on this task. However, most of them considered users’ positive feedback, such as “Like”-button clicks, as a quality measure. In this paper, we address directly evaluating the quality of comments on the basis of “constructiveness,” separately from user feedback. To this end, we create a new dataset including 100K+ Japanese comments with constructiveness scores (C-scores). Our experiments clarify that C-scores are not always related to users’ positive feedback, and the performance of pairwise ranking models tends to be enhanced by the variation of comments rather than articles.
Anthology ID:
P19-1250
Volume:
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
Month:
July
Year:
2019
Address:
Florence, Italy
Editors:
Anna Korhonen, David Traum, Lluís Màrquez
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2619–2626
Language:
URL:
https://aclanthology.org/P19-1250/
DOI:
10.18653/v1/P19-1250
Bibkey:
Cite (ACL):
Soichiro Fujita, Hayato Kobayashi, and Manabu Okumura. 2019. Dataset Creation for Ranking Constructive News Comments. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 2619–2626, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):
Dataset Creation for Ranking Constructive News Comments (Fujita et al., ACL 2019)
Copy Citation:
PDF:
https://aclanthology.org/P19-1250.pdf
Poster:
 P19-1250.Poster.pdf