Corpus-based Check-up for Thesaurus

Natalia Loukachevitch


Abstract
In this paper we discuss the usefulness of applying a checking procedure to existing thesauri. The procedure is based on the analysis of discrepancies of corpus-based and thesaurus-based word similarities. We applied the procedure to more than 30 thousand words of the Russian wordnet and found some serious errors in word sense description, including inaccurate relationships and missing senses of ambiguous words.
Anthology ID:
P19-1577
Volume:
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
Month:
July
Year:
2019
Address:
Florence, Italy
Editors:
Anna Korhonen, David Traum, Lluís Màrquez
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
5773–5779
Language:
URL:
https://aclanthology.org/P19-1577
DOI:
10.18653/v1/P19-1577
Bibkey:
Cite (ACL):
Natalia Loukachevitch. 2019. Corpus-based Check-up for Thesaurus. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 5773–5779, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):
Corpus-based Check-up for Thesaurus (Loukachevitch, ACL 2019)
Copy Citation:
PDF:
https://aclanthology.org/P19-1577.pdf
Video:
 https://aclanthology.org/P19-1577.mp4
Data
SemEval-2018 Task-9