VerbCROcean: A Repository of Fine-Grained Semantic Verb Relations for Croatian

Ivan Sekulić, Jan Šnajder


Abstract
In this paper we describe VerbCROcean, a broad-coverage repository of fine-grained semantic relations between Croatian verbs. Adopting the methodology that Chklovski and Pantel (2004) used to acquire the English VerbOcean, we first acquire semantically related verb pairs from the hrWaC web corpus by relying on the distributional similarity of subject-verb-object paths in dependency trees. We then classify the semantic relation between each pair of verbs as similarity, intensity, antonymy, or happens-before, using a number of manually constructed lexico-syntactic patterns. We evaluate the quality of the resulting resource on a manually annotated sample of 1000 semantic verb relations. The evaluation shows that the predictions are most accurate for the similarity relation and least accurate for the intensity relation. We make available two variants of VerbCROcean: a coverage-oriented version containing about 36k verb pairs at a precision of 41%, and a precision-oriented version containing about 5k verb pairs at a precision of 56%.
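The pattern-based classification step can be illustrated with a minimal sketch. It is not the authors' implementation: the `PATTERNS` table uses hypothetical English stand-ins rather than the paper's Croatian patterns, the `classify_pair` helper is invented for illustration, and plain match counting stands in for whatever scoring and thresholds the paper actually applies.

```python
import re
from collections import Counter

# Hypothetical English stand-ins for lexico-syntactic patterns;
# the paper's actual Croatian patterns are not reproduced here.
PATTERNS = {
    "similarity": ["{x} and {y}"],
    "intensity": ["{x} and even {y}"],
    "antonymy": ["either {x} or {y}"],
    "happens-before": ["{x} and then {y}"],
}


def classify_pair(x, y, sentences):
    """Return the relation whose patterns match the verb pair most often.

    Assumption: raw match counts decide the label. Returns None when
    no pattern matches at all.
    """
    counts = Counter()
    for relation, templates in PATTERNS.items():
        for template in templates:
            phrase = template.format(x=x, y=y)
            regex = re.compile(r"\b" + re.escape(phrase) + r"\b")
            counts[relation] += sum(len(regex.findall(s)) for s in sentences)
    if max(counts.values()) == 0:
        return None
    return counts.most_common(1)[0][0]


sentences = [
    "They wound and even kill the prey.",
    "You either win or lose.",
    "First wash and then dry the dishes.",
]
print(classify_pair("wound", "kill", sentences))
print(classify_pair("win", "lose", sentences))
print(classify_pair("wash", "dry", sentences))
```

In a real setting the verb pairs would come from the distributional-similarity step and the patterns would be matched against a large corpus such as hrWaC, with inflected verb forms taken into account.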
Anthology ID:
L16-1425
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
2676–2681
URL:
https://aclanthology.org/L16-1425
Cite (ACL):
Ivan Sekulić and Jan Šnajder. 2016. VerbCROcean: A Repository of Fine-Grained Semantic Verb Relations for Croatian. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 2676–2681, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
VerbCROcean: A Repository of Fine-Grained Semantic Verb Relations for Croatian (Sekulić & Šnajder, LREC 2016)
PDF:
https://aclanthology.org/L16-1425.pdf