Building Tempo-HindiWordNet: A resource for effective temporal information access in Hindi

Dipawesh Pawar, Mohammed Hasanuzzaman, Asif Ekbal


Abstract
In this paper, we put forward a strategy that supplements Hindi WordNet entries with information on the temporality of their word senses. Each synset of Hindi WordNet is automatically annotated with one of five temporal dimensions: past, present, future, neutral, and atemporal. We use a semi-supervised learning strategy to build temporal classifiers over the glosses of manually selected initial seed synsets. The classification process is iterated, with the initial seed list expanded at each round by a confidence-based strategy, until cross-validation accuracy drops. To the best of our knowledge, no comparable resource is yet available for Hindi, which makes this resource unique.
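The iterative expansion described in the abstract can be illustrated with a minimal self-training sketch. This is not the authors' implementation: the abstract does not name the classifier, the confidence measure, or the cross-validation scheme, so the multinomial Naive Bayes model, the `threshold` parameter, leave-one-out validation, and all function names below are assumptions chosen only to keep the example self-contained. The loop labels unlabeled glosses, adds the confident ones to the seed list, and stops once cross-validation accuracy on the expanded list drops.

```python
import math
from collections import Counter, defaultdict

def train(examples):
    """Fit a multinomial Naive Bayes model on (gloss, label) pairs (assumed classifier)."""
    word_counts = defaultdict(Counter)   # label -> token counts
    label_counts = Counter()             # label -> number of glosses
    vocab = set()
    for gloss, label in examples:
        tokens = gloss.lower().split()
        word_counts[label].update(tokens)
        label_counts[label] += 1
        vocab.update(tokens)
    return word_counts, label_counts, vocab

def predict(model, gloss):
    """Return (best_label, normalized posterior of that label)."""
    word_counts, label_counts, vocab = model
    total = sum(label_counts.values())
    log_scores = {}
    for label, n in label_counts.items():
        denom = sum(word_counts[label].values()) + len(vocab)
        score = math.log(n / total)
        for tok in gloss.lower().split():
            if tok in vocab:  # ignore tokens never seen in training
                score += math.log((word_counts[label][tok] + 1) / denom)
        log_scores[label] = score
    top = max(log_scores.values())
    exp = {lab: math.exp(s - top) for lab, s in log_scores.items()}
    best = max(exp, key=exp.get)
    return best, exp[best] / sum(exp.values())

def cv_accuracy(examples):
    """Leave-one-out cross-validation accuracy (needs at least two examples)."""
    hits = 0
    for i, (gloss, label) in enumerate(examples):
        model = train(examples[:i] + examples[i + 1:])
        hits += predict(model, gloss)[0] == label
    return hits / len(examples)

def expand_seeds(seeds, unlabeled, threshold=0.8, max_rounds=10):
    """Grow the seed list with confident predictions until CV accuracy drops."""
    labeled, pool = list(seeds), list(unlabeled)
    best_acc = cv_accuracy(labeled)
    for _ in range(max_rounds):
        model = train(labeled)
        scored = [(g, *predict(model, g)) for g in pool]
        confident = [(g, lab) for g, lab, p in scored if p >= threshold]
        if not confident:
            break                      # nothing confident enough to add
        candidate = labeled + confident
        acc = cv_accuracy(candidate)
        if acc < best_acc:
            break                      # accuracy dropped: keep the previous list
        labeled, best_acc = candidate, acc
        added = {g for g, _ in confident}
        pool = [g for g in pool if g not in added]
    return labeled, pool
```

Calling `expand_seeds(seeds, unlabeled)` with `seeds` as a list of `(gloss, label)` pairs and `unlabeled` as a list of gloss strings returns the expanded labeled list and the glosses left unlabeled. In the setting of the paper, the labels would be the five temporal classes and the glosses would come from Hindi WordNet synsets.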
Anthology ID:
L16-1595
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
3752–3759
URL:
https://aclanthology.org/L16-1595
Cite (ACL):
Dipawesh Pawar, Mohammed Hasanuzzaman, and Asif Ekbal. 2016. Building Tempo-HindiWordNet: A resource for effective temporal information access in Hindi. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 3752–3759, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Building Tempo-HindiWordNet: A resource for effective temporal information access in Hindi (Pawar et al., LREC 2016)
PDF:
https://aclanthology.org/L16-1595.pdf