Automated Assessment of Noisy Crowdsourced Free-text Answers for Hindi in Low Resource Setting

Dolly Agarwal, Somya Gupta, Nishant Baghel


Abstract
The requirement of performing assessments continually on a larger scale necessitates the implementation of automated systems for evaluation of the learners’ responses to free-text questions. We target children of age group 8-14 years and use an ASR integrated assessment app to crowdsource learners’ responses to free text questions in Hindi. The app helped collect 39641 user answers to 35 different questions of Science topics. Since the users are young children from rural India and may not be well-equipped with technology, it brings in various noise types in the answers. We describe these noise types and propose a preprocessing pipeline to denoise user’s answers. We showcase the performance of different similarity metrics on the noisy and denoised versions of user and model answers. Our findings have large-scale applications for automated answer assessment for school children in India in low resource settings.
Anthology ID:
2020.wnut-1.17
Volume:
Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020)
Month:
November
Year:
2020
Address:
Online
Editors:
Wei Xu, Alan Ritter, Tim Baldwin, Afshin Rahimi
Venue:
WNUT
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
122–131
Language:
URL:
https://aclanthology.org/2020.wnut-1.17
DOI:
10.18653/v1/2020.wnut-1.17
Bibkey:
Cite (ACL):
Dolly Agarwal, Somya Gupta, and Nishant Baghel. 2020. Automated Assessment of Noisy Crowdsourced Free-text Answers for Hindi in Low Resource Setting. In Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020), pages 122–131, Online. Association for Computational Linguistics.
Cite (Informal):
Automated Assessment of Noisy Crowdsourced Free-text Answers for Hindi in Low Resource Setting (Agarwal et al., WNUT 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.wnut-1.17.pdf