Dataset and Baseline for Automatic Student Feedback Analysis

Missaka Herath, Kushan Chamindu, Hashan Maduwantha, Surangika Ranathunga


Abstract
In this paper, we present a student feedback corpus, which contains 3000 instances of feedback written by university students. This dataset has been annotated for aspect terms, opinion terms, polarities of the opinion terms towards targeted aspects, document-level opinion polarities and sentence separations. We develop a hierarchical taxonomy for aspect categorization, which covers all the areas of the teaching-learning process. We annotated both implicit and explicit aspects using this taxonomy. Annotation methodology, difficulties faced during the annotation, and the details about the aspect term categorization have been discussed in detail. This annotated corpus can be used for Aspect Extraction, Aspect Level Sentiment Analysis, and Document Level Sentiment Analysis. Also the baseline results for all three tasks are given in the paper.
Anthology ID:
2022.lrec-1.219
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
2042–2049
Language:
URL:
https://aclanthology.org/2022.lrec-1.219
DOI:
Bibkey:
Cite (ACL):
Missaka Herath, Kushan Chamindu, Hashan Maduwantha, and Surangika Ranathunga. 2022. Dataset and Baseline for Automatic Student Feedback Analysis. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 2042–2049, Marseille, France. European Language Resources Association.
Cite (Informal):
Dataset and Baseline for Automatic Student Feedback Analysis (Herath et al., LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.219.pdf