TALEP at CMCL 2021 Shared Task: Non Linear Combination of Low and High-Level Features for Predicting Eye-Tracking Data

Franck Dary, Alexis Nasr, Abdellah Fourtassi


Abstract
In this paper we describe our contribution to the CMCL 2021 Shared Task, which consists in predicting 5 different eye tracking variables from English tokenized text. Our approach is based on a neural network that combines both raw textual features we extracted from the text and parser-based features that include linguistic predictions (e.g. part of speech) and complexity metrics (e.g., entropy of parsing). We found that both the features we considered as well as the architecture of the neural model that combined these features played a role in the overall performance. Our system achieved relatively high accuracy on the test data of the challenge and was ranked 2nd out of 13 competing teams and a total of 30 submissions.
Anthology ID:
2021.cmcl-1.13
Volume:
Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics
Month:
June
Year:
2021
Address:
Online
Venue:
CMCL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
108–113
Language:
URL:
https://aclanthology.org/2021.cmcl-1.13
DOI:
10.18653/v1/2021.cmcl-1.13
Bibkey:
Cite (ACL):
Franck Dary, Alexis Nasr, and Abdellah Fourtassi. 2021. TALEP at CMCL 2021 Shared Task: Non Linear Combination of Low and High-Level Features for Predicting Eye-Tracking Data. In Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, pages 108–113, Online. Association for Computational Linguistics.
Cite (Informal):
TALEP at CMCL 2021 Shared Task: Non Linear Combination of Low and High-Level Features for Predicting Eye-Tracking Data (Dary et al., CMCL 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.cmcl-1.13.pdf
Optional supplementary material:
 2021.cmcl-1.13.OptionalSupplementaryMaterial.zip