A French Eye-Tracking Corpus of Original and Simplified Medical, Clinical, and General Texts - FETA

Oksana Ivchenko, Natalia Grabar


Abstract
Eye tracking offers an objective window onto the real-time cognitive processing of information being read: longer fixations, more regressions, and greater pupil dilation reliably index linguistic difficulty. Yet few corpora annotated with eye-tracking features are available. In this paper, we introduce the FETA corpus, a French Eye-TrAcking corpus. It combines three types of texts (general, medical, and clinical) in two versions (original and manually simplified). These texts are read by 46 participants, from whom we collect eye-tracking data covering dozens of eye-tracking features.
Anthology ID:
2025.gaze4nlp-1.5
Volume:
Proceedings of the First International Workshop on Gaze Data and Natural Language Processing
Month:
September
Year:
2025
Address:
Varna, Bulgaria
Editors:
Cengiz Acarturk, Jamal Nasir, Burcu Can, Cagrı Coltekin
Venues:
Gaze4NLP | WS
Publisher:
INCOMA Ltd., Shoumen, BULGARIA
Pages:
37–43
URL:
https://aclanthology.org/2025.gaze4nlp-1.5/
Cite (ACL):
Oksana Ivchenko and Natalia Grabar. 2025. A French Eye-Tracking Corpus of Original and Simplified Medical, Clinical, and General Texts - FETA. In Proceedings of the First International Workshop on Gaze Data and Natural Language Processing, pages 37–43, Varna, Bulgaria. INCOMA Ltd., Shoumen, BULGARIA.
Cite (Informal):
A French Eye-Tracking Corpus of Original and Simplified Medical, Clinical, and General Texts - FETA (Ivchenko & Grabar, Gaze4NLP 2025)
PDF:
https://aclanthology.org/2025.gaze4nlp-1.5.pdf