Introducing CQuAE : A New French Contextualised Question-Answering Corpus for the Education Domain

Thomas Gerald, Anne Vilnat, Sofiane Ettayeb, Louis Tamames, Patrick Paroubek


Abstract
We present a new question answering corpus in French designed to educational domain. To be useful in such domain, we have to propose more complex questions and to be able to justify the answers on validated material. We analyze some properties of this corpus. The last part of this paper will be devoted to present the first experiments we have carried out to demonstrate the value of this dataset for learning a Retrieval Augmented Genration framework. Different experiments are proposed, with an automatic evaluation. A human evaluation is proposed to confirm or infirm this automatic evaluation.
Anthology ID:
2024.lrec-main.808
Volume:
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue
Venues:
LREC | COLING
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
9234–9244
Language:
URL:
https://aclanthology.org/2024.lrec-main.808
DOI:
Bibkey:
Cite (ACL):
Thomas Gerald, Anne Vilnat, Sofiane Ettayeb, Louis Tamames, and Patrick Paroubek. 2024. Introducing CQuAE : A New French Contextualised Question-Answering Corpus for the Education Domain. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 9234–9244, Torino, Italia. ELRA and ICCL.
Cite (Informal):
Introducing CQuAE : A New French Contextualised Question-Answering Corpus for the Education Domain (Gerald et al., LREC-COLING 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.lrec-main.808.pdf