4Couv: A New Treebank for French

Philippe Blache, Grégoire de Montcheuil, Laurent Prévot, Stéphane Rauzy


Abstract
The question of the type of text used as primary data in treebanks is of certain importance. First, it has an influence at the discourse level: an article is not organized in the same way as a novel or a technical document. Moreover, it also has consequences in terms of semantic interpretation: some types of texts can be easier to interpret than others. We present in this paper a new type of treebank which presents the particularity to answer to specific needs of experimental linguistic. It is made of short texts (book backcovers) that presents a strong coherence in their organization and can be rapidly interpreted. This type of text is adapted to short reading sessions, making it easy to acquire physiological data (e.g. eye movement, electroencepholagraphy). Such a resource offers reliable data when looking for correlations between computational models and human language processing.
Anthology ID:
L16-1245
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
1546–1551
Language:
URL:
https://aclanthology.org/L16-1245
DOI:
Bibkey:
Cite (ACL):
Philippe Blache, Grégoire de Montcheuil, Laurent Prévot, and Stéphane Rauzy. 2016. 4Couv: A New Treebank for French. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 1546–1551, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
4Couv: A New Treebank for French (Blache et al., LREC 2016)
Copy Citation:
PDF:
https://aclanthology.org/L16-1245.pdf