A Study of Reuse and Plagiarism in LREC papers

Gil Francopoulo, Joseph Mariani, Patrick Paroubek


Abstract
The aim of this experiment is to present an easy way to compare fragments of text in order to detect (supposed) results of copy & paste operations between articles in the domain of Natural Language Processing (NLP). The search space of the comparisons is a corpus named NLP4NLP, which gathers a large part of the NLP field. The study is centered on LREC papers and considers both directions: first, an LREC paper borrowing a fragment of text from the collection; second, fragments of LREC documents borrowed and inserted into other papers in the collection.
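The abstract does not say how fragments are compared. As a rough illustration only, one common way to flag possible copy & paste is to look for shared runs of consecutive words (word n-grams) between two texts. The sketch below assumes whitespace tokenization, lowercasing, and a window of 7 words; the function names and the window size are hypothetical choices for this example, not details taken from the paper.

```python
def word_ngrams(text, n):
    """Return the set of lowercase word n-grams found in text."""
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def shared_fragments(doc_a, doc_b, n=7):
    """Return the word n-grams that occur in both documents."""
    return word_ngrams(doc_a, n) & word_ngrams(doc_b, n)


def overlap_ratio(doc_a, doc_b, n=7):
    """Fraction of doc_a's n-grams that also occur in doc_b.

    The measure is asymmetric: it describes how much of doc_a is
    found in doc_b, not the reverse. The window size n is an
    assumption made for this sketch.
    """
    grams = word_ngrams(doc_a, n)
    if not grams:
        return 0.0
    return len(grams & word_ngrams(doc_b, n)) / len(grams)


if __name__ == "__main__":
    a = "the quick brown fox jumps over the lazy dog near the river bank"
    b = "we note that the quick brown fox jumps over the lazy dog today"
    print(len(shared_fragments(a, b)))
    print(overlap_ratio(a, b))
```

The ratio only says how much of one text reappears in another. Deciding which paper borrowed from which would need extra information, such as publication dates, which this sketch does not model.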
Anthology ID:
L16-1298
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
1890–1897
URL:
https://aclanthology.org/L16-1298
Cite (ACL):
Gil Francopoulo, Joseph Mariani, and Patrick Paroubek. 2016. A Study of Reuse and Plagiarism in LREC papers. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 1890–1897, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
A Study of Reuse and Plagiarism in LREC papers (Francopoulo et al., LREC 2016)
PDF:
https://aclanthology.org/L16-1298.pdf