Edit Categories and Editor Role Identification in Wikipedia

Diyi Yang, Aaron Halfaker, Robert Kraut, Eduard Hovy


Abstract
In this work, we introduce a corpus for categorizing edit types in Wikipedia. Its fine-grained taxonomy of edit types enables us to differentiate editing actions and to identify editor roles in Wikipedia from editors’ low-level edit types. To do this, we first created an annotated corpus of 1,996 edits obtained from 953 article revisions and built machine-learning models to automatically identify the edit categories associated with edits. Building on this automated measurement of edit types, we then applied a graphical model analogous to Latent Dirichlet Allocation to uncover the latent roles in editors’ edit histories. This technique revealed eight different roles that editors play, including Social Networker and Substantive Expert.
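The role-discovery step in the abstract can be illustrated with a small sketch. It treats each editor as a "document" whose "words" are the predicted edit categories of that editor's edits, and fits a plain LDA-style model by collapsed Gibbs sampling. This is not the authors' implementation: the function name `fit_roles`, the hyperparameters, the editor names, and the edit-category labels below are all hypothetical, and the paper's model is only described as analogous to Latent Dirichlet Allocation.

```python
import random


def fit_roles(edit_histories, n_roles, n_iters=200, alpha=0.1, beta=0.1, seed=0):
    """Fit an LDA-style role model with collapsed Gibbs sampling.

    edit_histories maps an editor name to a list of edit-category labels.
    Returns (theta, phi, vocab): per-editor role mixtures, per-role
    distributions over edit categories, and the category vocabulary.
    """
    rng = random.Random(seed)
    editors = sorted(edit_histories)
    vocab = sorted({c for history in edit_histories.values() for c in history})
    index = {c: i for i, c in enumerate(vocab)}
    n_types = len(vocab)
    docs = [[index[c] for c in edit_histories[e]] for e in editors]

    # Count tables: editor-by-role, role-by-edit-type, and edits per role.
    n_dk = [[0] * n_roles for _ in docs]
    n_kw = [[0] * n_types for _ in range(n_roles)]
    n_k = [0] * n_roles

    # Random initial role assignment for every edit.
    z = []
    for d, doc in enumerate(docs):
        assignments = []
        for w in doc:
            k = rng.randrange(n_roles)
            assignments.append(k)
            n_dk[d][k] += 1
            n_kw[k][w] += 1
            n_k[k] += 1
        z.append(assignments)

    for _ in range(n_iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the current assignment from the counts.
                k = z[d][i]
                n_dk[d][k] -= 1
                n_kw[k][w] -= 1
                n_k[k] -= 1
                # Resample the role from its conditional distribution.
                weights = [
                    (n_dk[d][j] + alpha) * (n_kw[j][w] + beta) / (n_k[j] + n_types * beta)
                    for j in range(n_roles)
                ]
                k = rng.choices(range(n_roles), weights=weights)[0]
                z[d][i] = k
                n_dk[d][k] += 1
                n_kw[k][w] += 1
                n_k[k] += 1

    theta = {
        e: [(n_dk[d][k] + alpha) / (len(docs[d]) + n_roles * alpha) for k in range(n_roles)]
        for d, e in enumerate(editors)
    }
    phi = [
        [(n_kw[k][w] + beta) / (n_k[k] + n_types * beta) for w in range(n_types)]
        for k in range(n_roles)
    ]
    return theta, phi, vocab


# Toy input with invented editors and invented edit-category labels.
histories = {
    "editor_a": ["grammar", "grammar", "wikilink", "grammar"],
    "editor_b": ["reference", "insertion", "insertion", "reference"],
    "editor_c": ["grammar", "wikilink", "wikilink"],
}
theta, phi, vocab = fit_roles(histories, n_roles=2)
for editor, mixture in theta.items():
    print(editor, [round(p, 2) for p in mixture])
```

Each row of `theta` is one editor's mixture over latent roles, and each row of `phi` is one role's distribution over edit categories; reading off the highest-weight categories per role is one way such roles could be given names by hand.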
Anthology ID:
L16-1206
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
1295–1299
URL:
https://aclanthology.org/L16-1206
Cite (ACL):
Diyi Yang, Aaron Halfaker, Robert Kraut, and Eduard Hovy. 2016. Edit Categories and Editor Role Identification in Wikipedia. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 1295–1299, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Edit Categories and Editor Role Identification in Wikipedia (Yang et al., LREC 2016)
PDF:
https://aclanthology.org/L16-1206.pdf