A Graphical Citation Browser for the ACL Anthology

Benjamin Weitz, Ulrich Schäfer


Abstract
Navigation in large scholarly paper collections is tedious and not well supported in most scientific digital libraries. We describe a novel browser-based graphical tool implemented using HTML5 Canvas. It displays citation information extracted from the paper text to support useful navigation. The tool is implemented using a client/server architecture. A citation graph of the digital library is built in the memory of the server. On the client side, egdes of the displayed citation (sub)graph surrounding a document are labeled with keywords signifying the kind of citation made from one document to another. These keywords were extracted using NLP tools such as tokenizer, sentence boundary detection and part-of-speech tagging applied to the text extracted from the original PDF papers (currently 22,500). By clicking on an egde, the user can inspect the corresponding citation sentence in context, in most cases even also highlighted in the original PDF layout. The system is publicly accessible as part of the ACL Anthology Searchbench.
Anthology ID:
L12-1474
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
1718–1722
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/805_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Benjamin Weitz and Ulrich Schäfer. 2012. A Graphical Citation Browser for the ACL Anthology. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1718–1722, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
A Graphical Citation Browser for the ACL Anthology (Weitz & Schäfer, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/805_Paper.pdf