2016
pdf
bib
abs
Ancient Greek WordNet Meets the Dynamic Lexicon: the Example of the Fragments of the Greek Historians
Monica Berti
|
Yuri Bizzoni
|
Federico Boschetti
|
Gregory R. Crane
|
Riccardo Del Gratta
|
Tariq Yousef
Proceedings of the 8th Global WordNet Conference (GWC)
The Ancient Greek WordNet (AGWN) and the Dynamic Lexicon (DL) are multilingual resources to study the lexicon of Ancient Greek texts and their translations. Both AGWN and DL are works in progress that need accuracy improvement and manual validation. After a detailed description of the current state of each work, this paper illustrates a methodology to cross AGWN and DL data, in order to mutually score the items of each resource according to the evidence provided by the other resource. The training data is based on the corpus of the Digital Fragmenta Historicorum Graecorum (DFHG), which includes ancient Greek texts with Latin translations.
2014
pdf
bib
abs
The Making of Ancient Greek WordNet
Yuri Bizzoni
|
Federico Boschetti
|
Harry Diakoff
|
Riccardo Del Gratta
|
Monica Monachini
|
Gregory Crane
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
This paper describes the process of creation and review of a new lexico-semantic resource for the classical studies: AncientGreekWordNet. The candidate sets of synonyms (synsets) are extracted from Greek-English dictionaries, on the assumption that Greek words translated by the same English word or phrase have a high probability of being synonyms or at least semantically closely related. The process of validation and the web interface developed to edit and query the resource are described in detail. The lexical coverage of Ancient Greek WordNet is illustrated and the accuracy is evaluated. Finally, scenarios for exploiting the resource are discussed.
pdf
bib
abs
Open Philology at the University of Leipzig
Frederik Baumgardt
|
Giuseppe Celano
|
Gregory R. Crane
|
Stella Dee
|
Maryam Foradi
|
Emily Franzini
|
Greta Franzini
|
Monica Lent
|
Maria Moritz
|
Simona Stoyanova
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
The Open Philology Project at the University of Leipzig aspires to re-assert the value of philology in its broadest sense. Philology signifies the widest possible use of the linguistic record to enable a deep understanding of the complete lived experience of humanity. Pragmatically, we focus on Greek and Latin because (1) substantial collections and services are already available within these languages, (2) substantial user communities exist (c. 35,000 unique users a month at the Perseus Digital Library), and (3) a European-based project is better positioned to process extensive cultural heritage materials in these languages rather than in Chinese or Sanskrit. The Open Philology Project has been designed with the hope that it can contribute to any historical language that survives within the human record. It includes three tasks: (1) the creation of an open, extensible, repurposable collection of machine-readable linguistic sources; (2) the development of dynamic textbooks that use annotated corpora to customize the vocabulary and grammar of texts that learners want to read, and at the same time engage students in collaboratively producing new annotated data; (3) the establishment of new workflows for, and forms of, publication, from individual annotations with argumentation to traditional publications with integrated machine-actionable data.
pdf
bib
A New Implementation for Canonical Text Services
Jochen Tiepmar
|
Christoph Teichmann
|
Gerhard Heyer
|
Monica Berti
|
Gregory Crane
Proceedings of the 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH)
2013
pdf
bib
Active Learning for Dependency Parsing by A Committee of Parsers
Saeed Majidi
|
Gregory Crane
Proceedings of the 13th International Conference on Parsing Technologies (IWPT 2013)
2009
pdf
bib
Citations in the Digital Library of Classics: Extracting Canonical References by Using Conditional Random Fields
Matteo Romanello
|
Federico Boschetti
|
Gregory Crane
Proceedings of the 2009 Workshop on Text and Citation Analysis for Scholarly Digital Libraries (NLPIR4DL)
2008
pdf
bib
abs
The Annotation Guidelines of the Latin Dependency Treebank and Index Thomisticus Treebank: the Treatment of some specific Syntactic Constructions in Latin
David Bamman
|
Marco Passarotti
|
Roberto Busa
|
Gregory Crane
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
The paper describes the treatment of some specific syntactic constructions in two treebanks of Latin according to a common set of annotation guidelines. Both projects work within the theoretical framework of Dependency Grammar, which has been demonstrated to be an especially appropriate framework for the representation of languages with a moderately free word order, where the linear order of constituents is broken up with elements of other constituents. The two projects are the first of their kind for Latin, so no prior established guidelines for syntactic annotation are available to rely on. The general model for the adopted style of representation is that used by the Prague Dependency Treebank, with departures arising from the Latin grammar of Pinkster, specifically in the traditional grammatical categories of the ablative absolute, the accusative + infinitive, and gerunds/gerundives. Sharing common annotation guidelines allows us to compare the datasets of the two treebanks for tasks such as mutually checking annotation consistency, diachronically studying specific syntactic constructions, and training statistical dependency parsers.
2007
pdf
bib
The Latin Dependency Treebank in a Cultural Heritage Digital Library
David Bamman
|
Gregory Crane
Proceedings of the Workshop on Language Technology for Cultural Heritage Data (LaTeCH 2007).