<?xml version="1.0" encoding="UTF-8" ?>
<volume id="W18">
  <paper id="6000">
    <title>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</title>
    <editor>Marie-Catherine de Marneffe</editor>
    <editor>Teresa Lynn</editor>
    <editor>Sebastian Schuster</editor>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <url>http://www.aclweb.org/anthology/W18-60</url>
    <bibtype>book</bibtype>
    <bibkey>UDW2018:2018</bibkey>
  </paper>

  <paper id="6001">
    <title>Assessing the Impact of Incremental Error Detection and Correction. A Case Study on the Italian Universal Dependency Treebank</title>
    <author><first>Chiara</first><last>Alzetta</last></author>
    <author><first>Felice</first><last>Dell'Orletta</last></author>
    <author><first>Simonetta</first><last>Montemagni</last></author>
    <author><first>Maria</first><last>Simi</last></author>
    <author><first>Giulia</first><last>Venturi</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>1&#8211;7</pages>
    <url>http://www.aclweb.org/anthology/W18-6001</url>
    <abstract>Detection and correction of errors and inconsistencies in "gold treebanks" are becoming more and more central topics of corpus annotation. The paper illustrates a new incremental method for enhancing treebanks, with particular emphasis on the extension of error patterns across different textual genres and registers. Impact and role of corrections have been assessed in a dependency parsing experiment carried out with four different parsers, whose results are promising. For both evaluation datasets, the performance of parsers increases, in terms of the standard LAS and UAS measures and of a more focused measure taking into account only relations involved in error patterns, as well as at the level of individual dependencies.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>alzetta-EtAl:2018:UDW2018</bibkey>
  </paper>

  <paper id="6002">
    <title>Using Universal Dependencies in cross-linguistic complexity research</title>
    <author><first>Aleksandrs</first><last>Berdicevskis</last></author>
    <author><first>Çağrı</first><last>Çöltekin</last></author>
    <author><first>Katharina</first><last>Ehret</last></author>
    <author><first>Kilu</first><last>von Prince</last></author>
    <author><first>Daniel</first><last>Ross</last></author>
    <author><first>Bill</first><last>Thompson</last></author>
    <author><first>Chunxiao</first><last>Yan</last></author>
    <author><first>Vera</first><last>Demberg</last></author>
    <author><first>Gary</first><last>Lupyan</last></author>
    <author><first>Taraka</first><last>Rama</last></author>
    <author><first>Christian</first><last>Bentz</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>8&#8211;17</pages>
    <url>http://www.aclweb.org/anthology/W18-6002</url>
    <abstract>We evaluate corpus-based measures of linguistic complexity obtained using Universal Dependencies (UD) treebanks. We propose a method of estimating robustness of the complexity values obtained using a given measure and a given treebank. The results indicate that measures of syntactic complexity might be on average less robust than those of morphological complexity. We also estimate the validity of complexity measures by comparing the results for very similar languages and checking for unexpected differences. We show that some of those differences that arise can be diminished by using parallel treebanks and, more importantly from the practical point of view, by harmonizing the language-specific solutions in the UD annotation.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>berdicevskis-EtAl:2018:UDW2018</bibkey>
  </paper>

  <paper id="6003">
    <title>Expletives in Universal Dependency Treebanks</title>
    <author><first>Gosse</first><last>Bouma</last></author>
    <author><first>Jan</first><last>Hajic</last></author>
    <author><first>Dag</first><last>Haug</last></author>
    <author><first>Joakim</first><last>Nivre</last></author>
    <author><first>Per Erik</first><last>Solberg</last></author>
    <author><first>Lilja</first><last>Øvrelid</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>18&#8211;26</pages>
    <url>http://www.aclweb.org/anthology/W18-6003</url>
    <abstract>Although treebanks annotated according to the guidelines of Universal Dependencies (UD) now exist for many languages, the goal of annotating the same phenomena in a cross-linguistically consistent fashion is not always met.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>bouma-EtAl:2018:UDW2018</bibkey>
  </paper>

  <paper id="6004">
    <title>Challenges in Converting the Index Thomisticus Treebank into Universal Dependencies</title>
    <author><first>Flavio Massimiliano</first><last>Cecchini</last></author>
    <author><first>Marco</first><last>Passarotti</last></author>
    <author><first>Paola</first><last>Marongiu</last></author>
    <author><first>Daniel</first><last>Zeman</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>27&#8211;36</pages>
    <url>http://www.aclweb.org/anthology/W18-6004</url>
    <abstract>This paper describes the changes applied to the original process used to convert the Index Thomisticus Treebank, a corpus including texts in Medieval Latin by Thomas Aquinas, into the annotation style of Universal Dependencies. The changes are made both to harmonise the Universal Dependencies version of the Index Thomisticus Treebank with the two other available Latin treebanks and to fix errors and inconsistencies resulting from the original process. The paper details the treatment of different issues in PoS tagging, lemmatisation and assignment of dependency relations. Finally, it assesses the quality of the new conversion process by providing an evaluation against a gold standard.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>cecchini-EtAl:2018:UDW2018</bibkey>
  </paper>

  <paper id="6005">
    <title>Er ... well, it matters, right? On the role of data representations in spoken language dependency parsing</title>
    <author><first>Kaja</first><last>Dobrovoljc</last></author>
    <author><first>Matej</first><last>Martinc</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>37&#8211;46</pages>
    <url>http://www.aclweb.org/anthology/W18-6005</url>
    <abstract>Despite the significant improvement of data-driven dependency parsing systems in recent years, they still achieve a considerably lower performance in parsing spoken language data in comparison to written data. On the example of Spoken Slovenian Treebank, the first spoken data treebank using the UD annotation scheme, we investigate which speech-specific phenomena undermine parsing performance, through a series of training data and treebank modification experiments using two distinct state-of-the-art parsing systems. Our results show that segmentation is the most prominent cause of low parsing performance, both in parsing raw and pre-segmented transcriptions. In addition to shorter utterances, both parsers perform better on normalized transcriptions including basic markers of prosody and excluding disfluencies, discourse markers and fillers. On the other hand, the effects of written training data addition and speech-specific dependency representations largely depend on the parsing system selected.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>dobrovoljc-martinc:2018:UDW2018</bibkey>
  </paper>

  <paper id="6006">
    <title>Mind the Gap: Data Enrichment in Dependency Parsing of Elliptical Constructions</title>
    <author><first>Kira</first><last>Droganova</last></author>
    <author><first>Filip</first><last>Ginter</last></author>
    <author><first>Jenna</first><last>Kanerva</last></author>
    <author><first>Daniel</first><last>Zeman</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>47&#8211;54</pages>
    <url>http://www.aclweb.org/anthology/W18-6006</url>
    <abstract>In this paper, we focus on parsing rare and non-trivial constructions, in particular ellipsis. We report on several experiments in enrichment of training data for this specific construction, evaluated on five languages: Czech, English, Finnish, Russian and Slovak.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>droganova-EtAl:2018:UDW2018</bibkey>
  </paper>

  <paper id="6007">
    <title>Integration complexity and the order of cosisters</title>
    <author><first>William</first><last>Dyer</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>55&#8211;65</pages>
    <url>http://www.aclweb.org/anthology/W18-6007</url>
    <abstract>The cost of integrating dependent constituents to their heads is thought to involve the distance between dependent and head and the complexity of the integration (Gibson, 1998). The former has been convincingly addressed by Dependency Distance Minimization (DDM) (cf. Liu et al., 2017). The current study addresses the latter by proposing a novel theory of integration complexity derived from the entropy of the probability distribution of a dependent’s heads. An analysis of Universal Dependency corpora provides empirical evidence regarding the preferred order of isomorphic cosisters—sister constituents of the same syntactic form on the same side of their head—such as the adjectives in "pretty blue fish." Integration complexity, alongside DDM, allows for a general theory of constituent order based on integration cost.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>dyer:2018:UDW2018</bibkey>
  </paper>

  <paper id="6008">
    <title>SUD or Surface-Syntactic Universal Dependencies: An annotation scheme near-isomorphic to UD</title>
    <author><first>Kim</first><last>Gerdes</last></author>
    <author><first>Bruno</first><last>Guillaume</last></author>
    <author><first>Sylvain</first><last>Kahane</last></author>
    <author><first>Guy</first><last>Perrier</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>66&#8211;74</pages>
    <url>http://www.aclweb.org/anthology/W18-6008</url>
    <abstract>This article proposes a surface-syntactic annotation scheme called SUD that is near-isomorphic to the Universal Dependencies (UD) annotation scheme while following distributional criteria for defining the dependency tree structure and the naming of the syntactic functions. Rule-based graph transformation grammars allow for a bi-directional transformation of UD into SUD. The back-and-forth transformation can also be seen as a powerful error-mining tool to assure the intra-language and inter-language coherence of the UD treebanks.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>gerdes-EtAl:2018:UDW2018</bibkey>
  </paper>

  <paper id="6009">
    <title>Coordinate Structures in Universal Dependencies for Head-final Languages</title>
    <author><first>Hiroshi</first><last>Kanayama</last></author>
    <author><first>Na-Rae</first><last>Han</last></author>
    <author><first>Masayuki</first><last>Asahara</last></author>
    <author><first>Jena D.</first><last>Hwang</last></author>
    <author><first>Yusuke</first><last>Miyao</last></author>
    <author><first>Jinho D.</first><last>Choi</last></author>
    <author><first>Yuji</first><last>Matsumoto</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>75&#8211;84</pages>
    <url>http://www.aclweb.org/anthology/W18-6009</url>
    <abstract>This paper discusses the representation of coordinate structures in the Universal Dependencies framework for two head-final languages,Japanese and Korean. UD applies a strict principle that makes the head of coordination the left-most conjunct. However, the guideline may produce syntactic trees which are difficult to accept in head-final languages. This paper describes the status in the current corpora and proposes alternative designs suitable for these languages.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>kanayama-EtAl:2018:UDW2018</bibkey>
  </paper>

  <paper id="6010">
    <title>Investigating NP-Chunking with Universal Dependencies for English</title>
    <author><first>Oph&#233;lie</first><last>Lacroix</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>85&#8211;90</pages>
    <url>http://www.aclweb.org/anthology/W18-6010</url>
    <abstract>Chunking is a pre-processing task generally dedicated to improving constituency parsing. In this paper, we want to show that universal dependency (UD) parsing can also leverage the information provided by the task of chunking even though annotated chunks are not provided with universal dependency trees. In particular, we introduce the possibility of deducing noun-phrase (NP) chunks from universal dependencies, focusing on English as a first example. We then demonstrate how the task of NP-chunking can benefit PoS-tagging in a multi-task learning setting - comparing two different strategies - and how it can be used as a feature for dependency parsing in order to learn enriched models.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>lacroix:2018:UDW2018</bibkey>
  </paper>

  <paper id="6011">
    <title>Marrying Universal Dependencies and Universal Morphology</title>
    <author><first>Arya D.</first><last>McCarthy</last></author>
    <author><first>Miikka</first><last>Silfverberg</last></author>
    <author><first>Ryan</first><last>Cotterell</last></author>
    <author><first>Mans</first><last>Hulden</last></author>
    <author><first>David</first><last>Yarowsky</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>91&#8211;101</pages>
    <url>http://www.aclweb.org/anthology/W18-6011</url>
    <abstract>The Universal Dependencies (UD) and Universal Morphology (UniMorph) projects each present schemata for annotating the morphosyntactic details of language. Each project also provides corpora of annotated text in many languages—UD at the token level and UniMorph at the type level. As each corpus is built by different annotators, language-specific decisions hinder the goal of universal schemata. With compatibility of tags, each project’s annotations could be used to validate the other’s. Additionally, the availability of both type- and token-level resources would be a boon to tasks such as parsing and homograph disambiguation. To ease this interoperability, we present a deterministic mapping from Universal Dependencies v2 features into the UniMorph schema. We validate our approach by lookup in the UniMorph corpora and find a macro-average of 64.13% recall. We also note incompatibilities due to paucity of data on either side. Finally, we present a critical evaluation of the foundations, strengths, and weaknesses of the two annotation projects.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>mccarthy-EtAl:2018:UDW2018</bibkey>
  </paper>

  <paper id="6012">
    <title>Enhancing Universal Dependency Treebanks: A Case Study</title>
    <author><first>Joakim</first><last>Nivre</last></author>
    <author><first>Paola</first><last>Marongiu</last></author>
    <author><first>Filip</first><last>Ginter</last></author>
    <author><first>Jenna</first><last>Kanerva</last></author>
    <author><first>Simonetta</first><last>Montemagni</last></author>
    <author><first>Sebastian</first><last>Schuster</last></author>
    <author><first>Maria</first><last>Simi</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>102&#8211;107</pages>
    <url>http://www.aclweb.org/anthology/W18-6012</url>
    <abstract>We evaluate cross-lingual techniques for adding enhanced dependencies to existing treebanks in Universal Dependencies. We apply a rule-based system for English and a data-driven system trained on Finnish to Swedish and Italian. We find that both systems are accurate enough to bootstrap enhanced dependencies in existing UD treebanks. For Italian, results are even on par with those of a language-specific system.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>nivre-EtAl:2018:UDW2018</bibkey>
  </paper>

  <paper id="6013">
    <title>Enhancing Universal Dependencies for Korean</title>
    <author><first>Youngbin</first><last>Noh</last></author>
    <author><first>Jiyoon</first><last>Han</last></author>
    <author><first>Tae Hwan</first><last>Oh</last></author>
    <author><first>Hansaem</first><last>Kim</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>108&#8211;116</pages>
    <url>http://www.aclweb.org/anthology/W18-6013</url>
    <abstract>In this paper, for the purpose of enhancing Universal Dependencies for the Korean language, we propose a modified method for mapping Korean Part-of-Speech(POS) tagset in relation to Universal Part-of-Speech (UPOS) tagset in order to enhance the Universal Dependencies for the Korean Language. Previous studies suggest that UPOS reflects several issues that influence dependency annotation by using the POS of Korean predicates, particularly the distinctiveness in using verb, adjective, and copula.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>noh-EtAl:2018:UDW2018</bibkey>
  </paper>

  <paper id="6014">
    <title>UD-Japanese BCCWJ: Universal Dependencies Annotation for the Balanced Corpus of Contemporary Written Japanese</title>
    <author><first>Mai</first><last>Omura</last></author>
    <author><first>Masayuki</first><last>Asahara</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>117&#8211;125</pages>
    <url>http://www.aclweb.org/anthology/W18-6014</url>
    <abstract>In this paper, we describe a corpus UD Japanese-BCCWJ that was created by converting the Balanced Corpus of Contemporary Written Japanese (BCCWJ), a Japanese language corpus, to adhere to the UD annotation schema. The BCCWJ already assigns dependency information at the level of the bunsetsu (a Japanese syntactic unit comparable to the phrase). We developed a program to convert the BCCWJ to UD based on this dependency structure, and this corpus is the result of completely automatic conversion using the program. UD Japanese-BCCWJ is the largest-scale UD Japanese corpus and the second-largest of all UD corpora, including 1,980 documents, 57,109 sentences, and 1,273k words across six distinct domains.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>omura-asahara:2018:UDW2018</bibkey>
  </paper>

  <paper id="6015">
    <title>The First Komi-Zyrian Universal Dependencies Treebanks</title>
    <author><first>Niko</first><last>Partanen</last></author>
    <author><first>Rogier</first><last>Blokland</last></author>
    <author><first>KyungTae</first><last>Lim</last></author>
    <author><first>Thierry</first><last>Poibeau</last></author>
    <author><first>Michael</first><last>Rießler</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>126&#8211;132</pages>
    <url>http://www.aclweb.org/anthology/W18-6015</url>
    <abstract>Two Komi-Zyrian treebanks were included in the Universal Dependencies 2.2 release. This article contextualizes the treebanks, discusses the process through which they were created, and outlines the future plans and timeline for the next improvements. Special attention is paid to the possibilities of using UD in the documentation and description of endangered languages.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>partanen-EtAl:2018:UDW2018</bibkey>
  </paper>

  <paper id="6016">
    <title>The Hebrew Universal Dependency Treebank: Past Present and Future</title>
    <author><first>Shoval</first><last>Sade</last></author>
    <author><first>Amit</first><last>Seker</last></author>
    <author><first>Reut</first><last>Tsarfaty</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>133&#8211;143</pages>
    <url>http://www.aclweb.org/anthology/W18-6016</url>
    <abstract>The Hebrew treebank (HTB), consisting of 6221 morpho-syntactically annotated newspaper sentences, has been the only resource for training and validating Hebrew statistical parsers for almost two decades now. During these decades, the HTB has gone through a trajectory of automatic and semi-automatic conversions, until arriving at its current UDv2 form. In this work we set out to manually validate the UDv2 version and, accordingly, we apply scheme changes to bring the UD HTB into the same theoretical ground as the rest of UD. Our experimental results show that improving the linguistic coherence and internal consistency of the UD HTB has indeed led to improved syntactic parsing performance. At the same time, there is more to be done at the points of intersection with other linguistic processing layers, in particular, at the interface of UD with external morphological and lexical resources.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>sade-seker-tsarfaty:2018:UDW2018</bibkey>
  </paper>

  <paper id="6017">
    <title>Multi-source synthetic treebank creation for improved cross-lingual dependency parsing</title>
    <author><first>Francis</first><last>Tyers</last></author>
    <author><first>Mariya</first><last>Sheyanova</last></author>
    <author><first>Aleksandra</first><last>Martynova</last></author>
    <author><first>Pavel</first><last>Stepachev</last></author>
    <author><first>Konstantin</first><last>Vinogorodskiy</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>144&#8211;150</pages>
    <url>http://www.aclweb.org/anthology/W18-6017</url>
    <abstract>This paper describes a method of creating synthetic treebanks for </abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>tyers-EtAl:2018:UDW2018</bibkey>
  </paper>

  <paper id="6018">
    <title>Toward Universal Dependencies for Shipibo-Konibo</title>
    <author><first>Alonso</first><last>V&#225;squez</last></author>
    <author><first>Renzo</first><last>Ego Aguirre</last></author>
    <author><first>Candy</first><last>Angulo</last></author>
    <author><first>John</first><last>Miller</last></author>
    <author><first>Claudia</first><last>Villanueva</last></author>
    <author><first>Željko</first><last>Agić</last></author>
    <author><first>Roberto</first><last>Zariquiey</last></author>
    <author><first>Arturo</first><last>Oncevay</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>151&#8211;161</pages>
    <url>http://www.aclweb.org/anthology/W18-6018</url>
    <abstract>We present an initial version of the Universal Dependencies (UD) treebank for Shipibo-Konibo, the first South American, Amazonian, Panoan and Peruvian language with a resource built under UD. We describe the linguistic aspects of how the tagset was defined and the treebank was annotated; in addition we present our specific treatment of linguistic units called clitics. Although the treebank is still under development, it allowed us to perform a typological comparison against Spanish, the predominant language in Peru, and dependency syntax parsing experiments in both monolingual and cross-lingual approaches.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>vsquez-EtAl:2018:UDW2018</bibkey>
  </paper>

  <paper id="6019">
    <title>Transition-based Parsing with Lighter Feed-Forward Networks</title>
    <author><first>David</first><last>Vilares</last></author>
    <author><first>Carlos</first><last>G&#243;mez-Rodr&#237;guez</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>162&#8211;172</pages>
    <url>http://www.aclweb.org/anthology/W18-6019</url>
    <abstract>We explore whether it is possible to build lighter parsers, that are statistically equivalent to their corresponding standard version, for a wide set of languages showing different structures and morphologies. As testbed, we use the Universal Dependencies and transition-based dependency parsers trained on feed-forward networks. For these, most existing research assumes de facto standard embedded features and relies on pre-computation tricks to obtain speedups. We explore how these features and their size can be reduced and whether this translates into speed-ups with a negligible impact on accuracy. The experiments show that grand-daughter features can be removed for the majority of treebanks without a significant (negative or positive) LAS difference. They also show how the size of the embeddings can be notably reduced.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>vilares-gmezrodrguez:2018:UDW2018</bibkey>
  </paper>

  <paper id="6020">
    <title>Extended and Enhanced Polish Dependency Bank in Universal Dependencies Format</title>
    <author><first>Alina</first><last>Wr&#243;blewska</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>173&#8211;182</pages>
    <url>http://www.aclweb.org/anthology/W18-6020</url>
    <abstract>The paper presents the largest Polish Dependency Bank in Universal Dependencies format &#8211; PDBUD &#8211; with 22K trees and 352K tokens. PDBUD builds on its previous version, i.e. the Polish UD treebank (PL-SZ), and contains all 8K PL-SZ trees. The PL-SZ trees are checked and possibly corrected in the current edition of PDBUD. Further 14K trees are automatically converted from a new version of Polish Dependency Bank. The PDBUD trees are expanded with the enhanced edges encoding the shared dependents and the shared governors of the coordinated conjuncts and with the semantic roles of some dependents. The conducted evaluation experiments show that PDBUD is large enough for training a high-quality graph-based dependency parser for Polish.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>wrblewska:2018:UDW2018</bibkey>
  </paper>

  <paper id="6021">
    <title>Approximate Dynamic Oracle for Dependency Parsing with Reinforcement Learning</title>
    <author><first>Xiang</first><last>Yu</last></author>
    <author><first>Ngoc Thang</first><last>Vu</last></author>
    <author><first>Jonas</first><last>Kuhn</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>183&#8211;191</pages>
    <url>http://www.aclweb.org/anthology/W18-6021</url>
    <abstract>We present a general approach with reinforcement learning (RL) to approximate dynamic oracles for transition systems where exact dynamic oracles are difficult to derive. We treat oracle parsing as a reinforcement learning problem, design the reward function inspired by the classical dynamic oracle, and use Deep Q-Learning (DQN) techniques to train the oracle with gold trees as features. The combination of a priori knowledge and data-driven methods enables an efficient dynamic oracle, which improves the parser performance over static oracles in several transition systems.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>yu-vu-kuhn:2018:UDW2018</bibkey>
  </paper>

  <paper id="6022">
    <title>The Coptic Universal Dependency Treebank</title>
    <author><first>Amir</first><last>Zeldes</last></author>
    <author><first>Mitchell</first><last>Abrams</last></author>
    <booktitle>Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)</booktitle>
    <month>November</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>192&#8211;201</pages>
    <url>http://www.aclweb.org/anthology/W18-6022</url>
    <abstract>This paper presents the Coptic Universal Dependency Treebank, the first dependency treebank within the Egyptian subfamily of the Afro-Asiatic languages. We discuss the composition of the corpus, challenges in adapting the UD annotation scheme to existing conventions for annotating Coptic, and evaluate inter-annotator agreement on UD annotation for the language. Some specific constructions are taken as a starting point for discussing several more general UD annotation guidelines, in particular for appositions, ambiguous passivization, incorporation and object-doubling.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>zeldes-abrams:2018:UDW2018</bibkey>
  </paper>

</volume>

