2024
pdf
bib
abs
Mapping PropBank Argument Labels to Czech Verbal Valency
Jan Hajič
|
Eva Fučíková
|
Marketa Lopatkova
|
Zdeňka Urešová
Proceedings of the Fifth International Workshop on Designing Meaning Representations @ LREC-COLING 2024
For many years, there has been attempts to compare predicate-argument labeling schemas between formalism, typically under the dependency assumptions (even if the annotation by these schemas could have been performed on either constituent-based specifications or dependency ones). Given the growing number of resources that link various lexical resources to one another, as well as thanks to parallel annotated corpora (with or without annotation), it is now possible to do more in-depth studies of those correspondences. We present here a high-coverage pilot study of mapping the labeling system used in PropBank (for English) to Czech, which has so far used mainly valency lexicons (in several closely related forms) for annotation projects, under a different level of specification and different theoretical assumptions. The purpose of this study is both theoretical (comparing the argument labeling schemes) and practical (to be able to annotate Czech under the standard UMR specifications).
2020
pdf
bib
abs
Towards a Semi-Automatic Detection of Reflexive and Reciprocal Constructions and Their Representation in a Valency Lexicon
Václava Kettnerová
|
Marketa Lopatkova
|
Anna Vernerová
|
Petra Barancikova
Proceedings of the Twelfth Language Resources and Evaluation Conference
Valency lexicons usually describe valency behavior of verbs in non-reflexive and non-reciprocal constructions. However, reflexive and reciprocal constructions are common morphosyntactic forms of verbs. Both of these constructions are characterized by regular changes in morphosyntactic properties of verbs, thus they can be described by grammatical rules. On the other hand, the possibility to create reflexive and/or reciprocal constructions cannot be trivially derived from the morphosyntactic structure of verbs as it is conditioned by their semantic properties as well. A large-coverage valency lexicon allowing for rule based generation of all well formed verb constructions should thus integrate the information on reflexivity and reciprocity. In this paper, we propose a semi-automatic procedure, based on grammatical constraints on reflexivity and reciprocity, detecting those verbs that form reflexive and reciprocal constructions in corpus data. However, exploitation of corpus data for this purpose is complicated due to the diverse functions of reflexive markers crossing the domain of reflexivity and reciprocity. The list of verbs identified by the previous procedure is thus further used in an automatic experiment, applying word embeddings for detecting semantically similar verbs. These candidate verbs have been manually verified and annotation of their reflexive and reciprocal constructions has been integrated into the valency lexicon of Czech verbs VALLEX.
2019
pdf
bib
Reflexives in Czech from a Dependency Perspective
Vaclava Kettnerova
|
Marketa Lopatkova
Proceedings of the Fifth International Conference on Dependency Linguistics (Depling, SyntaxFest 2019)
2016
pdf
bib
abs
Alternations: From Lexicon to Grammar And Back Again
Markéta Lopatková
|
Václava Kettnerová
Proceedings of the Workshop on Grammar and Lexicon: interactions and interfaces (GramLex)
An excellent example of a phenomenon bridging a lexicon and a grammar is provided by grammaticalized alternations (e.g., passivization, reflexivity, and reciprocity): these alternations represent productive grammatical processes which are, however, lexically determined. While grammaticalized alternations keep lexical meaning of verbs unchanged, they are usually characterized by various changes in their morphosyntactic structure. In this contribution, we demonstrate on the example of reciprocity and its representation in the valency lexicon of Czech verbs, VALLEX how a linguistic description of complex (and still systemic) changes characteristic of grammaticalized alternations can benefit from an integration of grammatical rules into a valency lexicon. In contrast to other types of grammaticalized alternations, reciprocity in Czech has received relatively little attention although it closely interacts with various linguistic phenomena (e.g., with light verbs, diatheses, and reflexivity).
2015
pdf
bib
At the Lexicon-Grammar Interface: The Case of Complex Predicates in the Functional Generative Description
Václava Kettnerová
|
Markéta Lopatková
Proceedings of the Third International Conference on Dependency Linguistics (Depling 2015)
2014
pdf
bib
abs
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
Koenraad De Smedt
|
Erhard Hinrichs
|
Detmar Meurers
|
Inguna Skadiņa
|
Bolette Pedersen
|
Costanza Navarretta
|
Núria Bel
|
Krister Lindén
|
Markéta Lopatková
|
Jan Hajič
|
Gisle Andersen
|
Przemyslaw Lenkiewicz
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
CLARA (Common Language Resources and Their Applications) is a Marie Curie Initial Training Network which ran from 2009 until 2014 with the aim of providing researcher training in crucial areas related to language resources and infrastructure. The scope of the project was broad and included infrastructure design, lexical semantic modeling, domain modeling, multimedia and multimodal communication, applications, and parsing technologies and grammar models. An international consortium of 9 partners and 12 associate partners employed researchers in 19 new positions and organized a training program consisting of 10 thematic courses and summer/winter schools. The project has resulted in new theoretical insights as well as new resources and tools. Most importantly, the project has trained a new generation of researchers who can perform advanced research and development in language resources and technologies.
pdf
bib
abs
Automatic Mapping Lexical Resources: A Lexical Unit as the Keystone
Eduard Bejček
|
Václava Kettnerová
|
Markéta Lopatková
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
This paper presents the fully automatic linking of two valency lexicons of Czech verbs: VALLEX and PDT-VALLEX. Despite the same theoretical background adopted by these lexicons and the same linguistic phenomena they focus on, the fully automatic mapping of these resouces is not straightforward. We demonstrate that converting these lexicons into a common format represents a relatively easy part of the task whereas the automatic identification of pairs of corresponding valency frames (representing lexical units of verbs) poses difficulties. The overall achieved precision of 81% can be considered satisfactory. However, the higher number of lexical units a verb has, the lower the precision of their automatic mapping usually is. Moreover, we show that especially (i) supplementing further information on lexical units and (ii) revealing and reconciling regular discrepancies in their annotations can greatly assist in the automatic merging.
pdf
bib
abs
To Pay or to Get Paid: Enriching a Valency Lexicon with Diatheses
Anna Vernerová
|
Václava Kettnerová
|
Markéta Lopatková
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Valency lexicons typically describe only unmarked usages of verbs (the active form); however, verbs prototypically enter different surface structures. In this paper, we focus on the so-called diatheses, i.e., the relations between different surface syntactic manifestations of verbs that are brought about by changes in the morphological category of voice, e.g., the passive diathesis. The change in voice of a verb is prototypically associated with shifts of some of its valency complementations in the surface structure. These shifts are implied by changes in morphemic forms of the involved valency complementations and are regular enough to be captured by syntactic rules. However, as diatheses are lexically conditioned, their applicability to an individual lexical unit of a verb is not predictable from its valency frame alone. In this work, we propose a representation of this linguistic phenomenon in a valency lexicon of Czech verbs, VALLEX, with the aim to enhance this lexicon with the information on individual types of Czech diatheses. In order to reduce the amount of necessary manual annotation, a semi-automatic method is developed. This method draws evidence from a large morphologically annotated corpus, relying on grammatical constraints on the applicability of individual types of diatheses.
2013
pdf
bib
The Representation of Czech Light Verb Constructions in a Valency Lexicon
Václava Kettnerová
|
Markéta Lopatková
Proceedings of the Second International Conference on Dependency Linguistics (DepLing 2013)
pdf
bib
A Case Study of a Free Word Order
Vladislav Kuboň
|
Markéta Lopatková
|
Jiří Mírovský
Proceedings of the 27th Pacific Asia Conference on Language, Information, and Computation (PACLIC 27)
2010
pdf
bib
Proceedings of the ACL 2010 Student Research Workshop
Seniz Demir
|
Jan Raab
|
Nils Reiter
|
Marketa Lopatkova
|
Tomek Strzalkowski
Proceedings of the ACL 2010 Student Research Workshop
pdf
bib
abs
Mapping between Dependency Structures and Compositional Semantic Representations
Max Jakob
|
Markéta Lopatková
|
Valia Kordoni
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)
This paper investigates the mapping between two semantic formalisms, namely the tectogrammatical layer of the Prague Dependency Treebank 2.0 (PDT) and (Robust) Minimal Recursion Semantics ((R)MRS). It is a first attempt to relate the dependency-based annotation scheme of PDT to a compositional semantics approach like (R)MRS. A mapping algorithm that converts PDT trees to (R)MRS structures is developed, associating (R)MRSs to each node on the dependency tree. Furthermore, composition rules are formulated and the relation between dependency in PDT and semantic heads in (R)MRS is analyzed. It turns out that structure and dependencies, morphological categories and some coreferences can be preserved in the target structures. Moreover, valency and free modifications are distinguished using the valency dictionary of PDT as an additional resource. The validation results show that systematically correct underspecified target representations can be obtained by a rule-based mapping approach, which is an indicator that (R)MRS is indeed robust in relation to the formal representation of Czech data. This finding is novel, for Czech, with its free word order and rich morphology, is typologically different than languages analyzed with (R)MRS to date.
2009
pdf
bib
Annotation of Sentence Structure; Capturing the Relationship among Clauses in Czech Sentences
Markéta Lopatková
|
Natalia Klyueva
|
Petr Homola
Proceedings of the Third Linguistic Annotation Workshop (LAW III)
2006
pdf
bib
abs
Valency Lexicon of Czech Verbs: Alternation-Based Model
Markéta Lopatková
|
Zdeněk Žabokrtský
|
Karolina Skwarska
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
The main objective of this paper is to introduce an alternation-based model of valency lexicon of Czech verbs VALLEX. Alternations describe regular changes in valency structure of verbs -- they are seen as transformations taking one lexical unit and return a modified lexical unit as a result. We characterize and exemplify syntactically-based and semantically-based' alternations and their effects on verb argument structure. The alternation-based model allows to distinguish a minimal form of lexicon, which provides compact characterization of valency structure of Czech verbs, and an expanded form of lexicon useful for some applications.
2004
pdf
bib
Valency Frames of Czech Verbs in VALLEX 1.0
Zdeněk Žabokrtský
|
Markéta Lopatková
Proceedings of the Workshop Frontiers in Corpus Annotation at HLT-NAACL 2004
2002
pdf
bib
Valency Dictionary of Czech Verbs: Complex Tectogrammatical Annotation
Markéta Straňáková-Lopatková
|
Zdenĕk Žabokrtský
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)