Azadeh Mirzaei
2018
Persian Discourse Treebank and coreference corpus
Azadeh Mirzaei
|
Pegah Safari
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
2016
Persian Proposition Bank
Azadeh Mirzaei
|
Amirsaeid Moloodi
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
This paper describes the procedure of semantic role labeling and the development of the first manually annotated Persian Proposition Bank (PerPB) which added a layer of predicate-argument information to the syntactic structures of Persian Dependency Treebank (known as PerDT). Through the process of annotating, the annotators could see the syntactic information of all the sentences and so they annotated 29982 sentences with more than 9200 unique verbs. In the annotation procedure, the direct syntactic dependents of the verbs were the first candidates for being annotated. So we did not annotate the other indirect dependents unless their phrasal heads were propositional and had their own arguments or adjuncts. Hence besides the semantic role labeling of verbs, the argument structure of 1300 unique propositional nouns and 300 unique propositional adjectives were annotated in the sentences, too. The accuracy of annotation process was measured by double annotation of the data at two separate stages and finally the data was prepared in the CoNLL dependency format.