The Political Speech Corpus of Bulgarian

Petya Osenova, Kiril Simov


Abstract
The paper introduces the Political Speech Corpus of Bulgarian. First, its current state has been discussed with respect to its size, coverage, genre specification and related online services. Then, the focus goes to the annotation details. On the one hand, the layers of linguistic annotation are presented. On the other hand, the compatibility with CLARIN technical Infrastructure is explained. Also, some user-based scenarios are mentioned to demonstrate the corpus services and applicability.
Anthology ID:
L12-1569
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
1744–1747
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/956_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Petya Osenova and Kiril Simov. 2012. The Political Speech Corpus of Bulgarian. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1744–1747, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
The Political Speech Corpus of Bulgarian (Osenova & Simov, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/956_Paper.pdf