Combining Language Resources Into A Grammar-Driven Swedish Parser

Malin Ahlberg, Ramona Enache


Abstract
This paper describes work on a rule-based, open-source parser for Swedish. The central component is a wide-coverage grammar implemented in the GF formalism (Grammatical Framework), a dependently typed grammar formalism based on Martin-Löf type theory. GF has strong support for multilinguality and has so far been used successfully for controlled languages and recent experiments have showed that it is also possible to use the framework for parsing unrestricted language. In addition to GF, we use two other main resources: the Swedish treebank Talbanken and the electronic lexicon SALDO. By combining the grammar with a lexicon extracted from SALDO we obtain a parser accepting all sentences described by the given rules. We develop and test this on examples from Talbanken. The resulting parser gives a full syntactic analysis of the input sentences. It will be highly reusable, freely available, and as GF provides libraries for compiling grammars to a number of programming languages, chosen parts of the the grammar may be used in various NLP applications.
Anthology ID:
L12-1176
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
1971–1976
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/360_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Malin Ahlberg and Ramona Enache. 2012. Combining Language Resources Into A Grammar-Driven Swedish Parser. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1971–1976, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Combining Language Resources Into A Grammar-Driven Swedish Parser (Ahlberg & Enache, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/360_Paper.pdf