Combining Rule-based and Data-driven Techniques for Grammatical Relation Extraction in Spoken Language

Kenji Sagae, Alon Lavie


Abstract
We investigate an aspect of the relationship between parsing and corpus-based methods in NLP that has received relatively little attention: coverage augmentation in rule-based parsers. In the specific task of determining grammatical relations (such as subjects and objects) in transcribed spoken language, we show that a combination of rule-based and corpus-based approaches, where a rule-based system is used as the teacher (or an automatic data annotator) to a corpus-based system, outperforms either system in isolation.
Anthology ID:
W03-3019
Volume:
Proceedings of the Eighth International Conference on Parsing Technologies
Month:
April
Year:
2003
Address:
Nancy, France
Venue:
IWPT
SIG:
SIGPARSE
Publisher:
Note:
Pages:
Language:
URL:
https://aclanthology.org/W03-3019/
DOI:
Bibkey:
Cite (ACL):
Kenji Sagae and Alon Lavie. 2003. Combining Rule-based and Data-driven Techniques for Grammatical Relation Extraction in Spoken Language. In Proceedings of the Eighth International Conference on Parsing Technologies, Nancy, France.
Cite (Informal):
Combining Rule-based and Data-driven Techniques for Grammatical Relation Extraction in Spoken Language (Sagae & Lavie, IWPT 2003)
Copy Citation:
PDF:
https://aclanthology.org/W03-3019.pdf