Enriching SCFG rules directly from efficient bilingual chart parsing

Martin Čmejrek, Bowen Zhou, Bing Xiang


Abstract
In this paper, we propose a new method for training translation rules for a Synchronous Context-free Grammar. A bilingual chart parser is used to generate the parse forest, and EM algorithm to estimate expected counts for each rule of the ruleset. Additional rules are constructed as combinations of reliable rules occurring in the parse forest. The new method of proposing additional translation rules is independent of word alignments. We present the theoretical background for this method, and initial experimental results on German-English translations of Europarl data.
Anthology ID:
2009.iwslt-papers.2
Volume:
Proceedings of the 6th International Workshop on Spoken Language Translation: Papers
Month:
December 1-2
Year:
2009
Address:
Tokyo, Japan
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
Note:
Pages:
136–143
Language:
URL:
https://aclanthology.org/2009.iwslt-papers.2
DOI:
Bibkey:
Cite (ACL):
Martin Čmejrek, Bowen Zhou, and Bing Xiang. 2009. Enriching SCFG rules directly from efficient bilingual chart parsing. In Proceedings of the 6th International Workshop on Spoken Language Translation: Papers, pages 136–143, Tokyo, Japan.
Cite (Informal):
Enriching SCFG rules directly from efficient bilingual chart parsing (Čmejrek et al., IWSLT 2009)
Copy Citation:
PDF:
https://aclanthology.org/2009.iwslt-papers.2.pdf
Presentation:
 2009.iwslt-papers.2.Presentation.pdf