FBK@IWSLT 2012 – ASR track

D. Falavigna, R. Gretter, F. Brugnara, D. Giuliani


Abstract
This paper reports on the participation of FBK in the IWSLT 2012 evaluation campaign on automatic speech recognition, namely in the English ASR track. Both primary and contrastive submissions were sent for evaluation. The ASR system features acoustic models trained on a portion of the TED talk recordings that was automatically selected according to the fidelity of the provided transcriptions. Three decoding steps are performed, interleaved with acoustic feature normalization and acoustic model adaptation. A final rescoring step, based on an interpolated language model, is applied to the word graphs generated in the third decoding step. For the primary submission, the language models entering the interpolation are trained on both out-of-domain and in-domain text data; the contrastive submission instead uses both "general purpose" and auxiliary language models trained only on out-of-domain text data. Despite this difference, the two submissions obtain similar performance.
Anthology ID:
2012.iwslt-evaluation.9
Volume:
Proceedings of the 9th International Workshop on Spoken Language Translation: Evaluation Campaign
Month:
December 6-7
Year:
2012
Address:
Hong Kong
Venue:
IWSLT
SIG:
SIGSLT
Pages:
81–86
URL:
https://aclanthology.org/2012.iwslt-evaluation.9
Cite (ACL):
D. Falavigna, R. Gretter, F. Brugnara, and D. Giuliani. 2012. FBK@IWSLT 2012 – ASR track. In Proceedings of the 9th International Workshop on Spoken Language Translation: Evaluation Campaign, pages 81–86, Hong Kong.
Cite (Informal):
FBK@IWSLT 2012 – ASR track (Falavigna et al., IWSLT 2012)
PDF:
https://aclanthology.org/2012.iwslt-evaluation.9.pdf