2022
pdf
bib
abs
Modeling Consistency Preference via Lexical Chains for Document-level Neural Machine Translation
Xinglin Lyu
|
Junhui Li
|
Shimin Tao
|
Hao Yang
|
Ying Qin
|
Min Zhang
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
In this paper, we aim to relieve the issue of lexical translation inconsistency in document-level neural machine translation (NMT) by modeling consistency preference for lexical chains, which consist of repeated words in a source-side document and provide a representation of the document's lexical consistency structure. Specifically, we first propose lexical-consistency attention to capture consistency context among words in the same lexical chain. Then, for each lexical chain we define and learn a consistency-tailored latent variable, which guides the translation of the corresponding sentences to enhance lexical translation consistency. Experimental results on Chinese→English and French→English document-level translation tasks show that our approach not only significantly improves translation performance in BLEU, but also substantially alleviates the problem of lexical translation inconsistency.
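Since the abstract defines lexical chains as groups of repeated words in a source-side document, the following is a minimal sketch of how such chains might be extracted; the lower-casing heuristic and the minimum chain length are illustrative assumptions, not details taken from the paper.

```python
from collections import defaultdict

def extract_lexical_chains(document, min_occurrences=2):
    """Group repeated words of a document into lexical chains.

    `document` is a list of tokenized sentences; a chain is represented as
    the list of (sentence_index, token_index) positions sharing a surface
    form. Lemmatization and stop-word filtering are assumed refinements.
    """
    positions = defaultdict(list)
    for s_idx, sentence in enumerate(document):
        for t_idx, token in enumerate(sentence):
            positions[token.lower()].append((s_idx, t_idx))
    # Only words repeated across the document form a chain.
    return {w: occ for w, occ in positions.items() if len(occ) >= min_occurrences}

# Example: the repeated word "bank" forms one chain spanning two sentences.
doc = [["the", "bank", "approved", "the", "loan"],
       ["the", "bank", "then", "raised", "rates"]]
print(extract_lexical_chains(doc))
```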
pdf
bib
abs
HW-TSC’s Submissions to the WMT 2022 General Machine Translation Shared Task
Daimeng Wei
|
Zhiqiang Rao
|
Zhanglin Wu
|
Shaojun Li
|
Yuanchang Luo
|
Yuhao Xie
|
Xiaoyu Chen
|
Hengchao Shang
|
Zongyao Li
|
Zhengzhe Yu
|
Jinlong Yang
|
Miaomiao Ma
|
Lizhi Lei
|
Hao Yang
|
Ying Qin
Proceedings of the Seventh Conference on Machine Translation (WMT)
This paper presents the submissions of Huawei Translate Services Center (HW-TSC) to the WMT 2022 General Machine Translation Shared Task. We participate in 6 language pairs, including Zh↔En, Ru↔En, Uk↔En, Hr↔En, Uk↔Cs and Liv↔En. We use the Transformer architecture and obtain the best performance via multiple variants with larger parameter sizes. We perform fine-grained pre-processing and filtering on the provided large-scale bilingual and monolingual datasets. For medium- and high-resource languages, we mainly use data augmentation strategies, including Back Translation, Self Training, Ensemble Knowledge Distillation, Multilingual Translation, etc. For low-resource languages such as Liv, we use pre-trained machine translation models and then continue training with Regularized Dropout (R-Drop). The previously mentioned data augmentation methods are also used. Our submissions obtain competitive results in the final evaluation.
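Back Translation, one of the augmentation strategies listed above, pairs target-side monolingual sentences with machine-generated sources. A minimal sketch follows; `reverse_model.translate` is a placeholder interface, not the HW-TSC implementation.

```python
def back_translate(target_monolingual, reverse_model, forward_corpus):
    """Augment a parallel corpus with synthetic source sentences.

    `reverse_model.translate` (target -> source) is a hypothetical interface;
    the synthetic pairs are mixed with genuine bilingual data when training
    the forward (source -> target) model.
    """
    synthetic_pairs = []
    for tgt_sentence in target_monolingual:
        synthetic_src = reverse_model.translate(tgt_sentence)  # target -> source
        synthetic_pairs.append((synthetic_src, tgt_sentence))
    # Genuine data first, synthetic data appended (tagging/upsampling omitted).
    return forward_corpus + synthetic_pairs
```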
pdf
bib
abs
Exploring Robustness of Machine Translation Metrics: A Study of Twenty-Two Automatic Metrics in the WMT22 Metric Task
Xiaoyu Chen
|
Daimeng Wei
|
Hengchao Shang
|
Zongyao Li
|
Zhanglin Wu
|
Zhengzhe Yu
|
Ting Zhu
|
Mengli Zhu
|
Ning Xie
|
Lizhi Lei
|
Shimin Tao
|
Hao Yang
|
Ying Qin
Proceedings of the Seventh Conference on Machine Translation (WMT)
Contextual word embeddings extracted from pre-trained models have become the basis for many downstream NLP tasks, including machine translation automatic evaluation. Metrics that leverage embeddings claim to better capture synonyms and changes in word order, and thus to correlate better with human ratings than surface-form matching metrics (e.g., BLEU). However, few studies have examined the robustness of these metrics. This report uses a challenge set to uncover the brittleness of reference-based and reference-free metrics. Our challenge set aims at examining metrics' capability to relate synonyms in different areas and to discern catastrophic errors at both the word and sentence levels. The results show that although embedding-based metrics perform relatively well on discerning sentence-level negation/affirmation errors, their performance on relating synonyms is poor. In addition, we find that some metrics are susceptible to text styles, so their generalizability is compromised.
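The robustness probe described above can be reduced to comparing a metric's score on a correct hypothesis against a minimally perturbed one (synonym swap or negation). The sketch below uses a generic `metric` callable as a stand-in for any of the twenty-two metrics studied; the probe items are illustrative.

```python
def robustness_probe(metric, reference, correct_hyp, perturbed_hyp):
    """Return the score drop a metric assigns to a perturbation.

    `metric(hypothesis, reference)` is a placeholder for any scoring
    function (BLEU, BERTScore, COMET, ...). A robust metric should score
    a negated hypothesis well below the correct one, and a harmless
    synonym substitution only slightly below it.
    """
    return metric(correct_hyp, reference) - metric(perturbed_hyp, reference)

# Example probe items in the spirit of the challenge set.
reference = "The contract was not signed."
correct   = "The contract was not signed."
negated   = "The contract was signed."          # catastrophic negation error
synonym   = "The agreement was not signed."     # acceptable paraphrase
```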
pdf
bib
abs
Partial Could Be Better than Whole. HW-TSC 2022 Submission for the Metrics Shared Task
Yilun Liu
|
Xiaosong Qiao
|
Zhanglin Wu
|
Su Chang
|
Min Zhang
|
Yanqing Zhao
|
Song Peng
|
Shimin Tao
|
Hao Yang
|
Ying Qin
|
Jiaxin Guo
|
Minghan Wang
|
Yinglu Li
|
Peng Li
|
Xiaofeng Zhao
Proceedings of the Seventh Conference on Machine Translation (WMT)
In this paper, we present the contribution of HW-TSC to the WMT 2022 Metrics Shared Task. We propose one reference-based metric, HWTSC-EE-BERTScore*, and four reference-free metrics: HWTSC-Teacher-Sim, HWTSC-TLM, KG-BERTScore and CROSS-QE. Among these metrics, HWTSC-Teacher-Sim and CROSS-QE are supervised, whereas HWTSC-EE-BERTScore*, HWTSC-TLM and KG-BERTScore are unsupervised. We use these metrics in the segment-level and system-level tracks. Overall, our systems achieve strong results for all language pairs on previous test sets and a new state of the art in many system-level cases.
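HWTSC-EE-BERTScore* and KG-BERTScore build on BERTScore; as a reference point, the core BERTScore computation (greedy cosine matching of contextual token embeddings) can be sketched as below. The embedding arrays are placeholders; how they are obtained from a pre-trained encoder is omitted.

```python
import numpy as np

def bertscore_f1(hyp_emb, ref_emb):
    """Greedy-matching F1 over contextual token embeddings (BERTScore core).

    `hyp_emb` and `ref_emb` are (tokens x dim) arrays of contextual
    embeddings for the hypothesis and reference tokens.
    """
    hyp = hyp_emb / np.linalg.norm(hyp_emb, axis=1, keepdims=True)
    ref = ref_emb / np.linalg.norm(ref_emb, axis=1, keepdims=True)
    sim = hyp @ ref.T                       # pairwise cosine similarities
    precision = sim.max(axis=1).mean()      # each hyp token matched to its best ref token
    recall = sim.max(axis=0).mean()         # each ref token matched to its best hyp token
    return 2 * precision * recall / (precision + recall)
```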
pdf
bib
abs
HW-TSC’s Submission for the WMT22 Efficiency Task
Hengchao Shang
|
Ting Hu
|
Daimeng Wei
|
Zongyao Li
|
Xianzhi Yu
|
Jianfei Feng
|
Ting Zhu
|
Lizhi Lei
|
Shimin Tao
|
Hao Yang
|
Ying Qin
|
Jinlong Yang
|
Zhiqiang Rao
|
Zhengzhe Yu
Proceedings of the Seventh Conference on Machine Translation (WMT)
This paper presents the submission of Huawei Translation Services Center (HW-TSC) to the WMT 2022 Efficiency Shared Task. For this year's task, we again apply a sentence-level distillation strategy to train small models with different configurations. We then integrate the average attention mechanism into the lightweight RNN model to pursue more efficient decoding. We try adding a retraining step to our 8-bit and 4-bit models to achieve a balance between model size and quality. We continue to use Huawei Noah's Bolt for INT8 inference and 4-bit storage. Coupled with Bolt's support for batch inference and multi-core parallel computing, we submit models with different configurations to the CPU latency and throughput tracks to explore the Pareto frontiers.
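The 8-bit models mentioned above rely on weight quantization. A minimal symmetric per-tensor INT8 scheme is sketched below; it is not necessarily the scheme Bolt uses internally.

```python
import numpy as np

def quantize_int8(weights):
    """Symmetric per-tensor INT8 quantization of a weight matrix."""
    scale = np.abs(weights).max() / 127.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover an approximate float matrix for quality checks."""
    return q.astype(np.float32) * scale

w = np.random.randn(4, 4).astype(np.float32)
q, s = quantize_int8(w)
print(np.abs(w - dequantize(q, s)).max())   # error is bounded by roughly scale / 2
```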
pdf
bib
abs
HW-TSC Translation Systems for the WMT22 Biomedical Translation Task
Zhanglin Wu
|
Jinlong Yang
|
Zhiqiang Rao
|
Zhengzhe Yu
|
Daimeng Wei
|
Xiaoyu Chen
|
Zongyao Li
|
Hengchao Shang
|
Shaojun Li
|
Ming Zhu
|
Yuanchang Luo
|
Yuhao Xie
|
Miaomiao Ma
|
Ting Zhu
|
Lizhi Lei
|
Song Peng
|
Hao Yang
|
Ying Qin
Proceedings of the Seventh Conference on Machine Translation (WMT)
This paper describes the translation systems trained by Huawei Translation Services Center (HW-TSC) for the WMT22 biomedical translation task in five language pairs: English↔German (en↔de), English↔French (en↔fr), English↔Chinese (en↔zh), English↔Russian (en↔ru) and Spanish→English (es→en). Our primary systems are built on a deep Transformer with a large filter size. We also utilize R-Drop, data diversification, forward translation, back translation, data selection, fine-tuning and ensembling to improve system performance. According to the official evaluation results on OCELoT or CodaLab, our unconstrained systems in en→de, de→en, en→fr, fr→en, en→zh and es→en (clinical terminology sub-track) achieve the highest BLEU scores among all submissions to the WMT22 biomedical translation task.
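R-Drop, used here and in several other submissions on this page, regularizes two dropout-perturbed forward passes of the same model toward each other. A minimal PyTorch-style sketch follows; the model interface, tensor shapes and the weight `alpha` are illustrative assumptions, and token-level masking/averaging is simplified.

```python
import torch.nn.functional as F

def r_drop_loss(model, src, tgt, alpha=5.0):
    """Cross-entropy plus symmetric KL between two dropout-perturbed passes.

    `model(src, tgt)` is assumed to return per-token logits of shape
    (batch, seq, vocab) with dropout active during training.
    """
    logits1 = model(src, tgt)
    logits2 = model(src, tgt)                     # second pass, different dropout mask
    ce = 0.5 * (F.cross_entropy(logits1.transpose(1, 2), tgt)
                + F.cross_entropy(logits2.transpose(1, 2), tgt))
    kl = 0.5 * (F.kl_div(F.log_softmax(logits1, dim=-1), F.softmax(logits2, dim=-1),
                         reduction="batchmean")
                + F.kl_div(F.log_softmax(logits2, dim=-1), F.softmax(logits1, dim=-1),
                           reduction="batchmean"))
    return ce + alpha * kl
```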
pdf
bib
abs
HW-TSC Translation Systems for the WMT22 Chat Translation Task
Jinlong Yang
|
Zongyao Li
|
Daimeng Wei
|
Hengchao Shang
|
Xiaoyu Chen
|
Zhengzhe Yu
|
Zhiqiang Rao
|
Shaojun Li
|
Zhanglin Wu
|
Yuhao Xie
|
Yuanchang Luo
|
Ting Zhu
|
Yanqing Zhao
|
Lizhi Lei
|
Hao Yang
|
Ying Qin
Proceedings of the Seventh Conference on Machine Translation (WMT)
This paper describes the submissions of Huawei Translation Services Center (HW-TSC) to the WMT22 chat translation shared task on English-German (en-de) in both directions, with results for the zero-shot and few-shot tracks. We use a deep Transformer architecture with a larger parameter size. Our submissions to the WMT21 News Translation task are used as the baselines. We adopt strategies such as back translation, forward translation, domain transfer, data selection and noisy forward translation in this task, and achieve competitive results on the development set. We also test the effectiveness of document-level translation on the chat task; due to the lack of chat data, results on the development set show that it is not as effective as sentence-level translation models.
pdf
bib
abs
HW-TSC Systems for WMT22 Very Low Resource Supervised MT Task
Shaojun Li
|
Yuanchang Luo
|
Daimeng Wei
|
Zongyao Li
|
Hengchao Shang
|
Xiaoyu Chen
|
Zhanglin Wu
|
Jinlong Yang
|
Zhiqiang Rao
|
Zhengzhe Yu
|
Yuhao Xie
|
Lizhi Lei
|
Hao Yang
|
Ying Qin
Proceedings of the Seventh Conference on Machine Translation (WMT)
This paper describes the submissions of Huawei Translation Services Center (HW-TSC) to the WMT22 Very Low Resource Supervised MT task. We participate in all 6 supervised tracks, covering all combinations between Upper/Lower Sorbian (Hsb/Dsb) and German (De). Our systems are built on a deep Transformer with a large filter size. We use multilingual transfer with German-Czech (De-Cs) and German-Polish (De-Pl) parallel data. We also utilize regularized dropout (R-Drop), back translation, fine-tuning and ensembling to improve system performance. According to the official evaluation results on OCELoT, our supervised systems in all 6 language directions achieve the highest BLEU scores among all submissions. Our pre-trained multilingual model for unsupervised De2Dsb and Dsb2De translation also achieves the highest BLEU scores.
pdf
bib
abs
HW-TSC’s Submissions to the WMT22 Word-Level Auto Completion Task
Hao Yang
|
Hengchao Shang
|
Zongyao Li
|
Daimeng Wei
|
Xianghui He
|
Xiaoyu Chen
|
Zhengzhe Yu
|
Jiaxin Guo
|
Jinlong Yang
|
Shaojun Li
|
Yuanchang Luo
|
Yuhao Xie
|
Lizhi Lei
|
Ying Qin
Proceedings of the Seventh Conference on Machine Translation (WMT)
This paper presents the submissions of Huawei Translation Services Center (HW-TSC) to the WMT 2022 Word-Level AutoCompletion Task. We propose an end-to-end autoregressive model with bi-context based on the Transformer to solve the task. The model uses a mixture of subword and character encoding units to realize the joint encoding of the human input, the target-side context and the decoded sequence, which ensures full utilization of the available information. We use one model to handle all four types of data structures in the task. During training, we try using a machine translation model as the pre-trained model and fine-tune it for the task. We also add BERT-style MLM data at the fine-tuning stage to improve model performance. We participate in the zh→en, en→de and de→en directions and win first place in all three tracks. In particular, we outperform the second place by more than 5% in terms of accuracy on the zh→en and en→de tracks. The result is buttressed by human evaluations as well, demonstrating the effectiveness of our model.
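One way to picture the word-level autocompletion setting described above is through the shape of a training example: a word to be completed, the characters the human has already typed, and the surrounding target-side context. The field names and prefix-sampling policy below are illustrative, not the paper's exact data format.

```python
import random

def make_wlac_example(target_tokens, rng=random):
    """Build one word-level autocompletion training example.

    From a target sentence we pick a word to complete, keep a character
    prefix as the simulated human input, and expose the remaining tokens
    as left/right context.
    """
    i = rng.randrange(len(target_tokens))
    word = target_tokens[i]
    prefix_len = rng.randint(1, max(1, len(word) - 1))
    return {
        "left_context": target_tokens[:i],
        "right_context": target_tokens[i + 1:],
        "typed_chars": list(word[:prefix_len]),   # character-level human input
        "label": word,                            # word to be completed
    }

print(make_wlac_example(["wir", "haben", "das", "Dokument", "übersetzt"]))
```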
pdf
bib
abs
The HW-TSC’s Offline Speech Translation System for IWSLT 2022 Evaluation
Yinglu Li
|
Minghan Wang
|
Jiaxin Guo
|
Xiaosong Qiao
|
Yuxia Wang
|
Daimeng Wei
|
Chang Su
|
Yimeng Chen
|
Min Zhang
|
Shimin Tao
|
Hao Yang
|
Ying Qin
Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022)
This paper describes the design of the HW-TSC Offline Speech Translation System submitted to the IWSLT 2022 Evaluation. We explored both cascade and end-to-end systems on three language tracks (en-de, en-zh and en-ja), and chose the cascade one as our primary submission. For the automatic speech recognition (ASR) part of the cascade system, we use three ASR models, Conformer, S2T-Transformer and U2, trained on a mixture of five datasets. During inference, transcripts are generated with the help of a domain-controlled generation strategy. Context-aware reranking and an ensemble-based anti-interference strategy are proposed to produce better ASR outputs. For the machine translation part, we pretrained three translation models on the WMT21 dataset and fine-tuned them on in-domain corpora. Our cascade system shows competitive performance compared with known offline systems from industry and academia.
pdf
bib
abs
The HW-TSC’s Simultaneous Speech Translation System for IWSLT 2022 Evaluation
Minghan Wang
|
Jiaxin Guo
|
Yinglu Li
|
Xiaosong Qiao
|
Yuxia Wang
|
Zongyao Li
|
Chang Su
|
Yimeng Chen
|
Min Zhang
|
Shimin Tao
|
Hao Yang
|
Ying Qin
Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022)
This paper presents our work in the IWSLT 2022 simultaneous speech translation evaluation. For the text-to-text (T2T) track, we participate in three language pairs and build a wait-k based simultaneous MT (SimulMT) model for the task. The model was pretrained on WMT21 news corpora and further improved with in-domain fine-tuning and self-training. For the speech-to-text (S2T) track, we design both cascade and end-to-end systems for three language pairs. The cascade system is composed of a chunking-based streaming ASR model and the SimulMT model used in the T2T track. The end-to-end system is a simultaneous speech translation (SimulST) model based on the wait-k strategy, trained directly on a synthetic corpus produced by translating all texts of the ASR corpora into the target language with an offline MT model. It also contains a heuristic sentence-breaking strategy that prevents it from finishing the translation before the end of the speech. We evaluate our systems on the MuST-C tst-COMMON dataset and show that the end-to-end system is competitive with the cascade one. Meanwhile, we also demonstrate that the SimulMT model can be efficiently optimized by these approaches, resulting in improvements of 1-2 BLEU points.
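The wait-k policy underlying the SimulMT and SimulST models above is a simple read/write schedule: read k source tokens first, then alternate one write per additional read. A minimal sketch follows, with `decode_next` standing in for the actual incremental decoder.

```python
def wait_k_policy(source_stream, decode_next, k=3):
    """Generate target tokens under a wait-k read/write schedule.

    `source_stream` yields source tokens as they arrive; `decode_next`
    (a placeholder for the incremental decoder) maps the current source
    prefix and target prefix to the next target token, or None when the
    translation should stop.
    """
    source_prefix, target = [], []
    for token in source_stream:
        source_prefix.append(token)
        if len(source_prefix) < k:
            continue                       # still waiting for the first k tokens
        nxt = decode_next(source_prefix, target)
        if nxt is not None:
            target.append(nxt)             # one write per additional read
    # Source exhausted: finish the remaining target tokens.
    while (nxt := decode_next(source_prefix, target)) is not None:
        target.append(nxt)
    return target
```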
pdf
bib
abs
The HW-TSC’s Speech to Speech Translation System for IWSLT 2022 Evaluation
Jiaxin Guo
|
Yinglu Li
|
Minghan Wang
|
Xiaosong Qiao
|
Yuxia Wang
|
Hengchao Shang
|
Chang Su
|
Yimeng Chen
|
Min Zhang
|
Shimin Tao
|
Hao Yang
|
Ying Qin
Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022)
The paper presents the HW-TSC pipeline and results for Offline Speech to Speech Translation at IWSLT 2022. We design a cascade system consisting of an ASR model, a machine translation model and a TTS model to convert speech from one language into another (en-de). For the ASR part, we find that better performance can be obtained by ensembling multiple heterogeneous ASR models and reranking the beam candidates. We also find that combining a context-aware reranking strategy with an MT model fine-tuned on the in-domain dataset improves performance, as it mitigates transcript inconsistencies caused by the lack of context. Finally, we use the officially provided VITS model to generate audio files from the translation hypotheses.
pdf
bib
abs
HW-TSC’s Participation in the IWSLT 2022 Isometric Spoken Language Translation
Zongyao Li
|
Jiaxin Guo
|
Daimeng Wei
|
Hengchao Shang
|
Minghan Wang
|
Ting Zhu
|
Zhanglin Wu
|
Zhengzhe Yu
|
Xiaoyu Chen
|
Lizhi Lei
|
Hao Yang
|
Ying Qin
Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022)
This paper presents our submissions to the IWSLT 2022 Isometric Spoken Language Translation task. We participate in all three language pairs (English-German, English-French, English-Spanish) under the constrained setting, and submit an English-German result under the unconstrained setting. We use the standard Transformer model as the baseline and obtain the best performance via one of its variants that shares the decoder input and output embedding. We perform detailed pre-processing and filtering on the provided bilingual data. Several strategies are used to train our models, such as Multilingual Translation, Back Translation, Forward Translation, R-Drop, Average Checkpoint and Ensemble. We investigate three methods for biasing the output length: i) conditioning the output on a given target-source length-ratio class; ii) enriching the Transformer positional embedding with length information; and iii) length-control decoding for non-autoregressive translation. Our submissions achieve 30.7, 41.6 and 36.7 BLEU respectively on the tst-COMMON test sets for the English-German, English-French and English-Spanish tasks, and 100% comply with the length requirements.
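Method i) above, conditioning the output on a target-source length-ratio class, is typically realized by prepending a ratio-bucket tag to the source during training. A minimal sketch follows; the three buckets and the 0.9/1.1 thresholds reflect the isometric-MT convention of roughly ±10% length compliance and are not stated in the abstract.

```python
def length_ratio_tag(src_tokens, tgt_tokens):
    """Map a training pair to a length-ratio class tag (short / normal / long)."""
    ratio = len(tgt_tokens) / max(1, len(src_tokens))
    if ratio < 0.9:
        return "<short>"
    if ratio > 1.1:
        return "<long>"
    return "<normal>"

def tag_source(src_tokens, tgt_tokens):
    # The tag is prepended to the source during training; at test time the
    # <normal> tag can be forced to bias the model toward length-compliant output.
    return [length_ratio_tag(src_tokens, tgt_tokens)] + src_tokens
```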
2021
pdf
bib
abs
HW-TSC’s Participation in the WMT 2021 News Translation Shared Task
Daimeng Wei
|
Zongyao Li
|
Zhanglin Wu
|
Zhengzhe Yu
|
Xiaoyu Chen
|
Hengchao Shang
|
Jiaxin Guo
|
Minghan Wang
|
Lizhi Lei
|
Min Zhang
|
Hao Yang
|
Ying Qin
Proceedings of the Sixth Conference on Machine Translation
This paper presents the submission of Huawei Translate Services Center (HW-TSC) to the WMT 2021 News Translation Shared Task. We participate in 7 language pairs, including Zh/En, De/En, Ja/En, Ha/En, Is/En, Hi/Bn, and Xh/Zu in both directions under the constrained condition. We use Transformer architecture and obtain the best performance via multiple variants with larger parameter sizes. We perform detailed pre-processing and filtering on the provided large-scale bilingual and monolingual datasets. Several commonly used strategies are used to train our models, such as Back Translation, Forward Translation, Multilingual Translation, Ensemble Knowledge Distillation, etc. Our submission obtains competitive results in the final evaluation.
pdf
bib
abs
HW-TSC’s Participation in the WMT 2021 Triangular MT Shared Task
Zongyao Li
|
Daimeng Wei
|
Hengchao Shang
|
Xiaoyu Chen
|
Zhanglin Wu
|
Zhengzhe Yu
|
Jiaxin Guo
|
Minghan Wang
|
Lizhi Lei
|
Min Zhang
|
Hao Yang
|
Ying Qin
Proceedings of the Sixth Conference on Machine Translation
This paper presents the submission of Huawei Translation Service Center (HW-TSC) to WMT 2021 Triangular MT Shared Task. We participate in the Russian-to-Chinese task under the constrained condition. We use Transformer architecture and obtain the best performance via a variant with larger parameter sizes. We perform detailed data pre-processing and filtering on the provided large-scale bilingual data. Several strategies are used to train our models, such as Multilingual Translation, Back Translation, Forward Translation, Data Denoising, Average Checkpoint, Ensemble, Fine-tuning, etc. Our system obtains 32.5 BLEU on the dev set and 27.7 BLEU on the test set, the highest score among all submissions.
pdf
bib
abs
HW-TSC’s Participation in the WMT 2021 Large-Scale Multilingual Translation Task
Zhengzhe Yu
|
Daimeng Wei
|
Zongyao Li
|
Hengchao Shang
|
Xiaoyu Chen
|
Zhanglin Wu
|
Jiaxin Guo
|
Minghan Wang
|
Lizhi Lei
|
Min Zhang
|
Hao Yang
|
Ying Qin
Proceedings of the Sixth Conference on Machine Translation
This paper presents the submission of Huawei Translation Services Center (HW-TSC) to the WMT 2021 Large-Scale Multilingual Translation Task. We participate in Small Track #2, which includes 6 languages, Javanese (Jv), Indonesian (Id), Malay (Ms), Tagalog (Tl), Tamil (Ta) and English (En), with 30 directions under the constrained condition. We use the Transformer architecture and obtain the best performance via multiple variants with larger parameter sizes. We train a single multilingual model to translate all 30 directions. We perform detailed pre-processing and filtering on the provided large-scale bilingual and monolingual datasets. Several commonly used strategies are used to train our models, such as Back Translation, Forward Translation, Ensemble Knowledge Distillation and Adapter Fine-tuning. Our model obtains competitive results in the end.
pdf
bib
abs
HW-TSC’s Participation in the WMT 2021 Efficiency Shared Task
Hengchao Shang
|
Ting Hu
|
Daimeng Wei
|
Zongyao Li
|
Jianfei Feng
|
ZhengZhe Yu
|
Jiaxin Guo
|
Shaojun Li
|
Lizhi Lei
|
ShiMin Tao
|
Hao Yang
|
Jun Yao
|
Ying Qin
Proceedings of the Sixth Conference on Machine Translation
This paper presents the submission of Huawei Translation Services Center (HW-TSC) to the WMT 2021 Efficiency Shared Task. We explore the sentence-level teacher-student distillation technique and train several small-size models that strike a balance between efficiency and quality. Our models feature a deep encoder, a shallow decoder and a light-weight RNN with an SSRU layer. We use Huawei Noah's Bolt, an efficient and light-weight library for on-device inference. Leveraging INT8 quantization, a self-defined General Matrix Multiplication (GEMM) operator, a shortlist, greedy search and caching, we submit four small and efficient translation models with high translation quality to the single-CPU-core latency track.
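Sentence-level teacher-student distillation, as used above, replaces the reference translations with the teacher's own outputs before training the small student. A minimal sketch with a placeholder `teacher.translate` interface:

```python
def distill_corpus(source_sentences, teacher):
    """Create a sentence-level distillation corpus.

    `teacher.translate` is a placeholder for beam-search decoding with the
    large teacher model; the student is then trained on (source, teacher
    output) pairs instead of (source, reference) pairs, which yields a
    smoother target distribution that small models can fit more easily.
    """
    return [(src, teacher.translate(src)) for src in source_sentences]
```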
pdf
bib
abs
HW-TSC’s Submissions to the WMT21 Biomedical Translation Task
Hao Yang
|
Zhanglin Wu
|
Zhengzhe Yu
|
Xiaoyu Chen
|
Daimeng Wei
|
Zongyao Li
|
Hengchao Shang
|
Minghan Wang
|
Jiaxin Guo
|
Lizhi Lei
|
Chuanfei Xu
|
Min Zhang
|
Ying Qin
Proceedings of the Sixth Conference on Machine Translation
This paper describes the submission of Huawei Translation Service Center (HW-TSC) to the WMT21 biomedical translation task in two language pairs: Chinese↔English and German↔English (our registered team name is HuaweiTSC). Technical details are introduced in this paper, including the model framework, data pre-processing methods and model enhancement strategies. In addition, using the WMT20 OK-aligned biomedical test set, we compare and analyze system performance under different strategies. On the WMT21 biomedical translation task, our systems in the English→Chinese and English→German directions achieve the highest BLEU scores among all submissions according to the official evaluation results.
2020
pdf
bib
abs
HW-TSC’s Participation in the WMT 2020 News Translation Shared Task
Daimeng Wei
|
Hengchao Shang
|
Zhanglin Wu
|
Zhengzhe Yu
|
Liangyou Li
|
Jiaxin Guo
|
Minghan Wang
|
Hao Yang
|
Lizhi Lei
|
Ying Qin
|
Shiliang Sun
Proceedings of the Fifth Conference on Machine Translation
This paper presents our work in the WMT 2020 News Translation Shared Task. We participate in 3 language pairs, including Zh/En, Km/En and Ps/En, in both directions under the constrained condition. We use the standard Transformer-Big model as the baseline and obtain the best performance via two variants with larger parameter sizes. We perform detailed pre-processing and filtering on the provided large-scale bilingual and monolingual datasets. Several commonly used strategies are used to train our models, such as Back Translation, Ensemble Knowledge Distillation, etc. We also conduct experiments with similar-language augmentation, which lead to positive results, although they are not used in our submission. Our submission obtains remarkable results in the final evaluation.
pdf
bib
abs
HW-TSC’s Participation at WMT 2020 Automatic Post Editing Shared Task
Hao Yang
|
Minghan Wang
|
Daimeng Wei
|
Hengchao Shang
|
Jiaxin Guo
|
Zongyao Li
|
Lizhi Lei
|
Ying Qin
|
Shimin Tao
|
Shiliang Sun
|
Yimeng Chen
Proceedings of the Fifth Conference on Machine Translation
The paper presents the submission by HW-TSC to the WMT 2020 Automatic Post Editing Shared Task. We participate in the English-German and English-Chinese language pairs. Our system is built on the Transformer, pre-trained on the WMT 2019 and WMT 2020 News Translation corpora and fine-tuned on the APE corpus. Bottleneck Adapter Layers are integrated into the model to prevent over-fitting. We further collect external translations as augmented MT candidates to improve performance. The experiments demonstrate that pre-trained NMT models are effective when fine-tuned on an APE corpus of limited size, and that performance can be further improved with external MT augmentation. Our system achieves competitive results in both directions in the final evaluation.
pdf
bib
abs
HW-TSC’s Participation at WMT 2020 Quality Estimation Shared Task
Minghan Wang
|
Hao Yang
|
Hengchao Shang
|
Daimeng Wei
|
Jiaxin Guo
|
Lizhi Lei
|
Ying Qin
|
Shimin Tao
|
Shiliang Sun
|
Yimeng Chen
|
Liangyou Li
Proceedings of the Fifth Conference on Machine Translation
This paper presents our work in the WMT 2020 Word and Sentence-Level Post-Editing Quality Estimation (QE) Shared Task. Our system follows the standard Predictor-Estimator architecture, with a pre-trained Transformer as the Predictor and task-specific classifiers and regressors as Estimators. We integrate Bottleneck Adapter Layers in the Predictor to improve transfer learning efficiency and prevent over-fitting. At the same time, we jointly train the word- and sentence-level tasks with a unified model via multitask learning. Pseudo-PE assisted QE (PEAQE) is proposed, resulting in significant performance improvements. Our submissions achieve competitive results in the word- and sentence-level sub-tasks for both the En-De and En-Zh language pairs.
pdf
bib
abs
Efficient Transfer Learning for Quality Estimation with Bottleneck Adapter Layer
Hao Yang
|
Minghan Wang
|
Ning Xie
|
Ying Qin
|
Yao Deng
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation
The Predictor-Estimator framework for quality estimation (QE) is commonly used for its strong performance, where the predictor and estimator handle feature extraction and quality evaluation, respectively. However, training the predictor from scratch is computationally expensive. In this paper, we propose an efficient transfer learning framework to transfer knowledge from NMT datasets into QE models. A Predictor-Estimator-like model named BAL-QE is also proposed, which extracts high-quality features with a pre-trained NMT model and performs classification with a fine-tuned Bottleneck Adapter Layer (BAL). Experiments show that BAL-QE achieves 97% of the SOTA performance on the WMT19 En-De and En-Ru QE tasks by training only 3% of the parameters within 4 hours on 4 Titan XP GPUs. Compared with the commonly used NuQE baseline, BAL-QE achieves performance gains of 47% (En-Ru) and 75% (En-De).
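A Bottleneck Adapter Layer of the kind used in BAL-QE and in the APE/QE submissions above is a small down-project/up-project block with a residual connection, inserted into a frozen pre-trained model so that only the adapter parameters are fine-tuned. A minimal PyTorch sketch follows; the hidden and bottleneck sizes are illustrative.

```python
import torch.nn as nn

class BottleneckAdapter(nn.Module):
    """Down-project -> nonlinearity -> up-project, with a residual connection.

    Inserted after a (frozen) Transformer sub-layer; only these parameters
    are updated during fine-tuning, keeping the trainable fraction small.
    """
    def __init__(self, hidden_size=768, bottleneck_size=64):
        super().__init__()
        self.down = nn.Linear(hidden_size, bottleneck_size)
        self.up = nn.Linear(bottleneck_size, hidden_size)
        self.act = nn.ReLU()

    def forward(self, hidden_states):
        return hidden_states + self.up(self.act(self.down(hidden_states)))
```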
pdf
bib
abs
Unified Humor Detection Based on Sentence-pair Augmentation and Transfer Learning
Minghan Wang
|
Hao Yang
|
Ying Qin
|
Shiliang Sun
|
Yao Deng
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation
We propose a unified multilingual model for humor detection which can be trained under a transfer learning framework. 1) The model is built on pre-trained multilingual BERT and is thereby able to make predictions on Chinese, Russian and Spanish corpora. 2) We move beyond single-sentence classification and propose sequence-pair prediction, which considers the inter-sentence relationship. 3) We propose the Sentence Discrepancy Prediction (SDP) loss, which aims to measure the semantic discrepancy of the sequence pair, as often appears between the setup and punchline of a joke. Our method achieves two state-of-the-art results and one second place on three humor detection corpora in three languages (Russian, Spanish and Chinese), and improves the F1-score by 4%-6%, which demonstrates its effectiveness in humor detection tasks.
pdf
bib
abs
HW-TSC’s Participation in the WAT 2020 Indic Languages Multilingual Task
Zhengzhe Yu
|
Zhanglin Wu
|
Xiaoyu Chen
|
Daimeng Wei
|
Hengchao Shang
|
Jiaxin Guo
|
Zongyao Li
|
Minghan Wang
|
Liangyou Li
|
Lizhi Lei
|
Hao Yang
|
Ying Qin
Proceedings of the 7th Workshop on Asian Translation
This paper describes our work in the WAT 2020 Indic Multilingual Translation Task. We participated in all 7 language pairs (En<->Bn/Hi/Gu/Ml/Mr/Ta/Te) in both directions under the constrained condition, using only the officially provided data. Using the Transformer as a baseline, our Multi->En and En->Multi translation systems achieve the best performance. Detailed data filtering and data domain selection are the keys to performance enhancement in our experiments, with an average improvement of 2.6 BLEU for each language pair in the En->Multi system and an average improvement of 4.6 BLEU for the Multi->En system. In addition, we employed a language-independent adapter to further improve system performance. Our submission obtains competitive results in the final evaluation.
pdf
bib
abs
The HW-TSC Video Speech Translation System at IWSLT 2020
Minghan Wang
|
Hao Yang
|
Yao Deng
|
Ying Qin
|
Lizhi Lei
|
Daimeng Wei
|
Hengchao Shang
|
Ning Xie
|
Xiaochun Li
|
Jiaxian Guo
Proceedings of the 17th International Conference on Spoken Language Translation
The paper presents details of our system in the IWSLT Video Speech Translation evaluation. The system works in a cascade form and contains three modules: 1) a proprietary ASR system; 2) a disfluency correction system that aims to remove interregnums and other disfluent expressions with a fine-tuned BERT and a series of rule-based algorithms; and 3) an NMT system based on the Transformer and trained on a massive publicly available corpus.
2015
pdf
bib
Truly Exploring Multiple References for Machine Translation Evaluation
Ying Qin
|
Lucia Specia
Proceedings of the 18th Annual Conference of the European Association for Machine Translation
pdf
bib
Testing and Comparing Computational Approaches for Identifying the Language of Framing in Political News
Eric Baumer
|
Elisha Elovic
|
Ying Qin
|
Francesca Polletta
|
Geri Gay
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
2011
pdf
bib
Forward-backward Machine Transliteration between English and Chinese Based on Combined CRFs
Ying Qin
|
GuoHua Chen
Proceedings of the 3rd Named Entities Workshop (NEWS 2011)
2008
pdf
bib
BUPT Systems in the SIGHAN Bakeoff 2007
Ying Qin
|
Caixia Yuan
|
Jiashen Sun
|
Xiaojie Wang
Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing
2006
pdf
bib
Word Segmentation and Named Entity Recognition for SIGHAN Bakeoff3
Suxiang Zhang
|
Ying Qin
|
Juan Wen
|
Xiaojie Wang
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing