@inproceedings{tamiosso-etal-2026-evaluating,
    title     = {Evaluating Small Language Models for {English}-to-{Portuguese} Translation: Impact of Model Scale and Quantization},
    author    = {Tamiosso, Gustavo Lopes and
                 Nunes, Rafael Oleques and
                 Balreira, Dennis Giovani},
    editor    = {Souza, Marlo and
                 de-Dios-Flores, Iria and
                 Santos, Diana and
                 Freitas, Larissa and
                 Souza, Jackson Wilke da Cruz and
                 Ribeiro, Eug{\'e}nio},
    booktitle = {Proceedings of the 17th International Conference on Computational Processing of {Portuguese} ({PROPOR} 2026) - Vol. 1},
    month     = apr,
    year      = {2026},
    address   = {Salvador, Brazil},
    publisher = {Association for Computational Linguistics},
    url       = {https://aclanthology.org/2026.propor-1.94/},
    pages     = {943--952},
    isbn      = {979-8-89176-387-6},
    abstract  = {Small language models (SLMs) are increasingly adopted for machine translation due to their lower computational and deployment costs, yet a focused and systematic evaluation for English-to-Portuguese remains limited. We benchmarked dozens of SLMs (135M{--}20B parameters) across multiple architectures and quantization schemes (FP16, Q8{\_}0, Q4{\_}K{\_}M) on two datasets: FLORES-101 (Portuguese subset, 1,012 sentences) and the multidomain OPUS-100 dataset ({\textasciitilde}10k sentences). We computed lexical and semantic metrics (BLEU, chrF, and BERTScore) and assessed statistical differences using non-parametric Friedman tests over paired sentence-level scores, followed by Wilcoxon signed-rank post-hoc comparisons with Holm correction. Normality assumptions are evaluated using the Shapiro{--}Wilk test. Our results strongly suggest that 8-bit quantization (Q8{\_}0) preserves semantic quality with negligible average loss, while 4-bit quantization (Q4{\_}K{\_}M) reaches statistical significance in roughly half of model configurations, paired effect sizes (Cliff{'}s {\ensuremath{\delta}}) remain negligible to small in magnitude, with measurable degradation concentrated in lower-capacity models. Model scale exhibits only a weak correlation with translation quality: medium-sized models can match or outperform larger ones depending on model family and pretraining. These findings highlight trade-offs between efficiency and quality and inform the design of practical English{--}to-Portuguese translation pipelines based on SLMs.},
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="tamiosso-etal-2026-evaluating">
<titleInfo>
<title>Evaluating Small Language Models for English-to-Portuguese Translation: Impact of Model Scale and Quantization</title>
</titleInfo>
<name type="personal">
<namePart type="given">Gustavo</namePart>
<namePart type="given">Lopes</namePart>
<namePart type="family">Tamiosso</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Rafael</namePart>
<namePart type="given">Oleques</namePart>
<namePart type="family">Nunes</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Dennis</namePart>
<namePart type="given">Giovani</namePart>
<namePart type="family">Balreira</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2026-04</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1</title>
</titleInfo>
<name type="personal">
<namePart type="given">Marlo</namePart>
<namePart type="family">Souza</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Iria</namePart>
<namePart type="family">de-Dios-Flores</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Diana</namePart>
<namePart type="family">Santos</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Larissa</namePart>
<namePart type="family">Freitas</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jackson</namePart>
<namePart type="given">Wilke</namePart>
<namePart type="given">da</namePart>
<namePart type="given">Cruz</namePart>
<namePart type="family">Souza</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Eugénio</namePart>
<namePart type="family">Ribeiro</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Salvador, Brazil</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-387-6</identifier>
</relatedItem>
<abstract>Small language models (SLMs) are increasingly adopted for machine translation due to their lower computational and deployment costs, yet a focused and systematic evaluation for English-to-Portuguese remains limited. We benchmarked dozens of SLMs (135M–20B parameters) across multiple architectures and quantization schemes (FP16, Q8_0, Q4_K_M) on two datasets: FLORES-101 (Portuguese subset, 1,012 sentences) and the multidomain OPUS-100 dataset (~10k sentences). We computed lexical and semantic metrics (BLEU, chrF, and BERTScore) and assessed statistical differences using non-parametric Friedman tests over paired sentence-level scores, followed by Wilcoxon signed-rank post-hoc comparisons with Holm correction. Normality assumptions are evaluated using the Shapiro–Wilk test. Our results strongly suggest that 8-bit quantization (Q8_0) preserves semantic quality with negligible average loss, while 4-bit quantization (Q4_K_M) reaches statistical significance in roughly half of model configurations, paired effect sizes (Cliff’s δ) remain negligible to small in magnitude, with measurable degradation concentrated in lower-capacity models. Model scale exhibits only a weak correlation with translation quality: medium-sized models can match or outperform larger ones depending on model family and pretraining. These findings highlight trade-offs between efficiency and quality and inform the design of practical English–to-Portuguese translation pipelines based on SLMs.</abstract>
<identifier type="citekey">tamiosso-etal-2026-evaluating</identifier>
<location>
<url>https://aclanthology.org/2026.propor-1.94/</url>
</location>
<part>
<date>2026-04</date>
<extent unit="page">
<start>943</start>
<end>952</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T Evaluating Small Language Models for English-to-Portuguese Translation: Impact of Model Scale and Quantization
%A Tamiosso, Gustavo Lopes
%A Nunes, Rafael Oleques
%A Balreira, Dennis Giovani
%Y Souza, Marlo
%Y de-Dios-Flores, Iria
%Y Santos, Diana
%Y Freitas, Larissa
%Y Souza, Jackson Wilke da Cruz
%Y Ribeiro, Eugénio
%S Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
%D 2026
%8 April
%I Association for Computational Linguistics
%C Salvador, Brazil
%@ 979-8-89176-387-6
%F tamiosso-etal-2026-evaluating
%X Small language models (SLMs) are increasingly adopted for machine translation due to their lower computational and deployment costs, yet a focused and systematic evaluation for English-to-Portuguese remains limited. We benchmarked dozens of SLMs (135M–20B parameters) across multiple architectures and quantization schemes (FP16, Q8_0, Q4_K_M) on two datasets: FLORES-101 (Portuguese subset, 1,012 sentences) and the multidomain OPUS-100 dataset (~10k sentences). We computed lexical and semantic metrics (BLEU, chrF, and BERTScore) and assessed statistical differences using non-parametric Friedman tests over paired sentence-level scores, followed by Wilcoxon signed-rank post-hoc comparisons with Holm correction. Normality assumptions are evaluated using the Shapiro–Wilk test. Our results strongly suggest that 8-bit quantization (Q8_0) preserves semantic quality with negligible average loss, while 4-bit quantization (Q4_K_M) reaches statistical significance in roughly half of model configurations, paired effect sizes (Cliff’s δ) remain negligible to small in magnitude, with measurable degradation concentrated in lower-capacity models. Model scale exhibits only a weak correlation with translation quality: medium-sized models can match or outperform larger ones depending on model family and pretraining. These findings highlight trade-offs between efficiency and quality and inform the design of practical English–to-Portuguese translation pipelines based on SLMs.
%U https://aclanthology.org/2026.propor-1.94/
%P 943-952
Markdown (Informal)
[Evaluating Small Language Models for English-to-Portuguese Translation: Impact of Model Scale and Quantization](https://aclanthology.org/2026.propor-1.94/) (Tamiosso et al., PROPOR 2026)
ACL