@inproceedings{bianco-etal-2025-multilingual,
  title     = {Multilingual Sign Language Translation with Unified Datasets and Pose-Based Transformers},
  author    = {Bianco, Pedro Alejandro Dal and
               Stanchi, Oscar Agust{\'i}n and
               Quiroga, Facundo Manuel and
               Ronchetti, Franco},
  editor    = {Hasanuzzaman, Mohammed and
               Quiroga, Facundo Manuel and
               Modi, Ashutosh and
               Kamila, Sabyasachi and
               Artiaga, Keren and
               Joshi, Abhinav and
               Singh, Sanjeet},
  booktitle = {Proceedings of the Workshop on Sign Language Processing (WSLP)},
  month     = dec,
  year      = {2025},
  address   = {IIT Bombay, Mumbai, India (Co-located with IJCNLP{--}AACL 2025)},
  publisher = {Association for Computational Linguistics},
  url       = {https://aclanthology.org/2025.wslp-main.5/},
  pages     = {27--32},
  isbn      = {979-8-89176-304-3},
  abstract  = {Sign languages are highly diverse across countries and regions, yet most Sign Language Translation (SLT) work remains monolingual. We explore a unified, multi-target SLT model trained jointly on four sign languages (German, Greek, Argentinian, Indian) using a standardized data layer. Our model operates on pose keypoints extracted with MediaPipe, yielding a lightweight and dataset-agnostic representation that is less sensitive to backgrounds, clothing, cameras, or signer identity while retaining motion and configuration cues. On RWTH-PHOENIX-Weather 2014T, Greek Sign Language Dataset, LSA-T, and ISLTranslate, naive joint training under a fully shared parameterization performs worse than monolingual baselines; however, a simple two-stage schedule: multilingual pre-training followed by a short language-specific fine-tuning, recovers and surpasses monolingual results on three datasets (PHOENIX14T: $+0.15$ BLEU-4; GSL: $+0.74$; ISL: $+0.10$) while narrowing the gap on the most challenging corpus (LSA-T: $-0.24$ vs. monolingual). Scores span from BLEU-4$\approx 1$ on open-domain news (LSA-T) to $>90$ on constrained curricula (GSL), highlighting the role of dataset complexity. We release our code to facilitate training and evaluation of multilingual SLT models.},
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="bianco-etal-2025-multilingual">
<titleInfo>
<title>Multilingual Sign Language Translation with Unified Datasets and Pose-Based Transformers</title>
</titleInfo>
<name type="personal">
<namePart type="given">Pedro</namePart>
<namePart type="given">Alejandro</namePart>
<namePart type="given">Dal</namePart>
<namePart type="family">Bianco</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Oscar</namePart>
<namePart type="given">Agustín</namePart>
<namePart type="family">Stanchi</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Facundo</namePart>
<namePart type="given">Manuel</namePart>
<namePart type="family">Quiroga</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Franco</namePart>
<namePart type="family">Ronchetti</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2025-12</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the Workshop on Sign Language Processing (WSLP)</title>
</titleInfo>
<name type="personal">
<namePart type="given">Mohammed</namePart>
<namePart type="family">Hasanuzzaman</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Facundo</namePart>
<namePart type="given">Manuel</namePart>
<namePart type="family">Quiroga</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Ashutosh</namePart>
<namePart type="family">Modi</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Sabyasachi</namePart>
<namePart type="family">Kamila</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Keren</namePart>
<namePart type="family">Artiaga</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Abhinav</namePart>
<namePart type="family">Joshi</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Sanjeet</namePart>
<namePart type="family">Singh</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">IIT Bombay, Mumbai, India (Co-located with IJCNLP–AACL 2025)</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-304-3</identifier>
</relatedItem>
<abstract>Sign languages are highly diverse across countries and regions, yet most Sign Language Translation (SLT) work remains monolingual. We explore a unified, multi-target SLT model trained jointly on four sign languages (German, Greek, Argentinian, Indian) using a standardized data layer. Our model operates on pose keypoints extracted with MediaPipe, yielding a lightweight and dataset-agnostic representation that is less sensitive to backgrounds, clothing, cameras, or signer identity while retaining motion and configuration cues. On RWTH-PHOENIX-Weather 2014T, Greek Sign Language Dataset, LSA-T, and ISLTranslate, naive joint training under a fully shared parameterization performs worse than monolingual baselines; however, a simple two-stage schedule: multilingual pre-training followed by a short language-specific fine-tuning, recovers and surpasses monolingual results on three datasets (PHOENIX14T: +0.15 BLEU-4; GSL: +0.74; ISL: +0.10) while narrowing the gap on the most challenging corpus (LSA-T: -0.24 vs. monolingual). Scores span from BLEU-4 ≈ 1 on open-domain news (LSA-T) to >90 on constrained curricula (GSL), highlighting the role of dataset complexity. We release our code to facilitate training and evaluation of multilingual SLT models.</abstract>
<identifier type="citekey">bianco-etal-2025-multilingual</identifier>
<location>
<url>https://aclanthology.org/2025.wslp-main.5/</url>
</location>
<part>
<date>2025-12</date>
<extent unit="page">
<start>27</start>
<end>32</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T Multilingual Sign Language Translation with Unified Datasets and Pose-Based Transformers
%A Bianco, Pedro Alejandro Dal
%A Stanchi, Oscar Agustín
%A Quiroga, Facundo Manuel
%A Ronchetti, Franco
%Y Hasanuzzaman, Mohammed
%Y Quiroga, Facundo Manuel
%Y Modi, Ashutosh
%Y Kamila, Sabyasachi
%Y Artiaga, Keren
%Y Joshi, Abhinav
%Y Singh, Sanjeet
%S Proceedings of the Workshop on Sign Language Processing (WSLP)
%D 2025
%8 December
%I Association for Computational Linguistics
%C IIT Bombay, Mumbai, India (Co-located with IJCNLP–AACL 2025)
%@ 979-8-89176-304-3
%F bianco-etal-2025-multilingual
%X Sign languages are highly diverse across countries and regions, yet most Sign Language Translation (SLT) work remains monolingual. We explore a unified, multi-target SLT model trained jointly on four sign languages (German, Greek, Argentinian, Indian) using a standardized data layer. Our model operates on pose keypoints extracted with MediaPipe, yielding a lightweight and dataset-agnostic representation that is less sensitive to backgrounds, clothing, cameras, or signer identity while retaining motion and configuration cues. On RWTH-PHOENIX-Weather 2014T, Greek Sign Language Dataset, LSA-T, and ISLTranslate, naive joint training under a fully shared parameterization performs worse than monolingual baselines; however, a simple two-stage schedule: multilingual pre-training followed by a short language-specific fine-tuning, recovers and surpasses monolingual results on three datasets (PHOENIX14T: +0.15 BLEU-4; GSL: +0.74; ISL: +0.10) while narrowing the gap on the most challenging corpus (LSA-T: -0.24 vs. monolingual). Scores span from BLEU-4 ≈ 1 on open-domain news (LSA-T) to >90 on constrained curricula (GSL), highlighting the role of dataset complexity. We release our code to facilitate training and evaluation of multilingual SLT models.
%U https://aclanthology.org/2025.wslp-main.5/
%P 27-32
Markdown (Informal)
[Multilingual Sign Language Translation with Unified Datasets and Pose-Based Transformers](https://aclanthology.org/2025.wslp-main.5/) (Bianco et al., WSLP 2025)
ACL