Contrastive and Adversarial Disentanglement for Speaker Representations in Brazilian Portuguese

Ariadne Nascimento Matos; Arnaldo Candido Junior; Moacir Antonelli Ponti

Contrastive and Adversarial Disentanglement for Speaker Representations in Brazilian Portuguese

Ariadne Nascimento Matos, Arnaldo Candido Junior, Moacir Antonelli Ponti

Abstract

In this work, we study disentanglement between speaker and environment by combining an adversarial framework with contrastive learning objectives. We investigate supervised contrastive learning (SupCon), which exploits environment labels to structure the environment subspace, and self-supervised SimCLR, which learns invariance from augmented views. Experiments on a controlled synthetic dataset (ST1) and a more realistic corpus (CML-TTS) show that SupCon yields the most discriminative and stable speaker embeddings on ST1, achieving the best verification performance (EER=4.70%, MinDCF=0.24). Overall, our findings emphasize (i) the importance of synthetic benchmarks for diagnosing disentanglement under controlled factor variation and (ii) the effectiveness of combining contrastive and adversarial objectives to encourage speaker representations that are both discriminative and less sensitive to environmental factors.

Anthology ID:: 2026.propor-1.58
Volume:: Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
Month:: April
Year:: 2026
Address:: Salvador, Brazil
Editors:: Marlo Souza, Iria de-Dios-Flores, Diana Santos, Larissa Freitas, Jackson Wilke da Cruz Souza, Eugénio Ribeiro
Venue:: PROPOR
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 591–600
Language:
URL:: https://aclanthology.org/2026.propor-1.58/
DOI:
Bibkey:
Cite (ACL):: Ariadne Nascimento Matos, Arnaldo Candido Junior, and Moacir Antonelli Ponti. 2026. Contrastive and Adversarial Disentanglement for Speaker Representations in Brazilian Portuguese. In Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1, pages 591–600, Salvador, Brazil. Association for Computational Linguistics.
Cite (Informal):: Contrastive and Adversarial Disentanglement for Speaker Representations in Brazilian Portuguese (Matos et al., PROPOR 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.propor-1.58.pdf

PDF Cite Search Fix data