Giovana Meloni Craveiro

2026

Robustness and Diversity Evaluation on ProsSegue-ML: a Free Prosodic Segmentation Tool for Brazilian Portuguese
Giovana Meloni Craveiro | Sandra Maria Aluísio
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 2

Prosodic segmentation is the task of dividing a sound unit into smaller units, which can be distinguished between units with a completed idea, marked by TBs, and non-autonomous units, marked by NTBs. It is a useful task to enhance the performance of ASR and TTs systems, and it remains relevant for Brazilian Portuguese due to the diversity of conditions and speaker-related factors that influence its performance. Here, we explore a low-impact, open-source approach based on a Random Forest classifier and a set of features that include fundamental frequency, speech rate, pauses, and energy (Craveiro et al., 2025). We perform a robustness evaluation of the referred ML model, modifying a few conditions on its training, comparing its performance when tested in other datasets, and comparing its results with those of other studies using the same data samples. We experiment with augmenting the training dataset and evaluating how the bias of speaker profile aspects is affected when the size and diversity of the training set are changed. Although we don’t achieve statistically significant values in the bias evaluation, we observe that inequalities grow as the training dataset is expanded with a much larger, but less diverse sample of data.

2024

pdf bib

Simple and Fast Automatic Prosodic Segmentation of Brazilian Portuguese Spontaneous Speech
Giovana Meloni Craveiro | Vinicius Gonçalves Santos | Gabriel Jose Pellisser Dalalana | Flaviane R. Fernandes Svartman | Sandra Maria Aluísio
Proceedings of the 16th International Conference on Computational Processing of Portuguese - Vol. 1

Co-authors

Venues

PROPOR2

Fix author