ItGraSyll: A Computational Analysis of Graphical Syllabification and Stress Assignment in Italian

Liviu Dinu, Ioan-Bogdan Iordache, Simona Georgescu, Alina Maria Cristea, Bianca Guita


Abstract
In this paper we build a dataset of Italian syllables. We perform quantitative and qualitative analyses on the syllabification and stress assignment in Italian. We propose a machine learning model, based on deep-learning techniques, for automatically inferring syllabification and stress assignment. For stress prediction we report 94.45% word-level accuracy, and for syllabification we report 98.41% word-level accuracy and 99.82% hyphen-level accuracy.
Anthology ID:
2024.clicit-1.38
Volume:
Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024)
Month:
December
Year:
2024
Address:
Pisa, Italy
Editors:
Felice Dell'Orletta, Alessandro Lenci, Simonetta Montemagni, Rachele Sprugnoli
Venue:
CLiC-it
SIG:
Publisher:
CEUR Workshop Proceedings
Note:
Pages:
316–324
Language:
URL:
https://aclanthology.org/2024.clicit-1.38/
DOI:
Bibkey:
Cite (ACL):
Liviu Dinu, Ioan-Bogdan Iordache, Simona Georgescu, Alina Maria Cristea, and Bianca Guita. 2024. ItGraSyll: A Computational Analysis of Graphical Syllabification and Stress Assignment in Italian. In Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024), pages 316–324, Pisa, Italy. CEUR Workshop Proceedings.
Cite (Informal):
ItGraSyll: A Computational Analysis of Graphical Syllabification and Stress Assignment in Italian (Dinu et al., CLiC-it 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.clicit-1.38.pdf