A new European Portuguese corpus for the study of Psychosis through speech analysis

Maria Forjó, Daniel Neto, Alberto Abad, HSofia Pinto, Joaquim Gago


Abstract
Psychosis is a clinical syndrome characterized by the presence of symptoms such as hallucinations, thought disorder and disorganized speech. Several studies have used machine learning, combined with speech and natural language processing methods to aid in the diagnosis process of this disease. This paper describes the creation of the first European Portuguese corpus for the identification of the presence of speech characteristics of psychosis, which contains samples of 92 participants, 56 controls and 36 individuals diagnosed with psychosis and medicated. The corpus was used in a set of experiments that allowed identifying the most promising feature set to perform the classification: the combination of acoustic and speech metric features. Several classifiers were implemented to study which ones entailed the best performance depending on the task and feature set. The most promising results obtained for the entire corpus were achieved when identifying individuals with a Multi-Layer Perceptron classifier and reached an 87.5% accuracy. Focusing on the gender dependent results, the overall best results were 90.9% and 82.9% accuracy, for female and male subjects respectively. Lastly, the experiments performed lead us to conjecture that spontaneous speech presents more identifiable characteristics than read speech to differentiate healthy and patients diagnosed with psychosis.
Anthology ID:
2022.lrec-1.793
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
7298–7304
Language:
URL:
https://aclanthology.org/2022.lrec-1.793
DOI:
Bibkey:
Cite (ACL):
Maria Forjó, Daniel Neto, Alberto Abad, HSofia Pinto, and Joaquim Gago. 2022. A new European Portuguese corpus for the study of Psychosis through speech analysis. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 7298–7304, Marseille, France. European Language Resources Association.
Cite (Informal):
A new European Portuguese corpus for the study of Psychosis through speech analysis (Forjó et al., LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.793.pdf