@inproceedings{anastasia-etal-2026-multimodal,
title = "A Multimodal Framework for Aphasia Severity Classification in {R}ussian",
author = "Anastasia, Kolmogorova and
Yavshitz, Ekaterina and
Margolina, Anastasia and
Sugian, Anna",
editor = {Danilova, Vera and
Kurfal{\i}, Murathan and
S{\"o}derfeldt, Ylva and
Reed, Julia and
Burchell, Andrew},
booktitle = "Proceedings of the 1st Workshop on Linguistic Analysis for Health ({H}ea{L}ing 2026)",
month = mar,
year = "2026",
address = "Rabat, Morocco",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2026.healing-1.22/",
pages = "257--265",
isbn = "979-8-89176-367-8",
abstract = "Automatic classification of aphasia severity presents persistent challenges, particularly for languages with limited clinical speech resources such as Russian. This paper explores a multimodal approach to severity estimation that combines acoustic and semantic representations of pathological speech. Acoustic features are extracted using pretrained Wav2Vec 2.0 models, while semantic information is obtained from the encoder of the Whisper model. The two representations are integrated via early feature fusion and evaluated using gradient boosting classifiers in a speaker-independent cross-validation setting. Experiments are conducted on a newly collected dataset of Russian speech recordings from patients with aphasia and neurotypical speakers (RuAphasiaBank). The results suggest that the combined use of acoustic and semantic embeddings can provide more stable severity estimates than unimodal baselines. This study contributes empirical evidence on the applicability of multimodal representation learning for aphasia severity classification under data-scarce conditions."
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="anastasia-etal-2026-multimodal">
<titleInfo>
<title>A Multimodal Framework for Aphasia Severity Classification in Russian</title>
</titleInfo>
<name type="personal">
<namePart type="given">Anastasia</namePart>
<namePart type="family">Kolmogorova</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Ekaterina</namePart>
<namePart type="family">Yavshitz</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Anastasia</namePart>
<namePart type="family">Margolina</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Anna</namePart>
<namePart type="family">Sugian</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2026-03</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the 1st Workshop on Linguistic Analysis for Health (HeaLing 2026)</title>
</titleInfo>
<name type="personal">
<namePart type="given">Vera</namePart>
<namePart type="family">Danilova</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Murathan</namePart>
<namePart type="family">Kurfalı</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Ylva</namePart>
<namePart type="family">Söderfeldt</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Julia</namePart>
<namePart type="family">Reed</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Andrew</namePart>
<namePart type="family">Burchell</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Rabat, Morocco</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-367-8</identifier>
</relatedItem>
<abstract>Automatic classification of aphasia severity presents persistent challenges, particularly for languages with limited clinical speech resources such as Russian. This paper explores a multimodal approach to severity estimation that combines acoustic and semantic representations of pathological speech. Acoustic features are extracted using pretrained Wav2Vec 2.0 models, while semantic information is obtained from the encoder of the Whisper model. The two representations are integrated via early feature fusion and evaluated using gradient boosting classifiers in a speaker-independent cross-validation setting. Experiments are conducted on a newly collected dataset of Russian speech recordings from patients with aphasia and neurotypical speakers (RuAphasiaBank). The results suggest that the combined use of acoustic and semantic embeddings can provide more stable severity estimates than unimodal baselines. This study contributes empirical evidence on the applicability of multimodal representation learning for aphasia severity classification under data-scarce conditions.</abstract>
<identifier type="citekey">anastasia-etal-2026-multimodal</identifier>
<location>
<url>https://aclanthology.org/2026.healing-1.22/</url>
</location>
<part>
<date>2026-03</date>
<extent unit="page">
<start>257</start>
<end>265</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T A Multimodal Framework for Aphasia Severity Classification in Russian
%A Kolmogorova, Anastasia
%A Yavshitz, Ekaterina
%A Margolina, Anastasia
%A Sugian, Anna
%Y Danilova, Vera
%Y Kurfalı, Murathan
%Y Söderfeldt, Ylva
%Y Reed, Julia
%Y Burchell, Andrew
%S Proceedings of the 1st Workshop on Linguistic Analysis for Health (HeaLing 2026)
%D 2026
%8 March
%I Association for Computational Linguistics
%C Rabat, Morocco
%@ 979-8-89176-367-8
%F anastasia-etal-2026-multimodal
%X Automatic classification of aphasia severity presents persistent challenges, particularly for languages with limited clinical speech resources such as Russian. This paper explores a multimodal approach to severity estimation that combines acoustic and semantic representations of pathological speech. Acoustic features are extracted using pretrained Wav2Vec 2.0 models, while semantic information is obtained from the encoder of the Whisper model. The two representations are integrated via early feature fusion and evaluated using gradient boosting classifiers in a speaker-independent cross-validation setting. Experiments are conducted on a newly collected dataset of Russian speech recordings from patients with aphasia and neurotypical speakers (RuAphasiaBank). The results suggest that the combined use of acoustic and semantic embeddings can provide more stable severity estimates than unimodal baselines. This study contributes empirical evidence on the applicability of multimodal representation learning for aphasia severity classification under data-scarce conditions.
%U https://aclanthology.org/2026.healing-1.22/
%P 257-265
Markdown (Informal)
[A Multimodal Framework for Aphasia Severity Classification in Russian](https://aclanthology.org/2026.healing-1.22/) (Kolmogorova et al., HeaLing 2026)
ACL