From ‘It’s All Greek to Me’ to ‘Nur Bahnhof Verstehen’: An Investigation of mBERT’s Cross-Linguistic Capabilities

Aria Rastegar, Pegah Ramezani


Abstract
This study investigates the impact of cross-linguistic similarities on idiom representation in mBERT, focusing on English and German idioms categorized by different degrees of similarity. We aim to determine whether different degrees of cross-linguistic similarities significantly affect mBERT’s representations and to observe how these representations change across its 12 layers. Contrary to our initial hypothesis, cross-linguistic similarity did not uniformly impact idiom representations across all layers. While early and middle layers showed no significant differences among idiom categories, higher layers (from Layer 8 onwards) revealed more nuanced processing. Specifically, significant differences between the control category and idioms with similar meaning (SM), as well as between idioms with similar lexical items (SL) and those with similar semantics (SM) were observed. Our analysis revealed that early layers provided general representations, while higher layers showed increased differentiation between literal and figurative meanings. This was evidenced by a general decrease in cosine similarities from Layer 5 onwards, with Layer 8 demonstrating the lowest cosine similarities across all categories. Interestingly, a trend suggests that mBERT performs slightly better with more literal hints. The order of cosine similarity for the categorizations was: idioms with a degree of formal similarity, control idioms, idioms with both formal and semantic similarity, and finally idioms with only semantic similarity. These findings indicate that mBERT’s processing of idioms evolves significantly across its layers, with cross-linguistic might affect more significantly in higher layers where more abstract semantic processing likely occurs.
Anthology ID:
2024.clicit-1.87
Volume:
Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024)
Month:
December
Year:
2024
Address:
Pisa, Italy
Editors:
Felice Dell'Orletta, Alessandro Lenci, Simonetta Montemagni, Rachele Sprugnoli
Venue:
CLiC-it
SIG:
Publisher:
CEUR Workshop Proceedings
Note:
Pages:
805–812
Language:
URL:
https://aclanthology.org/2024.clicit-1.87/
DOI:
Bibkey:
Cite (ACL):
Aria Rastegar and Pegah Ramezani. 2024. From ‘It’s All Greek to Me’ to ‘Nur Bahnhof Verstehen’: An Investigation of mBERT’s Cross-Linguistic Capabilities. In Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024), pages 805–812, Pisa, Italy. CEUR Workshop Proceedings.
Cite (Informal):
From ‘It’s All Greek to Me’ to ‘Nur Bahnhof Verstehen’: An Investigation of mBERT’s Cross-Linguistic Capabilities (Rastegar & Ramezani, CLiC-it 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.clicit-1.87.pdf