Compressing Noun Phrases to Discover Mental Constructions in Corpora – A Case Study for Auxiliaries in Hungarian

Balázs Indig, Tímea Borbála Bajzát


Abstract
The quantitative turn in functional linguistics has emphasised the importance of data-oriented methods in describing linguistic patterns. However, there are significant differences between constructions and the examples they cover, which need to be properly formalised. For example, noun chains introduce significant variation in the examples, making it difficult to identify underlying patterns. The compression of noun chains into their minimal form (e.g. as they appear in abstract constructions) is a promising method for revealing linguistic patterns in corpora through their examples. This method, combined with identifying the appropriate level of abstraction for the additional elements present, allows for the systematic extraction of good construction candidates. A pilot has been developed for Hungarian infinitive structures, but is adaptable for various linguistic structures and other agglutinative languages.
Anthology ID:
2024.iwclul-1.12
Volume:
Proceedings of the 9th International Workshop on Computational Linguistics for Uralic Languages
Month:
November
Year:
2024
Address:
Helsinki, Finland
Editors:
Mika Hämäläinen, Flammie Pirinen, Melany Macias, Mario Crespo Avila
Venue:
IWCLUL
SIG:
SIGUR
Publisher:
Association for Computational Linguistics
Note:
Pages:
96–103
Language:
URL:
https://aclanthology.org/2024.iwclul-1.12
DOI:
Bibkey:
Cite (ACL):
Balázs Indig and Tímea Borbála Bajzát. 2024. Compressing Noun Phrases to Discover Mental Constructions in Corpora – A Case Study for Auxiliaries in Hungarian. In Proceedings of the 9th International Workshop on Computational Linguistics for Uralic Languages, pages 96–103, Helsinki, Finland. Association for Computational Linguistics.
Cite (Informal):
Compressing Noun Phrases to Discover Mental Constructions in Corpora – A Case Study for Auxiliaries in Hungarian (Indig & Borbála Bajzát, IWCLUL 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.iwclul-1.12.pdf