The Contextualized Representation of Collocation

Liu Daohuan, Tang Xuri


Abstract
“Collocate list and collocation network are two widely used representation methods of colloca-tions, but they have significant weaknesses in representing contextual information. To solve thisproblem, we propose a new representation method, namely the contextualized representation ofcollocate (CRC), which highlights the importance of the position of the collocates and pins acollocate as the interaction of two dimensions: association strength and co-occurrence position. With a full image of all the collocates surrounding the node word, CRC carries the contextualinformation and makes the representation more informative and intuitive. Through three casestudies, i.e., synonym distinction, image analysis, and efficiency in lexical use, we demonstratethe advantages of CRC in practical applications. CRC is also a new quantitative tool to measurelexical usage pattern similarities for corpus-based research. It can provide a new representationframework for language researchers and learners.”
Anthology ID:
2023.ccl-1.71
Volume:
Proceedings of the 22nd Chinese National Conference on Computational Linguistics
Month:
August
Year:
2023
Address:
Harbin, China
Editors:
Maosong Sun, Bing Qin, Xipeng Qiu, Jing Jiang, Xianpei Han
Venue:
CCL
SIG:
Publisher:
Chinese Information Processing Society of China
Note:
Pages:
836–846
Language:
English
URL:
https://aclanthology.org/2023.ccl-1.71
DOI:
Bibkey:
Cite (ACL):
Liu Daohuan and Tang Xuri. 2023. The Contextualized Representation of Collocation. In Proceedings of the 22nd Chinese National Conference on Computational Linguistics, pages 836–846, Harbin, China. Chinese Information Processing Society of China.
Cite (Informal):
The Contextualized Representation of Collocation (Daohuan & Xuri, CCL 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.ccl-1.71.pdf