2024
Exploring Semantics in Pretrained Language Model Attention
Frédéric Charpentier | Jairo Cugliari | Adrien Guille
Proceedings of the 13th Joint Conference on Lexical and Computational Semantics (*SEM 2024)
Abstract Meaning Representations (AMRs) encode the semantics of sentences in the form of graphs. Vertices represent instances of concepts, and labeled edges represent semantic relations between those instances. Language models (LMs) operate by computing, at each layer, the weights of the edges of a complete graph whose vertices are the words of a sentence or a whole paragraph. In this work, we investigate the ability of the attention heads of two LMs, RoBERTa and GPT2, to detect the semantic relations encoded in an AMR. This is an attempt to show the semantic capabilities of these models without fine-tuning. To do so, we apply both unsupervised and supervised learning techniques.
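As a minimal sketch of the idea described in the abstract (not the authors' code), the snippet below reads per-head attention weights from a pretrained RoBERTa model and treats each head's attention matrix as the edge weights of a complete graph over the tokens of a sentence. The model name "roberta-base", the example sentence, and the choice of layer and head are illustrative assumptions.

```python
import torch
from transformers import AutoTokenizer, AutoModel

# Load a pretrained LM with attention outputs enabled ("roberta-base" is an assumption).
tokenizer = AutoTokenizer.from_pretrained("roberta-base")
model = AutoModel.from_pretrained("roberta-base", output_attentions=True)
model.eval()

sentence = "The boy wants to go."  # illustrative example sentence
inputs = tokenizer(sentence, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# outputs.attentions is a tuple with one tensor per layer,
# each of shape (batch, num_heads, seq_len, seq_len).
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
layer, head = 0, 0  # arbitrary head, for illustration only
weights = outputs.attentions[layer][0, head]  # (seq_len, seq_len) edge weights

# weights[i, j] is the weight of the directed edge from token i to token j in this
# head's complete graph; these are the quantities one can compare against the
# labeled edges of an AMR graph.
for i, tok_i in enumerate(tokens):
    j = int(weights[i].argmax())
    print(f"{tok_i:>12} -> {tokens[j]}")
```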