Large Language Models in Linguistic Research: the Pilot and the Copilot

Svetla Koeva


Abstract
In this paper, we present two experiments focussing on linguistic classification and annotation of examples, using zero-shot prompting. The aim is to show how large language models can confirm or reject the linguistic judgements of experts in order to increase the productivity of their work. In the first experiment, new lexical units evoking a particular FrameNet semantic frame are selected simultaneously with the annotation of examples with the core frame elements. The second experiment attempts to categorise verbs into the aspectual classes, assuming that only certain combinations of verbs belonging to different aspectual classes evoke a semantic frame. The linguistic theories underlying the two experiments, the development of the prompts and the results of the experiments are presented.
Anthology ID:
2024.clib-1.35
Volume:
Proceedings of the Sixth International Conference on Computational Linguistics in Bulgaria (CLIB 2024)
Month:
September
Year:
2024
Address:
Sofia, Bulgaria
Venue:
CLIB
SIG:
Publisher:
Department of Computational Linguistics, Institute for Bulgarian Language, Bulgarian Academy of Sciences
Note:
Pages:
319–328
Language:
URL:
https://aclanthology.org/2024.clib-1.35
DOI:
Bibkey:
Cite (ACL):
Svetla Koeva. 2024. Large Language Models in Linguistic Research: the Pilot and the Copilot. In Proceedings of the Sixth International Conference on Computational Linguistics in Bulgaria (CLIB 2024), pages 319–328, Sofia, Bulgaria. Department of Computational Linguistics, Institute for Bulgarian Language, Bulgarian Academy of Sciences.
Cite (Informal):
Large Language Models in Linguistic Research: the Pilot and the Copilot (Koeva, CLIB 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.clib-1.35.pdf