Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning

Amy Qiu, Bill Noble, David Pagmar, Vladislav Maraev, Nikolai Ilinykh (Editors)


Anthology ID:
2024.clasp-1
Month:
October
Year:
2024
Address:
Gothenburg, Sweden
Venue:
CLASP
SIG:
SIGSEM
Publisher:
Association for Computational Linguistics
URL:
https://aclanthology.org/2024.clasp-1/
PDF:
https://aclanthology.org/2024.clasp-1.pdf

Critical Size Hypothesis: How Model Hyperparameters Correlate with Its Linguistic Abilities
Ekaterina Voloshina | Oleg Serikov

In recent years, models have been tested on a range of probing tasks to examine their linguistic knowledge. However, few researchers have explored the process of models' language acquisition itself. Analyzing language acquisition during training could shed light on the model parameters that help a model acquire language faster. In this work, we experiment with model hyperparameters and find that hidden size is the most important factor in model language acquisition.

INIKOL - Collocational Database for Learning Croatian as a Foreign Language
Goranka Blagus Bartolec | Gorana Duplančić Rogošić | Antonia Ordulj

This paper describes the ongoing work on the INIKOL project - the development of a collocation database for learning Croatian as a foreign language. The main goal of the project is to contribute to easier mastery of collocations as fixed phrases in Croatian as a foreign language.

How Does an Adjective Sound Like? Exploring Audio Phrase Composition with Textual Embeddings
Saba Nazir | Mehrnoosh Sadrzadeh

We learn matrix representations for the frequent sound-relevant adjectives of English and compose them with vector representations of their nouns. The matrices are learnt jointly from audio and textual data, via linear regression and tensor skipgram. They are assessed using an adjective similarity benchmark and also a novel adjective-noun phrase similarity dataset, applied to two tasks: semantic similarity and audio similarity. Joint learning via Tensor Skipgram (TSG) outperforms audio-only models, matrix composition outperforms addition and non-compositional phrase vectors.

Learning through gesture: embodied repetitions in tandem interactions
Loulou Kosmala

Grounded in an interactional approach, this corpus-based study presents an analysis of multimodal tandem interactions held in English between tandem partners (L1 and L2 speakers) to study other-repetitions across different levels and modalities. In particular, I investigate cases of embodied repetitions in contexts of co-construction and repair whereby tandem partners negotiate meaning. Based on careful micro-analyses of data fragments, analyses reveal different types of temporal coordination between the repetition of the target item and/or of the gesture, addressing specific issues at different linguistic levels. While repetitions typically occur in linguistic-oriented contexts, emerging gestures may further contribute to mutual understanding and alignment.

Towards Automated Game-Based Early Screening for Language Disorder
Hamdan Hamid Al-Ali | Elsa Soares | Goncalo Leal | Rita Valente | Nicole Agrela | Alexandra Marquis | Hanan Aldarmaki

This paper examines the potential of gamifying early childhood language disorder screening to make the process more accessible and scalable. We provide an overview of current practices in screening and assessment, and a description of our on-going work towards automation of early screening. By integrating developmental milestones into a video game format and employing automatic speech recognition and natural language processing, this approach aims to enhance the efficiency and reach of early screening in order to identify children who need further professional assessment.

L2 Interaction in Heterogeneous Learner Groups during Content and Language Integrated Learning: The Experience of (removed for peer-review) and beyond
Julia Edeleva | Martin Neef | Jiaming Liu | Martin Scheidt

Content and language integrated learning is considered a powerful tool to promote inclusion in educational settings of learners for whom the language of instruction is their additional language. Language-related difficulties of those learners have been claimed to be detrimental to attaining personal educational goals. Academic language places increased cognitive demands on the learning process in general due to 1) its internal complexity; 2) L2 speakers’ lower proficiency; 3) their disadvantage in terms of real-time processing. Facilitators are, therefore, encouraged to integrate interactional CLIL elements (e.g., scaffolding) during content instruction that provide the necessary pedagogical support for better understanding of disciplinary concepts and their interrelation. In the current contribution, we present the concept and first results of Rail.lexis, a collaborative project of the Department of German Studies and the Department of Railway Engineering at TU Braunschweig. We present and discuss several conversational arrangements (e.g., word guessing games, a differential task matrix) that were designed to engage learners of heterogeneous linguistic backgrounds in meaningful interactions in subject-specific classes. Subject-specific tasks are gradient regarding their cognitive complexity and the background knowledge required to solve them. Therefore, the linguistic repertoire required to negotiate different task types is also differential to ensure the participation of linguistically diverse students in language-enhanced classroom interactions.

Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly
Bastian Bunzeck | Sina Zarrieß

Syntactic learning curves in LMs are usually reported as relatively stable and power law-shaped. By analyzing the learning curves of different LMs on various syntactic phenomena using both small self-trained llama models and larger pre-trained pythia models, we show that while many phenomena do follow typical power law curves, others exhibit S-shaped, U-shaped, or erratic patterns. Certain syntactic paradigms remain challenging even for large models, resulting in persistent preference for ungrammatical sentences. Most phenomena show similar curves for their paradigms, but the existence of diverging patterns and oscillations indicates that average curves mask important developments, underscoring the need for more detailed analyses of individual learning trajectories.

Not Just Semantics: Word Meaning Negotiation in Social Media and Spoken Interaction
Staffan Larsson | Jenny Myrendal | Bill Noble

This paper outlines the ongoing research project “Not Just Semantics: Word Meaning Negotiation in Social Media and Spoken Interaction”. The goal of the project is to investigate how meanings of words (and phrases) are interactively negotiated in social media and in spoken interaction. This project will contribute towards a comprehensive theory of word meaning negotiation.

Toward Real Time Word Based Prosody Recognition
Alex Tilson | Frank Foerster

Prosodic salience is a heuristic based on word-level prosody in child-directed speech that is thought to serve as a cue for attentional focus. It has been used in the context of robotic language acquisition to extract the contextually most relevant words from a human tutor’s speech to ground them in a robot’s sensorimotor data. However, the pipeline for performing word-based prosody recognition operated in a semi-automatic manner and required substantial manual effort. We describe our efforts to automate the existing pipeline by including real-time prosody recognition and a modern speech recognition and forced alignment model. The intention is to enable its use in real time for human-in-the-loop robotic language acquisition and other socially driven forms of online learning.