uppdf
bib
Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning
Amy Qiu
|
Bill Noble
|
David Pagmar
|
Vladislav Maraev
|
Nikolai Ilinykh
pdf
bib
abs
Critical Size Hypothesis: How Model Hyperparameters Correlate with Its Linguistic Abilities
Ekaterina Voloshina
|
Oleg Serikov
In recent years, the models were tested on different probing tasks to examine their language knowledge. However, few researchers explored the very process of models’ language acquisition. Nevertheless, the analysis of language acquisition during training could shed light on the model parameters that help to acquire the language faster. In this work, we experiment with model hyperparameters and reveal that the hidden size is the most essential factor for model language acquisition.
pdf
bib
abs
INIKOL - Collocational Database for Learning Croatian as a Foreign Language
Goranka Blagus Bartolec
|
Gorana Duplančić Rogošić
|
Antonia Ordulj
This paper describes the ongoing work on the INIKOL project - the development of a collocation database for learning Croatian as a foreign language. The main goal of the project is to contribute to easier mastery of collocations as fixed phrases in Croatian as a foreign language.
pdf
bib
abs
How Does an Adjective Sound Like? Exploring Audio Phrase Composition with Textual Embeddings
Saba Nazir
|
Mehrnoosh Sadrzadeh
We learn matrix representations for the fre- quent sound-relevant adjectives of English and compose them with vector representations of their nouns. The matrices are learnt jointly from audio and textual data, via linear regres- sion and tensor skipgram. They are assessed using an adjective similarity benchmark and also a novel adjective-noun phrase similarity dataset, applied to two tasks: semantic similar- ity and audio similarity. Joint learning via Ten- sor Skipgram (TSG) outperforms audio-only models, matrix composition outperforms addi- tion and non compositional phrase vectors.
pdf
bib
abs
Learning through gesture: embodied repetitions in tandem interactions
Loulou Kosmala
Grounded in an interactional approach, this corpus-based study presents an analysis of multimodal tandem interactions held in English between tandem partners (L1 and L2 speakers) to study other-repetitions across different levels and modalities. In particular, I investigate cases of embodied repetitions in contexts of co-construction and repair whereby tandem partners negotiate meaning. Based on careful micro-analyses of data fragments, analyses reveal different types of temporal coordination between the repetition of the target item and/or of the gesture, addressing specific issues at different linguistic levels. While repetitions typically occur in linguistic-oriented contexts, emerging gestures may further contribute to mutual understanding and alignment.
pdf
bib
abs
Towards Automated Game-Based Early Screening for Language Disorder
Hamdan Hamid Al-Ali
|
Elsa Soares
|
Goncalo Leal
|
Rita Valente
|
Nicole Agrela
|
Alexandra Marquis
|
Hanan Aldarmaki
This paper examines the potential of gamifying early childhood language disorder screening to make the process more accessible and scalable. We provide an overview of current practices in screening and assessment, and a description of our on-going work towards automation of early screening. By integrating developmental milestones into a video game format and employing automatic speech recognition and natural language processing, this approach aims to enhance the efficiency and reach of early screening in order to identify children who need further professional assessment.
pdf
bib
abs
L2 Interaction in Heterogeneous Learner Groups during Content and Language Integrated Learning: The Experience of (removed for peer-review) and beyond
Julia Edeleva
|
Martin Neef
|
Jiaming Liu
|
Martin Scheidt
Сontent and language integrated learning is considered a powerful tool to promote inclusion in educational settings of learners for whom the language of instruction is their additional language. Language-related difficulties of those learners have been claimed detrimental for attaining personal educational goals. Academic language places increased cognitive demands on the learning process in general due to 1) its internal complexity; 2) L2 speakers’ lower proficiency; 3) their disadvantage in terms of real-time processing. Facilitators are, therefore, encouraged to integrate interactional CLIL-elements (e.g., scaffolding) during content instruction that provide the necessary pedagogical support for better understanding of disciplinary concepts and their interrelation. In the current contribution, we present the concept and first results of Rail.lexis, a collaborative project of the Department of German Studies and the Department of Railway Engineering at TU Brauschweig. We present and discuss several conversational arrangements (e.g., word guessing games, a differential task matrix) that were designed to engage the learners of heterogeneous linguistic backgrounds in meaningful interactions in subject-specific classes. Subject-specific tasks are gradient regarding their cognitive complexity and the background knowledge required to solve them. Therefore, the linguistic repertoire required to negotiate different task types is also differential to ensure the participation of linguistically diverse students in language-enhanced classroom interactions.
pdf
bib
abs
Fifty shapes of BLiMP: syntactic learning curves in language models are not uniform, but sometimes unruly
Bastian Bunzeck
|
Sina Zarrieß
Syntactic learning curves in LMs are usually reported as relatively stable and power law-shaped. By analyzing the learning curves of different LMs on various syntactic phenomena using both small self-trained llama models and larger pre-trained pythia models, we show that while many phenomena do follow typical power law curves, others exhibit S-shaped, U-shaped, or erratic patterns. Certain syntactic paradigms remain challenging even for large models, resulting in persistent preference for ungrammatical sentences. Most phenomena show similar curves for their paradigms, but the existence of diverging patterns and oscillations indicates that average curves mask important developments, underscoring the need for more detailed analyses of individual learning trajectories.
pdf
bib
abs
Not Just Semantics: Word Meaning Negotiation in Social Media and Spoken Interaction
Staffan Larsson
|
Jenny Myrendal
|
Bill Noble
This paper outlines the ongoing research project “Not Just Semantics: Word Meaning Negotiation in Social Media and Spoken Interaction”. The goal of the project is to investigate how meanings of words (and phrases) are interactively negotiated in social media and in spoken interaction. This project will contribute towards a comprehensive theory of word meaning negotiation.
pdf
bib
abs
Toward Real Time Word Based Prosody Recognition
Alex Tilson
|
Frank Foerster
Prosodic salience is a heuristic based on word-level prosody in child-directed speech that is thought to serve as a cue for attentional focus. It has been used in the context of robotic language acquisition to extract the contextually most relevant words from a human tutor’s speech to ground them in a robot’s sensorimotor data. However, the pipeline for performing word-based prosody-recognition operated in a semi-automatic manner and required substantial manual effort. We describe our efforts to automate the existing pipeline by including real time prosody recognition, and a modern speech recognition and forced alignment model. The intention is to enable its use in real time for human-in-the-loop robotic language acquisition and other socially driven forms of online learning.