T. Florian Jaeger

Also published as: Florian Jaeger


2019

pdf bib
Modeling Long-Distance Cue Integration in Spoken Word Recognition
Wednesday Bushong | T. Florian Jaeger
Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics

Cues to linguistic categories are distributed across the speech signal. Optimal categorization thus requires that listeners maintain gradient representations of incoming input in order to integrate that information with later cues. There is now evidence that listeners can and do integrate cues that occur far apart in time. Computational models of this integration have however been lacking. We take a first step at addressing this gap by mathematically formalizing four models of how listeners may maintain and use cue information during spoken language understanding and test them on two perception experiments. In one experiment, we find support for rational integration of cues at long distances. In a second, more memory and attention-taxing experiment, we find evidence in favor of a switching model that avoids maintaining detailed representations of cues in memory. These results are a first step in understanding what kinds of mechanisms listeners use for cue integration under different memory and attentional constraints.

2017

pdf bib
Grounding sound change in ideal observer models of perception
Zachary Burchill | T. Florian Jaeger
Proceedings of the 7th Workshop on Cognitive Modeling and Computational Linguistics (CMCL 2017)

An important predictor of historical sound change, functional load, fails to capture insights from speech perception. Building on ideal observer models of word recognition, we devise a new definition of functional load that incorporates both a priori predictability and perceptual information. We explore this new measure with a simple model and find that it outperforms traditional measures.

2014

pdf bib
Investigating the role of entropy in sentence processing
Tal Linzen | Florian Jaeger
Proceedings of the Fifth Workshop on Cognitive Modeling and Computational Linguistics

pdf bib
Biases in Predicting the Human Language Model
Alex B. Fine | Austin F. Frank | T. Florian Jaeger | Benjamin Van Durme
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)

2011

pdf bib
A Bayesian Belief Updating Model of Phonetic Recalibration and Selective Adaptation
Dave Kleinschmidt | T. Florian Jaeger
Proceedings of the 2nd Workshop on Cognitive Modeling and Computational Linguistics

2010

pdf bib
A Database for the Exploration of Spanish Planning
Carlos Gómez Gallo | T. Florian Jaeger | Katrina Furth
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

We describe a new task-based corpus in the Spanish language. The corpus consists of videos, transcripts, and annotations of the inter- action between a naive speaker and a confederate listener. The speaker instructs the listener to MOVE, ROTATE, or PAINT objects on a computer screen. This resource can be used to study how participants produce instructions in a collaborative goal-oriented scenario, in Spanish. The data set is ideally suited for investigating incremental processes of the production and interpretation of language. We demonstrate here how to use this corpus to explore language-specific differences in utterance planning, for English and Spanish speakers.

pdf bib
Syntactic Adaptation in Language Comprehension
Alex Fine | Ting Qian | T. Florian Jaeger | Robert Jacobs
Proceedings of the 2010 Workshop on Cognitive Modeling and Computational Linguistics

pdf bib
Close = Relevant? The Role of Context in Efficient Language Production
Ting Qian | T. Florian Jaeger
Proceedings of the 2010 Workshop on Cognitive Modeling and Computational Linguistics

2008

pdf bib
Production in a Multimodal Corpus: how Speakers Communicate Complex Actions
Carlos Gómez Gallo | T. Florian Jaeger | James Allen | Mary Swift
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

We describe a new multimodal corpus currently under development. The corpus consists of videos of task-oriented dialogues that are annotated for speaker’s verbal requests and domain action executions. This resource provides data for new research on language production and comprehension. The corpus can be used to study speakers’ decisions as to how to structure their utterances given the complexity of the message they are trying to convey.