2024
SCOUT: A Situated and Multi-Modal Human-Robot Dialogue Corpus
Stephanie M. Lukin | Claire Bonial | Matthew Marge | Taylor A. Hudson | Cory J. Hayes | Kimberly Pollard | Anthony Baker | Ashley N. Foots | Ron Artstein | Felix Gervits | Mitchell Abrams | Cassidy Henry | Lucia Donatelli | Anton Leuski | Susan G. Hill | David Traum | Clare Voss
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
We introduce the Situated Corpus Of Understanding Transactions (SCOUT), a multi-modal collection of human-robot dialogue in the task domain of collaborative exploration. The corpus was constructed from multiple Wizard-of-Oz experiments where human participants gave verbal instructions to a remotely located robot to move and gather information about its surroundings. SCOUT contains 89,056 utterances and 310,095 words from 278 dialogues averaging 320 utterances per dialogue. The dialogues are aligned with the multi-modal data streams available during the experiments: 5,785 images and 30 maps. The corpus has been annotated with Abstract Meaning Representation and Dialogue-AMR to identify the speaker’s intent and meaning within an utterance, and with Transactional Units and Relations to track relationships between utterances to reveal patterns of the Dialogue Structure. We describe how the corpus and its annotations have been used to develop autonomous human-robot systems and enable research on open questions of how humans speak to robots. We release this corpus to accelerate progress in autonomous, situated, human-robot dialogue, especially in the context of navigation tasks where details about the environment need to be discovered.
2023
Abstract Meaning Representation for Grounded Human-Robot Communication
Claire Bonial | Julie Foresta | Nicholas C. Fung | Cory J. Hayes | Philip Osteen | Jacob Arkin | Benned Hedegaard | Thomas Howard
Proceedings of the Fourth International Workshop on Designing Meaning Representations
To collaborate effectively in physically situated tasks, robots must be able to ground concepts in natural language to the physical objects in the environment as well as their own capabilities. We describe the implementation and the demonstration of a system architecture that supports tasking robots using natural language. In this architecture, natural language instructions are first handled by a dialogue management component, which provides feedback to the user and passes executable instructions along to an Abstract Meaning Representation (AMR) parser. The parse distills the action primitives and parameters of the instructed behavior in the form of a directed acyclic graph, passed on to the grounding component. We find AMR to be an efficient formalism for grounding the nodes of the graph using a Distributed Correspondence Graph. Thus, in our approach, the concepts of language are grounded to entities in the robot’s world model, which is populated by its sensors, thereby enabling grounded natural language communication. The demonstration of this system will allow users to issue navigation commands in natural language to direct a simulated ground robot (running the Robot Operating System) to various landmarks observed by the user within a simulated environment.
2019
A Research Platform for Multi-Robot Dialogue with Humans
Matthew Marge | Stephen Nogar | Cory J. Hayes | Stephanie M. Lukin | Jesse Bloecker | Eric Holder | Clare Voss
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (Demonstrations)
This paper presents a research platform that supports spoken dialogue interaction with multiple robots. The demonstration showcases our crafted MultiBot testing scenario in which users can verbally issue search, navigate, and follow instructions to two robotic teammates: a simulated ground robot and an aerial robot. This flexible language and robotic platform takes advantage of existing tools for speech recognition and dialogue management that are compatible with new domains, and implements an inter-agent communication protocol (tactical behavior specification), where verbal instructions are encoded for tasks assigned to the appropriate robot.
2018
Dialogue Structure Annotation for Multi-Floor Interaction
David Traum | Cassidy Henry | Stephanie Lukin | Ron Artstein | Felix Gervits | Kimberly Pollard | Claire Bonial | Su Lei | Clare Voss | Matthew Marge | Cory Hayes | Susan Hill
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
ScoutBot: A Dialogue System for Collaborative Navigation
Stephanie M. Lukin | Felix Gervits | Cory J. Hayes | Pooja Moolchandani | Anton Leuski | John G. Rogers III | Carlos Sanchez Amaro | Matthew Marge | Clare R. Voss | David Traum
Proceedings of ACL 2018, System Demonstrations
ScoutBot is a dialogue interface to physical and simulated robots that supports collaborative exploration of environments. The demonstration will allow users to issue unconstrained spoken language commands to ScoutBot. ScoutBot will prompt for clarification if the user’s instruction needs additional input. It is trained on human-robot dialogue collected from Wizard-of-Oz experiments, where robot responses were initiated by a human wizard in previous interactions. The demonstration will show a simulated ground robot (Clearpath Jackal) in a simulated environment supported by ROS (Robot Operating System).
2017
Exploring Variation of Natural Human Commands to a Robot in a Collaborative Navigation Task
Matthew Marge | Claire Bonial | Ashley Foots | Cory Hayes | Cassidy Henry | Kimberly Pollard | Ron Artstein | Clare Voss | David Traum
Proceedings of the First Workshop on Language Grounding for Robotics
Robot-directed communication is variable, and may change based on human perception of robot capabilities. To collect training data for a dialogue system and to investigate possible communication changes over time, we developed a Wizard-of-Oz study that (a) simulates a robot’s limited understanding, and (b) collects dialogues where human participants build a progressively better mental model of the robot’s understanding. With ten participants, we collected ten hours of human-robot dialogue. We analyzed the structure of instructions that participants gave to a remote robot before it responded. Our findings show a general initial preference for including metric information (e.g., move forward 3 feet) over landmarks (e.g., move to the desk) in motion commands, but this decreased over time, suggesting changes in perception.