@inproceedings{mathew-etal-2021-multi,
title = "Multi-modal Intent Classification for Assistive Robots with Large-scale Naturalistic Datasets",
author = "Mathew, Karun Varghese and
Tarigoppula, Venkata S Aditya and
Frermann, Lea",
editor = "Rahimi, Afshin and
Lane, William and
Zuccon, Guido",
booktitle = "Proceedings of the 19th Annual Workshop of the Australasian Language Technology Association",
month = dec,
year = "2021",
address = "Online",
publisher = "Australasian Language Technology Association",
url = "https://aclanthology.org/2021.alta-1.5",
pages = "47--57",
abstract = "Recent years have brought a tremendous growth in assistive robots/prosthetics for people with partial or complete loss of upper limb control. These technologies aim to help the users with various reaching and grasping tasks in their daily lives such as picking up an object and transporting it to a desired location; and their utility critically depends on the ease and effectiveness of communication between the user and robot. One of the natural ways of communicating with assistive technologies is through verbal instructions. The meaning of natural language commands depends on the current configuration of the surrounding environment and needs to be interpreted in this multi-modal context, as accurate interpretation of the command is essential for a successful execution of the user's intent by an assistive device. The research presented in this paper demonstrates how large-scale situated natural language datasets can support the development of robust assistive technologies. We leveraged a navigational dataset comprising {\textgreater}25k human-provided natural language commands covering diverse situations. We demonstrated a way to extend the dataset in a task-informed way and use it to develop multi-modal intent classifiers for pick and place tasks. Our best classifier reached {\textgreater}98{\%} accuracy in a 16-way multi-modal intent classification task, suggesting high robustness and flexibility.",
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="mathew-etal-2021-multi">
<titleInfo>
<title>Multi-modal Intent Classification for Assistive Robots with Large-scale Naturalistic Datasets</title>
</titleInfo>
<name type="personal">
<namePart type="given">Karun</namePart>
<namePart type="given">Varghese</namePart>
<namePart type="family">Mathew</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Venkata</namePart>
<namePart type="given">S</namePart>
<namePart type="given">Aditya</namePart>
<namePart type="family">Tarigoppula</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Lea</namePart>
<namePart type="family">Frermann</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2021-12</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the 19th Annual Workshop of the Australasian Language Technology Association</title>
</titleInfo>
<name type="personal">
<namePart type="given">Afshin</namePart>
<namePart type="family">Rahimi</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">William</namePart>
<namePart type="family">Lane</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Guido</namePart>
<namePart type="family">Zuccon</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Australasian Language Technology Association</publisher>
<place>
<placeTerm type="text">Online</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
</relatedItem>
<abstract>Recent years have brought a tremendous growth in assistive robots/prosthetics for people with partial or complete loss of upper limb control. These technologies aim to help the users with various reaching and grasping tasks in their daily lives such as picking up an object and transporting it to a desired location; and their utility critically depends on the ease and effectiveness of communication between the user and robot. One of the natural ways of communicating with assistive technologies is through verbal instructions. The meaning of natural language commands depends on the current configuration of the surrounding environment and needs to be interpreted in this multi-modal context, as accurate interpretation of the command is essential for a successful execution of the user's intent by an assistive device. The research presented in this paper demonstrates how large-scale situated natural language datasets can support the development of robust assistive technologies. We leveraged a navigational dataset comprising >25k human-provided natural language commands covering diverse situations. We demonstrated a way to extend the dataset in a task-informed way and use it to develop multi-modal intent classifiers for pick and place tasks. Our best classifier reached >98% accuracy in a 16-way multi-modal intent classification task, suggesting high robustness and flexibility.</abstract>
<identifier type="citekey">mathew-etal-2021-multi</identifier>
<location>
<url>https://aclanthology.org/2021.alta-1.5</url>
</location>
<part>
<date>2021-12</date>
<extent unit="page">
<start>47</start>
<end>57</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T Multi-modal Intent Classification for Assistive Robots with Large-scale Naturalistic Datasets
%A Mathew, Karun Varghese
%A Tarigoppula, Venkata S. Aditya
%A Frermann, Lea
%Y Rahimi, Afshin
%Y Lane, William
%Y Zuccon, Guido
%S Proceedings of the 19th Annual Workshop of the Australasian Language Technology Association
%D 2021
%8 December
%I Australasian Language Technology Association
%C Online
%F mathew-etal-2021-multi
%X Recent years have brought a tremendous growth in assistive robots/prosthetics for people with partial or complete loss of upper limb control. These technologies aim to help the users with various reaching and grasping tasks in their daily lives such as picking up an object and transporting it to a desired location; and their utility critically depends on the ease and effectiveness of communication between the user and robot. One of the natural ways of communicating with assistive technologies is through verbal instructions. The meaning of natural language commands depends on the current configuration of the surrounding environment and needs to be interpreted in this multi-modal context, as accurate interpretation of the command is essential for a successful execution of the user's intent by an assistive device. The research presented in this paper demonstrates how large-scale situated natural language datasets can support the development of robust assistive technologies. We leveraged a navigational dataset comprising >25k human-provided natural language commands covering diverse situations. We demonstrated a way to extend the dataset in a task-informed way and use it to develop multi-modal intent classifiers for pick and place tasks. Our best classifier reached >98% accuracy in a 16-way multi-modal intent classification task, suggesting high robustness and flexibility.
%U https://aclanthology.org/2021.alta-1.5
%P 47-57
Markdown (Informal)
[Multi-modal Intent Classification for Assistive Robots with Large-scale Naturalistic Datasets](https://aclanthology.org/2021.alta-1.5) (Mathew et al., ALTA 2021)
ACL