Evaluating Inter-Bilingual Semantic Parsing for Indian Languages

Divyanshu Aggarwal, Vivek Gupta, Anoop Kunchukuttan


Abstract
Despite significant progress in Natural Language Generation for Indian languages (IndicNLP), there is a lack of datasets around complex structured tasks such as semantic parsing. One reason for this imminent gap is the complexity of the logical form, which makes English to multilingual translation difficult. The process involves alignment of logical forms, intents and slots with translated unstructured utterance. To address this, we propose an Inter-bilingual Seq2seq Semantic parsing dataset IE-SemParse Suite for 11 distinct Indian languages. We highlight the proposed task’s practicality, and evaluate existing multilingual seq2seq models across several train-test strategies. Our experiment reveals a high correlation across performance of original multilingual semantic parsing datasets (such as mTOP, multilingual TOP and multiATIS++) and our proposed IE-SemParse suite.
Anthology ID:
2023.nlp4convai-1.9
Volume:
Proceedings of the 5th Workshop on NLP for Conversational AI (NLP4ConvAI 2023)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Yun-Nung Chen, Abhinav Rastogi
Venue:
NLP4ConvAI
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
102–122
Language:
URL:
https://aclanthology.org/2023.nlp4convai-1.9
DOI:
10.18653/v1/2023.nlp4convai-1.9
Bibkey:
Cite (ACL):
Divyanshu Aggarwal, Vivek Gupta, and Anoop Kunchukuttan. 2023. Evaluating Inter-Bilingual Semantic Parsing for Indian Languages. In Proceedings of the 5th Workshop on NLP for Conversational AI (NLP4ConvAI 2023), pages 102–122, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Evaluating Inter-Bilingual Semantic Parsing for Indian Languages (Aggarwal et al., NLP4ConvAI 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.nlp4convai-1.9.pdf
Video:
 https://aclanthology.org/2023.nlp4convai-1.9.mp4