Sunil Kumar Kopparapu


2021

pdf bib
Impact of Microphone position Measurement Error on Multi Channel Distant Speech Recognition & Intelligibility
Karan Nathwani | Sunil Kumar Kopparapu
Proceedings of the 18th International Conference on Natural Language Processing (ICON)

It was shown in (Raikar et al., 2020) that the measurement error in the microphone position affected the room impulse response (RIR) which in turn affected the single channel speech recognition. In this paper, we ex-tend this to study the more complex and realistic scenario of multi channel distant speech recognition. Specifically we simulate m speakers in a given room with n microphones speaking without overlap. Then channel audio is beamformed and passed through a speech to text (s2t) engine. We compare the s2t accuracy when the microphone locations are known exactly (ground truth) with the s2t accuracy when there is a measurement error in the location of the microphone. We report the performance of an end-to-end s2t on beamformed input in terms of character error rate (CER) and and also speech intelligibility and quality in terms of STOI and PESQ respectively.

2018

pdf bib
Sentiment Analysis using Imperfect Views from Spoken Language and Acoustic Modalities
Imran Sheikh | Sri Harsha Dumpala | Rupayan Chakraborty | Sunil Kumar Kopparapu
Proceedings of Grand Challenge and Workshop on Human Multimodal Language (Challenge-HML)

Multimodal sentiment classification in practical applications may have to rely on erroneous and imperfect views, namely (a) language transcription from a speech recognizer and (b) under-performing acoustic views. This work focuses on improving the representations of these views by performing a deep canonical correlation analysis with the representations of the better performing manual transcription view. Enhanced representations of the imperfect views can be obtained even in absence of the perfect views and give an improved performance during test conditions. Evaluations on the CMU-MOSI and CMU-MOSEI datasets demonstrate the effectiveness of the proposed approach.

2017

pdf bib
Correcting General Purpose ASR Errors using Posteriors
Sunil Kumar Kopparapu | C. Anantaram
Proceedings of the 14th International Conference on Natural Language Processing (ICON-2017)

2016

pdf bib
Mining Call Center Conversations Exhibiting Similar Affective States
Rupayan Chakraborty | Meghna Pandharipande | Sunil Kumar Kopparapu
Proceedings of the 30th Pacific Asia Conference on Language, Information and Computation: Posters

2013

pdf bib
Visual Subtitles for Internet Videos
Chitralekha Bhat | Imran Ahmed | Vikram Saxena | Sunil Kumar Kopparapu
Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies