Andrej Ljolje

Also published as: A Ljolje, A. Ljolje


2023

pdf bib
1-step Speech Understanding and Transcription Using CTC Loss
Karan Singla | Shahab Jalalv | Yeon-Jun Kim | Andrej Ljolje | Antonio Moreno Daniel | Srinivas Bangalore | Benjamin Stern
Proceedings of the 20th International Conference on Natural Language Processing (ICON)

Recent studies have made some progress in refining end-to-end (E2E) speech recognition encoders by applying Connectionist Temporal Classification (CTC) loss to enhance named entity recognition within transcriptions. However, these methods have been constrained by their exclusive use of the ASCII character set, allowing only a limited array of semantic labels. We propose 1SPU, a 1-step Speech Processing Unit which can recognize speech events (e.g: speaker change) or an NL event (Intent, Emotion) while also transcribing vocal content. It extends the E2E automatic speech recognition (ASR) system’s vocabulary by adding a set of unused placeholder symbols, conceptually akin to the <pad> tokens used in sequence modeling. These placeholders are then assigned to represent semantic events (in form of tags) and are integrated into the transcription process as distinct tokens. We demonstrate notable improvements on the SLUE benchmark and yields results that are on par with those for the SLURP dataset. Additionally, we provide a visual analysis of the system’s proficiency in accurately pinpointing meaningful tokens over time, illustrating the enhancement in transcription quality through the utilization of supplementary semantic tags.

2013

pdf bib
Segmentation Strategies for Streaming Speech Translation
Vivek Kumar Rangarajan Sridhar | John Chen | Srinivas Bangalore | Andrej Ljolje | Rathinavelu Chengalvarayan
Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

1991

pdf bib
Lexical Access With a Statistically-Derived Phonetic Network
Michael D. Riley | Andrej Ljolje
Speech and Natural Language: Proceedings of a Workshop Held at Pacific Grove, California, February 19-22, 1991

1990

pdf bib
Continuous Speech Recognition from a Phonetic Transcription
S. E. Levinson | A. Ljolje | L. G. Miller
Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, Pennsylvania, June 24-27,1990

1989

pdf bib
Speaker Independent Phonetic Transcription of Fluent Speech for Large Vocabulary Speech Recognition
S. E. Levinson | M. Y. Liberman | A. Ljolje | L. G. Miller
Speech and Natural Language: Proceedings of a Workshop Held at Philadelphia, Pennsylvania, February 21-23, 1989

pdf bib
Continuous Speech Recognition from Phonetic Transcription
S. E. Levinson | A Ljolje
Speech and Natural Language: Proceedings of a Workshop Held at Cape Cod, Massachusetts, October 15-18, 1989