Yoon-Hyung Roh
2019
Data Augmentation by Data Noising for Open-vocabulary Slots in Spoken Language Understanding
Hwa-Yeon Kim | Yoon-Hyung Roh | Young-Kil Kim
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop
Hwa-Yeon Kim | Yoon-Hyung Roh | Young-Kil Kim
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop
One of the main challenges in Spoken Language Understanding (SLU) is dealing with ‘open-vocabulary’ slots. Recently, SLU models based on neural network were proposed, but it is still difficult to recognize the slots of unknown words or ‘open-vocabulary’ slots because of the high cost of creating a manually tagged SLU dataset. This paper proposes data noising, which reflects the characteristics of the ‘open-vocabulary’ slots, for data augmentation. We applied it to an attention based bi-directional recurrent neural network (Liu and Lane, 2016) and experimented with three datasets: Airline Travel Information System (ATIS), Snips, and MIT-Restaurant. We achieved performance improvements of up to 0.57% and 3.25 in intent prediction (accuracy) and slot filling (f1-score), respectively. Our method is advantageous because it does not require additional memory and it can be applied simultaneously with the training process of the model.
2011
Improving PP Attachment Disambiguation in a Rule-based Parser
Yoon-Hyung Roh | Ki-Young Lee | Young-Gil Kim
Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation
Yoon-Hyung Roh | Ki-Young Lee | Young-Gil Kim
Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation
2009
Customizing an English-Korean Machine Translation System for Patent/Technical Documents Translation
Oh-Woog Kwon | Sung-Kwon Choi | Ki-Young Lee | Yoon-Hyung Roh | Young-Gil Kim
Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, Volume 2
Oh-Woog Kwon | Sung-Kwon Choi | Ki-Young Lee | Yoon-Hyung Roh | Young-Gil Kim
Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, Volume 2
Incorporating Statistical Information of Lexical Dependency into a Rule-Based Parser
Yoon-Hyung Roh | Ki-Young Lee | Young-Gil Kim
Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, Volume 2
Yoon-Hyung Roh | Ki-Young Lee | Young-Gil Kim
Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, Volume 2
2008
How to Overcome the Domain Barriers in Pattern-Based Machine Translation System
Sung-Kwon Choi | Ki-Young Lee | Yoon-Hyung Roh | Oh-Woog Kwon | Young-Gil Kim
Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation
Sung-Kwon Choi | Ki-Young Lee | Yoon-Hyung Roh | Oh-Woog Kwon | Young-Gil Kim
Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation
Recognizing Coordinate Structures for Machine Translation of English Patent Documents
Yoon-Hyung Roh | Ki-Young Lee | Sung-Kwon Choi | Oh-Woog Kwon | Young-Gil Kim
Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation
Yoon-Hyung Roh | Ki-Young Lee | Sung-Kwon Choi | Oh-Woog Kwon | Young-Gil Kim
Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation
2007
English-Korean patent system: fromTo-EK/PAT
Oh-Woog Kwon | Sung-Kwon Choi | Ki-Young Lee | Yoon-Hyung Roh | Young-Gil Kim | Munpyo Hong
Proceedings of the Workshop on Patent translation
Oh-Woog Kwon | Sung-Kwon Choi | Ki-Young Lee | Yoon-Hyung Roh | Young-Gil Kim | Munpyo Hong
Proceedings of the Workshop on Patent translation
2003
For the proper treatment of long sentences in a sentence pattern-based English-Korean MT system
Yoon-Hyung Roh | Munpyo Hong | Sung-Kwon Choi | Ki-Young Lee | Sang-Kyu Park
Proceedings of Machine Translation Summit IX: Papers
Yoon-Hyung Roh | Munpyo Hong | Sung-Kwon Choi | Ki-Young Lee | Sang-Kyu Park
Proceedings of Machine Translation Summit IX: Papers
This paper describes a sentence pattern-based English-Korean machine translation system backed up by a rule-based module as a solution to the translation of long sentences. A rule-based English-Korean MT system typically suffers from low translation accuracy for long sentences due to poor parsing performance. In the proposed method we only use chunking information on the phrase-level of the parse result (i.e. NP, PP, and AP). By applying a sentence pattern directly to a chunking result, the high performance of analysis and a good quality of translation are expected. The parsing efficiency problem in the traditional RBMT approach is resolved by sentence partitioning, which is generally assumed to have many problems. However, we will show that the sentence partitioning has little side effect, if any, in our approach, because we use only the chunking results for the transfer. The coverage problem of a pattern-based method is overcome by applying sentence pattern matching recursively to the sub-sentences of the input sentence, in case there is no exact matching pattern to the input sentence.