Meisyarah Dwiastuti
2019
English-Indonesian Neural Machine Translation for Spoken Language Domains
Meisyarah Dwiastuti
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop
In this work, we conduct a study on Neural Machine Translation (NMT) for English-Indonesian (EN-ID) and Indonesian-English (ID-EN). We focus on spoken language domains, namely colloquial and speech languages. We build NMT systems using the Transformer model for both translation directions and implement domain adaptation, in which we train our pre-trained NMT systems on speech language (in-domain) data. Moreover, we conduct an evaluation on how the domain-adaptation method in our EN-ID system can result in more formal translation outputs.
2017
A Shallow Neural Network for Native Language Identification with Character N-grams
Yunita Sari
|
Muhammad Rifqi Fatchurrahman
|
Meisyarah Dwiastuti
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications
This paper describes the systems submitted by GadjahMada team to the Native Language Identification (NLI) Shared Task 2017. Our models used a continuous representation of character n-grams which are learned jointly with feed-forward neural network classifier. Character n-grams have been proved to be effective for style-based identification tasks including NLI. Results on the test set demonstrate that the proposed model performs very well on essay and fusion tracks by obtaining more than 0.8 on both F-macro score and accuracy.
Search