ACL-IJCNLP 2009
NEWS 2009
2009 Named Entities Workshop:
Shared Task on Transliteration
Proceedings of the Workshop
7 August 2009
Table of Contents
Report of NEWS 2009 Machine Transliteration Shared
Task
Haizhou Li, A Kumaran, Vladimir Pervouchine and Min Zhang......................................... 1
Whitepaper of NEWS 2009 Machine Transliteration
Shared Task
Haizhou Li, A Kumaran, Min Zhang and Vladimir Pervouchine....................................... 19
Automata for Transliteration and Machine
Translation
Kevin Knight..................................................................................................................... 27
DirecTL: a Language Independent Approach to
Transliteration
Sittichai Jiampojamarn, Aditya Bhargava, Qing Dou, Kenneth Dwyer and Grzegorz Kondrak.... 28
Named Entity Transcription with Pair n-Gram Models
Martin Jansche and Richard Sproat.................................................................................... 32
Jong-Hoon Oh, Kiyotaka Uchimoto and Kentaro Torisawa............................................... 36
A Language-Independent Transliteration Schema
Using Character Aligned Models at NEWS 2009
Praneeth Shishtla, Surya Ganesh V, Sethuramalingam Subramaniam and Vasudeva Varma.......... 40
Experiences with English-Hindi, English-Tamil and
English-Kannada Transliteration Tasks at NEWS 2009
Manoj Kumar Chinnakotla and
Testing and Performance Evaluation of Machine Transliteration
System for Tamil Language
Kommaluri Vijayanand, Inampudi Ramesh Babu and Poonguzhali Sandiran.................... 48
Transliteration by Bidirectional Statistical Machine
Translation
Andrew Finch and Eiichiro Sumita.................................................................................... 52
Transliteration of Name Entity via Improved
Statistical Translation on Character Sequences
Yan Song, Chunyu Kit and Xiao Chen.............................................................................. 57
Learning Multi Character Alignment Rules and
Classification of Training Data for Transliteration
Dipankar Bose and Sudeshna Sarkar................................................................................. 61
Fast Decoding and Easy Implementation:
Transliteration as Sequential Labeling
Eiji Aramaki and Takeshi Abekawa................................................................................... 65
Colin Cherry and Hisami Suzuki....................................................................................... 69
Dong Yang, Paul Dixon, Yi-Cheng Pan, Tasuku Oonishi, Masanobu Nakamura and Sadaoki Furui.... 72
Oi Yee Kwong................................................................................................................... 76
English to Hindi Machine Transliteration System at
NEWS 2009
Amitava Das, Asif Ekbal, Tapabrata Mondal and Sivaji Bandyopadhyay.......................... 80
Improving Transliteration Accuracy Using
Word-Origin Detection and Lexicon Lookup
Mitesh Khapra and Pushpak Bhattacharyya....................................................................... 84
A Noisy Channel Model for Grapheme-based Machine
Transliteration
Jia Yuxiang, Zhu Danqing and Yu Shiwen........................................................................ 88
Substring-based Transliteration with Conditional
Random Fields
Sravana Reddy and Sonjia Waxmonsky............................................................................ 92
A Syllable-based Name Transliteration System
Xue Jiang, Le Sun and Dakun Zhang................................................................................ 96
Transliteration System Using Pair HMM with
Weighted FSTs
Peter Nabende.................................................................................................................. 100
English-Hindi Transliteration Using Context-Informed
PB-SMT: the DCU System for NEWS 2009
Rejwanul Haque, Sandipan Dandapat, Ankit Kumar Srivastava, Sudip Kumar Naskar and Way
A Hybrid Approach to English-Korean Name
Transliteration
Gumwon Hong, Min-Jeong Kim, Do-Gil Lee and Hae-Chang Rim................................. 108
Language Independent Transliteration System Using Phrase-based
SMT Approach on Substrings
Sara Noeman.................................................................................................................... 112
Combining MDL Transliteration Training with
Discriminative Modeling
Dmitry Zelenko............................................................................................................... 116
ε-extension Hidden Markov Models and Weighted Transducers
for Machine Transliteration
Balakrishnan Vardarajan and Delip Rao.......................................................................... 120
Modeling Machine Transliteration as a Phrase Based
Statistical Machine Translation Problem
Taraka Rama and Karthik Gali......................................................................................... 124
Maximum n-Gram HMM-based Name Transliteration:
Experiment in NEWS 2009 on English-Chinese
Corpus
Yilu Zhou......................................................................................................................... 128
Name Transliteration with Bidirectional Perceptron
Edit Models
Dayne Freitag and Zhiqiang Wang................................................................................... 132
Bridging Languages by SuperSense Entity Tagging
Davide Picca, Alfio Massimiliano Gliozzo and Simone Campora.................................... 136
Chinese-English Organization Name Translation Based on
Correlative Expansion
Feiliang Ren, Muhua Zhu, Huizhen Wang and Jingbo Zhu.............................................. 143
Name Matching between Roman and Chinese Scripts:
Machine Complements Human
Ken Samuel, Alan Rubenstein, Sherri Condon and Alex Yeh.......................................... 152
Analysis and Robust Extraction of Changing Named
Entities
Masatoshi Tsuchiya, Shoko Endo and Seiichi Nakagawa................................................. 161
Tag Confidence Measure for Semi-Automatically
Updating Named Entity Recognition
Kuniko Saito and Kenji Imamura..................................................................................... 168
A Hybrid Model for Urdu Hindi Transliteration
Abbas Malik, Laurent Besacier, Christian Boitet and Pushpak Bhattacharyya................. 177
Graphemic Approximation of Phonological Context
for English-Chinese Transliteration
Oi Yee Kwong................................................................................................................. 186
Czech Named Entity Corpus and SVM-based
Recognizer
Jana Kravalová and Zdeněk Žabokrtský.......................................................................... 194
Voted NER System using Appropriate Unlabeled Data
Asif Ekbal and Sivaji Bandyopadhyay............................................................................ 202