Rattima Nitisaroj - ACL Anthology

Rattima Nitisaroj

2023

Research interest in task-oriented dialogs has increased as systems such as Google Assistant, Alexa and Siri have become ubiquitous in everyday life. However, the impact of academic research in this area has been limited by the lack of datasets that realistically capture the wide array of user pain points. To enable research on some of the more challenging aspects of parsing realistic conversations, we introduce PRESTO, a public dataset of over 550K contextual multilingual conversations between humans and virtual assistants. PRESTO contains a diverse array of challenges that occur in real-world NLU tasks such as disfluencies, code-switching, and revisions. It is the only large scale human generated conversational parsing dataset that provides structured context such as a user’s contacts and lists for each example. Our mT5 model based baselines demonstrate that the conversational phenomenon present in PRESTO are challenging to model, which is further pronounced in a low-resource setup.

2017

CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies
Daniel Zeman | Martin Popel | Milan Straka | Jan Hajič | Joakim Nivre | Filip Ginter | Juhani Luotolahti | Sampo Pyysalo | Slav Petrov | Martin Potthast | Francis Tyers | Elena Badmaeva | Memduh Gokirmak | Anna Nedoluzhko | Silvie Cinková | Jan Hajič jr. | Jaroslava Hlaváčová | Václava Kettnerová | Zdeňka Urešová | Jenna Kanerva | Stina Ojala | Anna Missilä | Christopher D. Manning | Sebastian Schuster | Siva Reddy | Dima Taji | Nizar Habash | Herman Leung | Marie-Catherine de Marneffe | Manuela Sanguinetti | Maria Simi | Hiroshi Kanayama | Valeria de Paiva | Kira Droganova | Héctor Martínez Alonso | Çağrı Çöltekin | Umut Sulubacak | Hans Uszkoreit | Vivien Macketanz | Aljoscha Burchardt | Kim Harris | Katrin Marheinecke | Georg Rehm | Tolga Kayadelen | Mohammed Attia | Ali Elkahky | Zhuoran Yu | Emily Pitler | Saran Lertpradit | Michael Mandl | Jesse Kirchner | Hector Fernandez Alcalde | Jana Strnadová | Esha Banerjee | Ruli Manurung | Antonio Stella | Atsuko Shimada | Sookyoung Kwak | Gustavo Mendonça | Tatiana Lando | Rattima Nitisaroj | Josie Li
Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies

The Conference on Computational Natural Language Learning (CoNLL) features a shared task, in which participants train and test their learning systems on the same data sets. In 2017, the task was devoted to learning dependency parsers for a large number of languages, in a real-world setting without any gold-standard annotation on input. All test sets followed a unified annotation scheme, namely that of Universal Dependencies. In this paper, we define the task and evaluation methodology, describe how the data sets were prepared, report and analyze the main results, and provide a brief categorization of the different approaches of the participating systems.

2003

Voicing Constraint and Segmental-Tonal Neighborhood Effects on Clusters in Thai
Rattima Nitisaroj
Proceedings of the 17th Pacific Asia Conference on Language, Information and Computation

Co-authors

Aljoscha Burchardt 1

HyunJeong Choe 1

Silvie Cinková 1

Kira Droganova 1

Memduh Gökırmak 1

Jan Hajič jr. 1

Jaroslava Hlaváčová 1

Hiroshi Kanayama 1

Jenna Kanerva 1

Tolga Kayadelen 1

Václava Kettnerová 1

Jesse Kirchner 1

Sookyoung Kwak 1

Tatiana Lando 1

Saran Lertpradit 1

Juhani Luotolahti 1

Vivien Macketanz 1

Michael Mandel 1

Christopher D. Manning 1

Ruli Manurung 1

Katrin Marheinecke 1

Héctor Martínez Alonso 1

Gustavo Mendonca 1

Anna Missilä 1

Anna Nedoluzhko 1

Martin Potthast 1

Sampo Pyysalo 1

Manuela Sanguinetti 1

Sebastian Schuster 1

Atsuko Shimada 1

Antonio Stella 1

Jana Strnadová 1

Umut Sulubacak 1

Anna Trukhina 1

Francis Tyers 1

Zdenka Uresova 1

Hans Uszkoreit 1

Siddharth Vashishtha 1

Marie-Catherine de Marneffe 1

Valeria de Paiva 1

Çağrı Çöltekin 1

Venues