Minghao Guo


2023

pdf bib
OpenRT: An Open-source Framework for Reasoning Over Tabular Data
Yilun Zhao | Boyu Mi | Zhenting Qi | Linyong Nan | Minghao Guo | Arman Cohan | Dragomir Radev
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)

There are a growing number of table pre-training methods proposed for reasoning over tabular data (e.g., question answering, fact checking, and faithful text generation). However, most existing methods are benchmarked solely on a limited number of datasets, varying in configuration, which leads to a lack of unified, standardized, fair, and comprehensive comparison between methods. This paper presents OpenRT, the first open-source framework for reasoning over tabular data, to reproduce existing table pre-training models for performance comparison and develop new models quickly. We implemented and compared six table pre-training models on four question answering, one fact checking, and one faithful text generation datasets. Moreover, to enable the community to easily construct new table reasoning datasets, we developed TaRAT, an annotation tool which supports multi-person collaborative annotations for various kinds of table reasoning tasks. The researchers are able to deploy the newly-constructed dataset to OpenRT and compare the performances of different baseline systems.

2020

pdf bib
发音属性优化建模及其在偏误检测的应用(Speech attributes optimization modeling and application in mispronunciation detection)
Minghao Guo (郭铭昊) | Yanlu Xie (解焱陆)
Proceedings of the 19th Chinese National Conference on Computational Linguistics

近年来,发音属性常常被用于计算机辅助发音训练系统(CAPT)中。本文针对使用发音属性的一些难点,提出了一种建模细颗粒度发音属性(FSA)的方法,并在跨语言属性识别、发音偏误检测中进行测试。最终,我们得到了最优平均识别准确率约为95%的属性检测器组;在两个二语测试集上的偏误检测,相比基线,基于FSA方法均获得了超过1%的性能提升。此外,我们还根据发音属性的跨语言特性设置了对照实验,并在上述任务中测试和分析。