Error Analysis of ToTTo Table-to-Text Neural NLG Models

Barkavi Sundararajan; Somayajulu Sripada; Ehud Reiter

doi:10.18653/v1/2022.gem-1.43

Error Analysis of ToTTo Table-to-Text Neural NLG Models

Barkavi Sundararajan, Somayajulu Sripada, Ehud Reiter

Abstract

We report error analysis of outputs from four Table-to-Text generation models fine-tuned on ToTTo, an open-domain English language dataset. We carried out a manual error annotation of a subset of outputs (a total of 3,016 sentences) belonging to the topic of Politics generated by these four models. Our error annotation focused on eight categories of errors. The error analysis shows that more than 46% of sentences from each of the four models have been error-free. It uncovered some of the specific classes of errors; for example, WORD errors (mostly verbs and prepositions) are the dominant errors in all four models and are the most complex ones among other errors. NAME (mostly nouns) and NUMBER errors are slightly higher in two of the GeM benchmark models, whereas DATE-DIMENSION and OTHER categories of errors are more common in our Table-to-Text model. This in-depth error analysis is currently guiding us in improving our Table-to-Text model.

Anthology ID:: 2022.gem-1.43
Volume:: Proceedings of the Second Workshop on Natural Language Generation, Evaluation, and Metrics (GEM)
Month:: December
Year:: 2022
Address:: Abu Dhabi, United Arab Emirates (Hybrid)
Editors:: Antoine Bosselut, Khyathi Chandu, Kaustubh Dhole, Varun Gangal, Sebastian Gehrmann, Yacine Jernite, Jekaterina Novikova, Laura Perez-Beltrachini
Venue:: GEM
SIG:: SIGGEN
Publisher:: Association for Computational Linguistics
Note:
Pages:: 456–470
Language:
URL:: https://aclanthology.org/2022.gem-1.43/
DOI:: 10.18653/v1/2022.gem-1.43
Bibkey:
Cite (ACL):: Barkavi Sundararajan, Somayajulu Sripada, and Ehud Reiter. 2022. Error Analysis of ToTTo Table-to-Text Neural NLG Models. In Proceedings of the Second Workshop on Natural Language Generation, Evaluation, and Metrics (GEM), pages 456–470, Abu Dhabi, United Arab Emirates (Hybrid). Association for Computational Linguistics.
Cite (Informal):: Error Analysis of ToTTo Table-to-Text Neural NLG Models (Sundararajan et al., GEM 2022)
Copy Citation:
PDF:: https://aclanthology.org/2022.gem-1.43.pdf
Video:: https://aclanthology.org/2022.gem-1.43.mp4

PDF Cite Search Video Fix data