Integrating Table Representations into Large Language Models for Improved Scholarly Document Comprehension

Buse Sibel Korkmaz, Antonio Del Rio Chanona


Abstract
We address the challenge of interpreting and reasoning over scientific tables with Large Language Models (LLMs), a crucial aspect of scholarly documents. Despite significant progress in natural language processing, the integration of tabular data into scientific LLMs remains limited. We propose an innovative approach leveraging intermediate task pre-training on table question-answering datasets, followed by model adaptation to comprehend tables in computer science literature. Our findings reveal that incorporating table understanding substantially improves the performance of LLMs on scientific literature understanding tasks, which we showcase in peer-review score prediction. This improvement underscores the importance of utilizing tabular data in the training of scientific language models. The code and models are publicly available at [this link](https://github.com/buseskorkmaz/Integrating-Table-Representations-into-LLMs).
Anthology ID:
2024.sdp-1.28
Volume:
Proceedings of the Fourth Workshop on Scholarly Document Processing (SDP 2024)
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Tirthankar Ghosal, Amanpreet Singh, Anita Waard, Philipp Mayr, Aakanksha Naik, Orion Weller, Yoonjoo Lee, Shannon Shen, Yanxia Qin
Venues:
sdp | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
293–306
Language:
URL:
https://aclanthology.org/2024.sdp-1.28
DOI:
Bibkey:
Cite (ACL):
Buse Sibel Korkmaz and Antonio Del Rio Chanona. 2024. Integrating Table Representations into Large Language Models for Improved Scholarly Document Comprehension. In Proceedings of the Fourth Workshop on Scholarly Document Processing (SDP 2024), pages 293–306, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
Integrating Table Representations into Large Language Models for Improved Scholarly Document Comprehension (Korkmaz & Del Rio Chanona, sdp-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.sdp-1.28.pdf