FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding

Amit Agarwal; Srikant Panda; Kulbhushan Pachauri

FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding

Amit Agarwal, Srikant Panda, Kulbhushan Pachauri

Abstract

In this work, we propose Few Shot Domain Adapting Graph (FS-DAG), a scalable and efficient model architecture for visually rich document understanding (VRDU) in few-shot settings. FS-DAG leverages domain-specific and language/vision specific backbones within a modular framework to adapt to diverse document types with minimal data. The model is robust to practical challenges such as handling OCR errors, misspellings, and domain shifts, which are critical in real-world deployments. FS-DAG is highly performant with less than 90M parameters, making it well-suited for complex real-world applications for Information Extraction (IE) tasks where computational resources are limited. We demonstrate FS-DAG’s capability through extensive experiments for information extraction task, showing significant improvements in convergence speed and performance compared to state-of-the-art methods. Additionally, this work highlights the ongoing progress in developing smaller, more efficient models that do not compromise on performance.

Anthology ID:: 2025.coling-industry.9
Volume:: Proceedings of the 31st International Conference on Computational Linguistics: Industry Track
Month:: January
Year:: 2025
Address:: Abu Dhabi, UAE
Editors:: Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert, Kareem Darwish, Apoorv Agarwal
Venue:: COLING
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 100–114
Language:
URL:: https://aclanthology.org/2025.coling-industry.9/
DOI:
Bibkey:
Cite (ACL):: Amit Agarwal, Srikant Panda, and Kulbhushan Pachauri. 2025. FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding. In Proceedings of the 31st International Conference on Computational Linguistics: Industry Track, pages 100–114, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):: FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding (Agarwal et al., COLING 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.coling-industry.9.pdf

PDF Cite Search Fix data