Title: DocHieNet: A Large and Diverse Dataset for Document Hierarchy Parsing
Authors: Hangdi Xing, Changxu Cheng, Feiyu Gao, Zirui Shao, Zhi Yu, Jiajun Bu, Qi Zheng, Cong Yao
Date: 2024-11
Type: conference publication
Published in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Editors: Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Publisher: Association for Computational Linguistics
Location: Miami, Florida, USA
Pages: 1129-1142
Citation key: xing-etal-2024-dochienet
DOI: 10.18653/v1/2024.emnlp-main.65
URL: https://aclanthology.org/2024.emnlp-main.65/