Title: DocBank: A Benchmark Dataset for Document Layout Analysis
Authors: Minghao Li, Yiheng Xu, Lei Cui, Shaohan Huang, Furu Wei, Zhoujun Li, Ming Zhou
Date: 2020-12
In: Proceedings of the 28th International Conference on Computational Linguistics
Editors: Donia Scott, Nuria Bel, Chengqing Zong
Publisher: International Committee on Computational Linguistics
Place: Barcelona, Spain (Online)
Type: conference publication
Pages: 949-960
Identifier: li-etal-2020-docbank
DOI: 10.18653/v1/2020.coling-main.82
URL: https://aclanthology.org/2020.coling-main.82/