Analysing The Impact of Sequence Composition on Language Model Pre-Training Yu Zhao author Yuanbin Qu author Konrad Staniszewski author Szymon Tworkowski author Wei Liu author Piotr Miłoś author Yuxiang Wu author Pasquale Minervini author 2024-08 text Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication zhao-etal-2024-analysing 10.18653/v1/2024.acl-long.427 https://aclanthology.org/2024.acl-long.427/ 2024-08 7897 7912