Latent Feature-based Data Splits to Improve Generalisation Evaluation: A Hate Speech Detection Case Study Maike Züfle author Verna Dankers author Ivan Titov author 2023-12 text Proceedings of the 1st GenBench Workshop on (Benchmarking) Generalisation in NLP Dieuwke Hupkes editor Verna Dankers editor Khuyagbaatar Batsuren editor Koustuv Sinha editor Amirhossein Kazemnejad editor Christos Christodoulopoulos editor Ryan Cotterell editor Elia Bruni editor Association for Computational Linguistics Singapore conference publication zufle-etal-2023-latent 10.18653/v1/2023.genbench-1.9 https://aclanthology.org/2023.genbench-1.9/ 2023-12 112 129