AdminSet and AdminBERT: a Dataset and a Pre-trained Language Model to Explore the Unstructured Maze of French Administrative Documents Thomas Sebbag author Solen Quiniou author Nicolas Stucky author Emmanuel Morin author 2025-01 text Proceedings of the 31st International Conference on Computational Linguistics Owen Rambow editor Leo Wanner editor Marianna Apidianaki editor Hend Al-Khalifa editor Barbara Di Eugenio editor Steven Schockaert editor Association for Computational Linguistics Abu Dhabi, UAE conference publication sebbag-etal-2025-adminset https://aclanthology.org/2025.coling-main.27/ 2025-01 392 406