Towards a Clean Text Corpus for Ottoman Turkish Fatih Karagöz author Berat Doğan author Şaziye Betül Özateş author 2024-08 text Proceedings of the First Workshop on Natural Language Processing for Turkic Languages (SIGTURK 2024) Duygu Ataman editor Mehmet Oguz Derin editor Sardana Ivanova editor Abdullatif Köksal editor Jonne Sälevä editor Deniz Zeyrek editor Association for Computational Linguistics Bangkok, Thailand and Online conference publication karagoz-etal-2024-towards https://aclanthology.org/2024.sigturk-1.6/ 2024-08 62 70