A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization Oana Frunza author 2008-05 text Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC’08) Nicoletta Calzolari editor Khalid Choukri editor Bente Maegaard editor Joseph Mariani editor Jan Odijk editor Stelios Piperidis editor Daniel Tapias editor European Language Resources Association (ELRA) Marrakech, Morocco conference publication frunza-2008-trainable https://aclanthology.org/L08-1590/ 2008-05