%0 Conference Proceedings %T D(H)ante: A New Set of Tools for XIII Century Italian %A Basile, Angelo %A Sangati, Federico %Y Calzolari, Nicoletta %Y Choukri, Khalid %Y Declerck, Thierry %Y Goggi, Sara %Y Grobelnik, Marko %Y Maegaard, Bente %Y Mariani, Joseph %Y Mazo, Helene %Y Moreno, Asuncion %Y Odijk, Jan %Y Piperidis, Stelios %S Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16) %D 2016 %8 May %I European Language Resources Association (ELRA) %C Portorož, Slovenia %F basile-sangati-2016-h %X In this paper we describe 1) the process of converting a corpus of Dante Alighieri from a TEI XML format in to a pseudo-CoNLL format; 2) how a pos-tagger trained on modern Italian performs on Dante’s Italian 3) the performances of two different pos-taggers trained on the given corpus. We are making our conversion scripts and models available to the community. The two other models trained on the corpus performs reasonably well. The tool used for the conversion process might turn useful for bridging the gap between traditional digital humanities and modern NLP applications since the TEI original format is not usually suitable for being processed with standard NLP tools. We believe our work will serve both communities: the DH community will be able to tag new documents and the NLP world will have an easier way in converting existing documents to a standardized machine-readable format. %U https://aclanthology.org/L16-1450 %P 2825-2828