%0 Conference Proceedings %T Text Tokenization for Knowledge-free Automatic Extraction of Lexical Similarities %A Thanopoulos, Aristomenis %A Fakotakis, Nikos %A Kokkinakis, George %Y Daille, Béatrice %Y Morin, Emmanuel %S Actes de la 10ème conférence sur le Traitement Automatique des Langues Naturelles. Posters %D 2003 %8 June %I ATALA %C Batz-sur-Mer, France %F thanopoulos-etal-2003-text %X Previous studies on automatic extraction of lexical similarities have considered as semantic unit of text the word. However, the theory of contextual lexical semantics implies that larger segments of text, namely non-compositional multiwords, are more appropriate for this role. We experimentally tested the applicability of this notion applying automatic collocation extraction to identify and merge such multiwords prior to the similarity estimation process. Employing an automatic WordNet-based comparative evaluation scheme along with a manual evaluation procedure, we ascertain improvement of the extracted similarity relations. %U https://aclanthology.org/2003.jeptalnrecital-poster.17 %P 397-402