James Breen


2018

pdf bib
The Company They Keep: Extracting Japanese Neologisms Using Language Patterns
James Breen | Timothy Baldwin | Francis Bond
Proceedings of the 9th Global Wordnet Conference

We describe an investigation into the identification and extraction of unrecorded potential lexical items in Japanese text by detecting text passages containing selected language patterns typically associated with such items. We identified a set of suitable patterns, then tested them with two large collections of text drawn from the WWW and Twitter. Samples of the extracted items were evaluated, and it was demonstrated that the approach has considerable potential for identifying terms for later lexicographic analysis.

2012

pdf bib
Segmentation and Translation of Japanese Multi-word Loanwords
James Breen | Timothy Baldwin | Francis Bond
Proceedings of the Australasian Language Technology Association Workshop 2012

2009

pdf bib
Corpus-based Extraction of Japanese Compound Verbs
James Breen | Timothy Baldwin
Proceedings of the Australasian Language Technology Association Workshop 2009