James Breen
2018
The Company They Keep: Extracting Japanese Neologisms Using Language Patterns
James Breen | Timothy Baldwin | Francis Bond
Proceedings of the 9th Global Wordnet Conference
We describe an investigation into the identification and extraction of unrecorded potential lexical items in Japanese text by detecting text passages containing selected language patterns typically associated with such items. We identified a set of suitable patterns, then tested them with two large collections of text drawn from the WWW and Twitter. Samples of the extracted items were evaluated, and it was demonstrated that the approach has considerable potential for identifying terms for later lexicographic analysis.
2012
Segmentation and Translation of Japanese Multi-word Loanwords
James Breen | Timothy Baldwin | Francis Bond
Proceedings of the Australasian Language Technology Association Workshop 2012
2009
Corpus-based Extraction of Japanese Compound Verbs
James Breen | Timothy Baldwin
Proceedings of the Australasian Language Technology Association Workshop 2009