2013
pdf
bib
Comparative Analyses of Textual Contents and Styles of Five Major Japanese Newspapers
Takafumi Suzuki
|
Erina Kanou
|
Yui Arakawa
Proceedings of the 27th Pacific Asia Conference on Language, Information, and Computation (PACLIC 27)
2012
pdf
bib
abs
Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
Takafumi Suzuki
|
Yusuke Abe
|
Itsuki Toyota
|
Takehito Utsuro
|
Suguru Matsuyoshi
|
Masatoshi Tsuchiya
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
The Japanese language has various types of functional expressions. In order to organize Japanese functional expressions with various surface forms, a lexicon of Japanese functional expressions with hierarchical organization was compiled. This paper proposes how to design the framework of identifying more than 16,000 functional expressions in Japanese texts by utilizing hierarchical organization of the lexicon. In our framework, more than 16,000 functional expressions are roughly divided into canonical / derived functional expressions. Each derived functional expression is intended to be identified by referring to the most similar occurrence of its canonical expression. In our framework, contextual occurrence information of much fewer canonical expressions are expanded into the whole forms of derived expressions, to be utilized when identifying those derived expressions. We also empirically show that the proposed method can correctly identify more than 80% of the functional / content usages only with less than 38,000 training instances of manually identified canonical expressions.
2011
pdf
bib
Analyzing the characteristics of academic paper categories by using an index of representativeness
Takafumi Suzuki
|
Kiyoko Uchiyama
|
Ryota Tomisaka
|
Akiko Aizawa
Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation
pdf
bib
Example-based Translation of Japanese Functional Expressions utilizing Semantic Equivalence Classes
Yusuke Abe
|
Takafumi Suzuki
|
Bing Liang
|
Takehito Utsuro
|
Mikio Yamamoto
|
Suguru Matsuyoshi
|
Yasuhide Kawada
Proceedings of the 4th Workshop on Patent Translation
2010
pdf
bib
abs
Utilizing Semantic Equivalence Classes of Japanese Functional Expressions in Translation Rule Acquisition from Parallel Patent Sentences
Taiji Nagasaka
|
Ran Shimanouchi
|
Akiko Sakamoto
|
Takafumi Suzuki
|
Yohei Morishita
|
Takehito Utsuro
|
Suguru Matsuyoshi
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)
In the ``Sandglass'' MT architecture, we identify the class of monosemous Japanese functional expressions and utilize it in the task of translating Japanese functional expressions into English. We employ the semantic equivalence classes of a recently compiled large scale hierarchical lexicon of Japanese functional expressions. We then study whether functional expressions within a class can be translated into a single canonical English expression. Based on the results of identifying monosemous semantic equivalence classes, this paper studies how to extract rules for translating functional expressions in Japanese patent documents into English. In this study, we use about 1.8M Japanese-English parallel sentences automatically extracted from Japanese-English patent families, which are distributed through the Patent Translation Task at the NTCIR-7 Workshop. Then, as a toolkit of a phrase-based SMT (Statistical Machine Translation) model, Moses is applied and Japanese-English translation pairs are obtained in the form of a phrase translation table. Finally, we extract translation pairs of Japanese functional expressions from the phrase translation table. Through this study, we found that most of the semantic equivalence classes judged as monosemous based on manual translation into English have only one translation rules even in the patent domain.
2007
pdf
bib
Exploring the Microscopic Textual Characteristics of Japanese Prime Ministers’ Diet Addresses by Measuring the Quantity and Diversity of Nouns
Takafumi Suzuki
|
Kyo Kageura
Proceedings of the 21st Pacific Asia Conference on Language, Information and Computation