Yasuhiro Minami


2022

pdf bib
Dataset Construction for Scientific-Document Writing Support by Extracting Related Work Section and Citations from PDF Papers
Keita Kobayashi | Kohei Koyama | Hiromi Narimatsu | Yasuhiro Minami
Proceedings of the Thirteenth Language Resources and Evaluation Conference

To augment datasets used for scientific-document writing support research, we extract texts from “Related Work” sections and citation information in PDF-formatted papers published in English. The previous dataset was constructed entirely with Tex-formatted papers, from which it is easy to extract citation information. However, since many publicly available papers in various fields are provided only in PDF format, a dataset constructed using only Tex papers has limited utility. To resolve this problem, we augment the existing dataset by extracting the titles of sections using the visual features of PDF documents and extracting the Related Work section text using the explicit title information. Since text generated from the figures and footnotes appearing in the extraction target areas is considered noise, we remove instances of such text. Moreover, we map the cited paper’s information obtained using existing tools to citation marks detected by regular expression rules, resulting in pairs of cited paper information and text of the Related Work section. By evaluating body text extraction and citation mapping in the constructed dataset, the accuracy of the proposed dataset was found to be close to that of the previous dataset. Accordingly, we demonstrated the possibility of building a significantly augmented dataset.

2021

pdf bib
Task Definition and Integration For Scientific-Document Writing Support
Hiromi Narimatsu | Kohei Koyama | Kohji Dohsaka | Ryuichiro Higashinaka | Yasuhiro Minami | Hirotoshi Taira
Proceedings of the Second Workshop on Scholarly Document Processing

With the increase in the number of published academic papers, growing expectations have been placed on research related to supporting the writing process of scientific papers. Recently, research has been conducted on various tasks such as citation worthiness (judging whether a sentence requires citation), citation recommendation, and citation-text generation. However, since each task has been studied and evaluated using data that has been independently developed, it is currently impossible to verify whether such tasks can be successfully pipelined to effective use in scientific-document writing. In this paper, we first define a series of tasks related to scientific-document writing that can be pipelined. Then, we create a dataset of academic papers that can be used for the evaluation of each task as well as a series of these tasks. Finally, using the dataset, we evaluate the tasks of citation worthiness and citation recommendation as well as both of these tasks integrated. The results of our evaluations show that the proposed approach is promising.

2018

pdf bib
Infant Word Comprehension-to-Production Index Applied to Investigation of Noun Learning Predominance Using Cross-lingual CDI database
Yasuhiro Minami | Tessei Kobayashi | Yuko Okumura
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

pdf bib
Analyzing Vocabulary Commonality Index Using Large-scaled Database of Child Language Development
Yan Cao | Yasuhiro Minami | Yuko Okumura | Tessei Kobayashi
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

2013

pdf bib
Open-domain Utterance Generation for Conversational Dialogue Systems using Web-scale Dependency Structures
Hiroaki Sugiyama | Toyomi Meguro | Ryuichiro Higashinaka | Yasuhiro Minami
Proceedings of the SIGDIAL 2013 Conference

2010

pdf bib
Modeling User Satisfaction Transitions in Dialogues from Overall Ratings
Ryuichiro Higashinaka | Yasuhiro Minami | Kohji Dohsaka | Toyomi Meguro
Proceedings of the SIGDIAL 2010 Conference

pdf bib
User-adaptive Coordination of Agent Communicative Behavior in Spoken Dialogue
Kohji Dohsaka | Atsushi Kanemoto | Ryuichiro Higashinaka | Yasuhiro Minami | Eisaku Maeda
Proceedings of the SIGDIAL 2010 Conference

pdf bib
Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes
Toyomi Meguro | Ryuichiro Higashinaka | Yasuhiro Minami | Kohji Dohsaka
Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)

pdf bib
Learning to Model Domain-Specific Utterance Sequences for Extractive Summarization of Contact Center Dialogues
Ryuichiro Higashinaka | Yasuhiro Minami | Hitoshi Nishikawa | Kohji Dohsaka | Toyomi Meguro | Satoshi Takahashi | Genichiro Kikui
Coling 2010: Posters

2009

pdf bib
Analysis of Listening-Oriented Dialogue for Building Listening Agents
Toyomi Meguro | Ryuichiro Higashinaka | Kohji Dohsaka | Yasuhiro Minami | Hideki Isozaki
Proceedings of the SIGDIAL 2009 Conference

pdf bib
Effects of Conversational Agents on Human Communication in Thought-Evoking Multi-Party Dialogues
Kohji Dohsaka | Ryota Asai | Ryuichiro Higashinaka | Yasuhiro Minami | Eisaku Maeda
Proceedings of the SIGDIAL 2009 Conference

1994

pdf bib
A Large-Vocabulary Continuous Speech Recognition Algorithm and Its Application to a Multi-Modal Telephone Directory Assistance System
Yasuhiro Minami | Kiyohiro Shikano | Osamu Yoshioka | Satoshi Takahashi | Tomokazu Yamada | Sadaoki Furui
Human Language Technology: Proceedings of a Workshop held at Plainsboro, New Jersey, March 8-11, 1994