Yasuhiro Minami
2022
Dataset Construction for Scientific-Document Writing Support by Extracting Related Work Section and Citations from PDF Papers
Keita Kobayashi | Kohei Koyama | Hiromi Narimatsu | Yasuhiro Minami
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Keita Kobayashi | Kohei Koyama | Hiromi Narimatsu | Yasuhiro Minami
Proceedings of the Thirteenth Language Resources and Evaluation Conference
To augment datasets used for scientific-document writing support research, we extract texts from “Related Work” sections and citation information in PDF-formatted papers published in English. The previous dataset was constructed entirely with Tex-formatted papers, from which it is easy to extract citation information. However, since many publicly available papers in various fields are provided only in PDF format, a dataset constructed using only Tex papers has limited utility. To resolve this problem, we augment the existing dataset by extracting the titles of sections using the visual features of PDF documents and extracting the Related Work section text using the explicit title information. Since text generated from the figures and footnotes appearing in the extraction target areas is considered noise, we remove instances of such text. Moreover, we map the cited paper’s information obtained using existing tools to citation marks detected by regular expression rules, resulting in pairs of cited paper information and text of the Related Work section. By evaluating body text extraction and citation mapping in the constructed dataset, the accuracy of the proposed dataset was found to be close to that of the previous dataset. Accordingly, we demonstrated the possibility of building a significantly augmented dataset.
2021
Task Definition and Integration For Scientific-Document Writing Support
Hiromi Narimatsu | Kohei Koyama | Kohji Dohsaka | Ryuichiro Higashinaka | Yasuhiro Minami | Hirotoshi Taira
Proceedings of the Second Workshop on Scholarly Document Processing
Hiromi Narimatsu | Kohei Koyama | Kohji Dohsaka | Ryuichiro Higashinaka | Yasuhiro Minami | Hirotoshi Taira
Proceedings of the Second Workshop on Scholarly Document Processing
With the increase in the number of published academic papers, growing expectations have been placed on research related to supporting the writing process of scientific papers. Recently, research has been conducted on various tasks such as citation worthiness (judging whether a sentence requires citation), citation recommendation, and citation-text generation. However, since each task has been studied and evaluated using data that has been independently developed, it is currently impossible to verify whether such tasks can be successfully pipelined to effective use in scientific-document writing. In this paper, we first define a series of tasks related to scientific-document writing that can be pipelined. Then, we create a dataset of academic papers that can be used for the evaluation of each task as well as a series of these tasks. Finally, using the dataset, we evaluate the tasks of citation worthiness and citation recommendation as well as both of these tasks integrated. The results of our evaluations show that the proposed approach is promising.
2018
Infant Word Comprehension-to-Production Index Applied to Investigation of Noun Learning Predominance Using Cross-lingual CDI database
Yasuhiro Minami | Tessei Kobayashi | Yuko Okumura
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Yasuhiro Minami | Tessei Kobayashi | Yuko Okumura
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Analyzing Vocabulary Commonality Index Using Large-scaled Database of Child Language Development
Yan Cao | Yasuhiro Minami | Yuko Okumura | Tessei Kobayashi
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Yan Cao | Yasuhiro Minami | Yuko Okumura | Tessei Kobayashi
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
2013
Open-domain Utterance Generation for Conversational Dialogue Systems using Web-scale Dependency Structures
Hiroaki Sugiyama | Toyomi Meguro | Ryuichiro Higashinaka | Yasuhiro Minami
Proceedings of the SIGDIAL 2013 Conference
Hiroaki Sugiyama | Toyomi Meguro | Ryuichiro Higashinaka | Yasuhiro Minami
Proceedings of the SIGDIAL 2013 Conference
2010
Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes
Toyomi Meguro | Ryuichiro Higashinaka | Yasuhiro Minami | Kohji Dohsaka
Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)
Toyomi Meguro | Ryuichiro Higashinaka | Yasuhiro Minami | Kohji Dohsaka
Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)
Learning to Model Domain-Specific Utterance Sequences for Extractive Summarization of Contact Center Dialogues
Ryuichiro Higashinaka | Yasuhiro Minami | Hitoshi Nishikawa | Kohji Dohsaka | Toyomi Meguro | Satoshi Takahashi | Genichiro Kikui
Coling 2010: Posters
Ryuichiro Higashinaka | Yasuhiro Minami | Hitoshi Nishikawa | Kohji Dohsaka | Toyomi Meguro | Satoshi Takahashi | Genichiro Kikui
Coling 2010: Posters
Modeling User Satisfaction Transitions in Dialogues from Overall Ratings
Ryuichiro Higashinaka | Yasuhiro Minami | Kohji Dohsaka | Toyomi Meguro
Proceedings of the SIGDIAL 2010 Conference
Ryuichiro Higashinaka | Yasuhiro Minami | Kohji Dohsaka | Toyomi Meguro
Proceedings of the SIGDIAL 2010 Conference
User-adaptive Coordination of Agent Communicative Behavior in Spoken Dialogue
Kohji Dohsaka | Atsushi Kanemoto | Ryuichiro Higashinaka | Yasuhiro Minami | Eisaku Maeda
Proceedings of the SIGDIAL 2010 Conference
Kohji Dohsaka | Atsushi Kanemoto | Ryuichiro Higashinaka | Yasuhiro Minami | Eisaku Maeda
Proceedings of the SIGDIAL 2010 Conference
2009
Analysis of Listening-Oriented Dialogue for Building Listening Agents
Toyomi Meguro | Ryuichiro Higashinaka | Kohji Dohsaka | Yasuhiro Minami | Hideki Isozaki
Proceedings of the SIGDIAL 2009 Conference
Toyomi Meguro | Ryuichiro Higashinaka | Kohji Dohsaka | Yasuhiro Minami | Hideki Isozaki
Proceedings of the SIGDIAL 2009 Conference
Effects of Conversational Agents on Human Communication in Thought-Evoking Multi-Party Dialogues
Kohji Dohsaka | Ryota Asai | Ryuichiro Higashinaka | Yasuhiro Minami | Eisaku Maeda
Proceedings of the SIGDIAL 2009 Conference
Kohji Dohsaka | Ryota Asai | Ryuichiro Higashinaka | Yasuhiro Minami | Eisaku Maeda
Proceedings of the SIGDIAL 2009 Conference
1994
A Large-Vocabulary Continuous Speech Recognition Algorithm and Its Application to a Multi-Modal Telephone Directory Assistance System
Yasuhiro Minami | Kiyohiro Shikano | Osamu Yoshioka | Satoshi Takahashi | Tomokazu Yamada | Sadaoki Furui
Human Language Technology: Proceedings of a Workshop held at Plainsboro, New Jersey, March 8-11, 1994
Yasuhiro Minami | Kiyohiro Shikano | Osamu Yoshioka | Satoshi Takahashi | Tomokazu Yamada | Sadaoki Furui
Human Language Technology: Proceedings of a Workshop held at Plainsboro, New Jersey, March 8-11, 1994