Classifying Standard Linguistic Processing Functionalities based on Fundamental Data Operation Types

Yoshihiko Hayashi, Chiharu Narawa


Abstract
iIt is often argued that a set of standard linguistic processing functionalities should be identified,with each of them given a formal specification. We would benefit from the formal specifications; for example, the semi-automated composition of a complex language processing workflow could be enabled in due time. This paper extracts a standard set of linguistic processing functionalities and tries to classify them formally. To do this, we first investigated prominent types of language Web services/linguistic processors by surveying a Web-based language service infrastructure and published NLP toolkits. We next induced a set of standard linguistic processing functionalities by carefully investigating each of the linguistic processor types. The standard linguistic processing functionalities was then characterized by the input/output data types, as well as the required data operation types, which were also derived from the investigation. As a result, we came up with an ontological depiction that classifies linguistic processors and linguistic processing functionalities with respect to the fundamental data operation types. We argue that such an ontological depiction can explicitly describe the functional aspects of a linguistic processing functionality.
Anthology ID:
L12-1512
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
1169–1173
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/863_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Yoshihiko Hayashi and Chiharu Narawa. 2012. Classifying Standard Linguistic Processing Functionalities based on Fundamental Data Operation Types. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1169–1173, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Classifying Standard Linguistic Processing Functionalities based on Fundamental Data Operation Types (Hayashi & Narawa, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/863_Paper.pdf