Tackling interoperability issues within UIMA work flows

Nicolas Hernandez


Abstract
One of the major issues dealing with any workflow management frameworks is the components interoperability. In this paper, we are concerned with the Apache UIMA framework. We address the problem by considering separately the development of new components and the integration of existing tools. For the former objective, we propose an API to generically handle TS objects by their name using reflexivity in order to make the components TS-independent. In the latter case, we distinguish the case of aggregating heterogeneous TS-dependent UIMA components from the case of integrating non UIMA-native third party tools. We propose a mapper component to aggregate TS-dependent UIMA components. And we propose a component to wrap command lines third party tools and a set of components to connect various markup languages with the UIMA data structure. Finally, we present two situations where these solutions were effectively used: Training a POS tagger system from a treebank, and embedding an external POS tagger in a workflow. Our approch aims at providing quick development solutions.
Anthology ID:
L12-1667
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3618–3625
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/1129_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Nicolas Hernandez. 2012. Tackling interoperability issues within UIMA work flows. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 3618–3625, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Tackling interoperability issues within UIMA work flows (Hernandez, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/1129_Paper.pdf