blob: c1bf24132b008f140525c6c1428fab85aa07f631 [file] [log] [blame]
Apache UIMA GaleMultiModalSample README file
GALE Multi-Modal NLP Sample Application
INTRODUCTION
------------
When complete this sample application will demonstrate how to implement a multi-modal UIMA pipeline
of NLP services capable of chaining together independently developed NLP engines to produce, for
example, a speech-to-speech application that transforms audio from one language to another, or one
that generates English captions for a foreign language news video.
It will include data-transformation routines that convert the output of one NLP engine into a format
suitable for input to another, while maintaining cross-reference links that allow, for example,
words in a translation to be aligned with the audio in the source language that generated them.
The sample NLP engines included will only simulate the real NLP work (e.g. speech recognition,
machine translation, entity detection, etc.) but examples of the usage of some open-source engines
may ne provided.
This sample multi-modal application is based on the one used to develop the GALE Interoperabilty
Demonstration system (IOD) that uses UIMA-AS to interconnect 11 different types of NLP analytics
distributed over 7 research facilities in 3 countries to transcribe, translate, and extract
information from foreign language news broadcasts [1].
The GALE TypeSystem (GTS) is spread over a number of files so that each NLP engine's types are
concentrated on one or two files [2].
CONTENTS
--------
Just the typesystem so far.
REFERENCES
----------
[1] J. F. Pitrelli, B. L. Lewis, E. A. Epstein, M. Franz, D. Kiecza, J. L. Quinn, G. Ramaswamy,
A. Srivastava, and P. Virga,
"Aggregating Distributed STT, MT, and Information Extraction Engines:
The GALE Interoperability-Demo System",
Proc. Interspeech 2008.
http://www.isca-speech.org/archive/interspeech_2008/i08_2743.html
[2] J. F. Pitrelli, B. L. Lewis, E. A. Epstein, J. L. Quinn, and G. Ramaswamy,
"A Data Format Enabling Interoperation of Speech Recognition, Translation and Information Extraction Engines:
The GALE Type System",
Proc. Interspeech 2008.
http://www.isca-speech.org/archive/interspeech_2008/i08_1654.html