GaleMultiModalExample/README.txt - uima-sandbox - Git at Google

      Apache UIMA GaleMultiModalSample README file

    GALE Multi-Modal NLP Sample Application

 INTRODUCTION
 ------------

 When complete this sample application will demonstrate how to implement a multi-modal UIMA pipeline
 of NLP services capable of chaining together independently developed NLP engines to produce, for
 example, a speech-to-speech application that transforms audio from one language to another, or one
 that generates English captions for a foreign language news video.
 It will include data-transformation routines that convert the output of one NLP engine into a format
 suitable for input to another, while maintaining cross-reference links that allow, for example,
 words in a translation to be aligned with the audio in the source language that generated them.
 The sample NLP engines included will only simulate the real NLP work (e.g. speech recognition,
 machine translation, entity detection, etc.) but examples of the usage of some open-source engines
 may ne provided.

 This sample multi-modal application is based on the one used to develop the GALE Interoperabilty
 Demonstration system (IOD) that uses UIMA-AS to interconnect 11 different types of NLP analytics
 distributed over 7 research facilities in 3 countries to transcribe, translate, and extract
 information from foreign language news broadcasts [1].
 The GALE TypeSystem (GTS) is spread over a number of files so that each NLP engine's types are
 concentrated on one or two files [2].

 CONTENTS
 --------

 Just the typesystem so far.


 REFERENCES
 ----------

 [1] J. F. Pitrelli, B. L. Lewis, E. A. Epstein, M. Franz, D. Kiecza, J. L. Quinn, G. Ramaswamy,
 A. Srivastava, and P. Virga,
 "Aggregating Distributed STT, MT, and Information Extraction Engines:
  The GALE Interoperability-Demo System",
 Proc. Interspeech 2008.
 http://www.isca-speech.org/archive/interspeech_2008/i08_2743.html

 [2] J. F. Pitrelli, B. L. Lewis, E. A. Epstein, J. L. Quinn, and G. Ramaswamy,
 "A Data Format Enabling Interoperation of Speech Recognition, Translation and Information Extraction Engines:
  The GALE Type System",
 Proc. Interspeech 2008.
 http://www.isca-speech.org/archive/interspeech_2008/i08_1654.html
	Apache UIMA GaleMultiModalSample README file

	GALE Multi-Modal NLP Sample Application

	INTRODUCTION
	------------

	When complete this sample application will demonstrate how to implement a multi-modal UIMA pipeline
	of NLP services capable of chaining together independently developed NLP engines to produce, for
	example, a speech-to-speech application that transforms audio from one language to another, or one
	that generates English captions for a foreign language news video.
	It will include data-transformation routines that convert the output of one NLP engine into a format
	suitable for input to another, while maintaining cross-reference links that allow, for example,
	words in a translation to be aligned with the audio in the source language that generated them.
	The sample NLP engines included will only simulate the real NLP work (e.g. speech recognition,
	machine translation, entity detection, etc.) but examples of the usage of some open-source engines
	may ne provided.

	This sample multi-modal application is based on the one used to develop the GALE Interoperabilty
	Demonstration system (IOD) that uses UIMA-AS to interconnect 11 different types of NLP analytics
	distributed over 7 research facilities in 3 countries to transcribe, translate, and extract
	information from foreign language news broadcasts [1].
	The GALE TypeSystem (GTS) is spread over a number of files so that each NLP engine's types are
	concentrated on one or two files [2].

	CONTENTS
	--------

	Just the typesystem so far.


	REFERENCES
	----------

	[1] J. F. Pitrelli, B. L. Lewis, E. A. Epstein, M. Franz, D. Kiecza, J. L. Quinn, G. Ramaswamy,
	A. Srivastava, and P. Virga,
	"Aggregating Distributed STT, MT, and Information Extraction Engines:
	The GALE Interoperability-Demo System",
	Proc. Interspeech 2008.
	http://www.isca-speech.org/archive/interspeech_2008/i08_2743.html

	[2] J. F. Pitrelli, B. L. Lewis, E. A. Epstein, J. L. Quinn, and G. Ramaswamy,
	"A Data Format Enabling Interoperation of Speech Recognition, Translation and Information Extraction Engines:
	The GALE Type System",
	Proc. Interspeech 2008.
	http://www.isca-speech.org/archive/interspeech_2008/i08_1654.html