@sofiagruiz92

Interlingua-based English+óGé¼GC£Korean Two-way Speech Translation of Doctor+óGé¼GC£Patient Dialogues with CCLINC

, , and . Machine Translation, 17 (3): 213--243 (2002)

Abstract

Development of a robust two-way real-time speech translationsystem exposes researchers and system developers to various challenges of machine translation(MT) and spoken language dialogues. The need for communicating in at least two differentlanguages poses problems not present for a monolingual spoken language dialogue system,where no MT engine is embedded within the process flow. Integration of various componentmodules for real-time operation poses challenges not present for text translation. In this paper,we present the CCLINC (Common Coalition Language System at Lincoln Laboratory) English+óGé¼GC£Koreantwo-way speech translation system prototype trained on doctor+óGé¼GC£patient dialogues,which integrates various techniques to tackle the challenges of automatic real-time speechtranslation. Key features of the system include (i) language+óGé¼GC£independent meaning representation which preserves the hierarchicalpredicate+óGé¼GC£argument structure of an input utterance, providing a powerful mechanism for discourse understanding of utterances originating from different languages,word-sense disambiguation and generation of various word orders of many languages, (ii) adoptionof the DARPA Communicator architecture, a plug-and-play distributed system architecturewhich facilitates integration of component modules and system operation in real time, and (iii)automatic acquisition of grammar rules and lexicons for easy porting of the system to differentlanguages and domains. We describe these features in detail and present experimental results.

Links and resources

Tags