papers AI Learner
The Github is limit! Click to go to the new site.

A Lexicalist Approach to the Translation of Colloquial Text

1997-06-18
Fred Popowich, Davide Turcato, Olivier Laurens, Paul McFetridge, J. Devlan Nicholson, Patrick McGivern, Maricela Corzo Pena, Lisa Pidruchney, Scott MacDonald (Simon Fraser University, Burnaby, Canada; TCC Communications, Victoria, Canada)

Abstract

Colloquial English (CE) as found in television programs or typical conversations is different than text found in technical manuals, newspapers and books. Phrases tend to be shorter and less sophisticated. In this paper, we look at some of the theoretical and implementational issues involved in translating CE. We present a fully automatic large-scale multilingual natural language processing system for translation of CE input text, as found in the commercially transmitted closed-caption television signal, into simple target sentences. Our approach is based on the Whitelock’s Shake and Bake machine translation paradigm, which relies heavily on lexical resources. The system currently translates from English to Spanish with the translation modules for Brazilian Portuguese under development.

Abstract (translated by Google)

在电视节目或典型对话中发现的口语英语(CE)与技术手册,报纸和书籍中的文本不同。短语往往更短,不太复杂。在本文中,我们看一些涉及翻译CE的理论和实施问题。我们提供了一个全自动的大型多语言自然语言处理系统,用于在商业传输的隐藏字幕电视信号中找到的用于CE输入文本的翻译成简单的目标语句。我们的方法是基于Whitelock的Shake and Bake机器翻译范式,这种范式在很大程度上依赖于词汇资源。该系统目前正在翻译从英文到西班牙文,正在开发的巴西葡萄牙语翻译模块。

URL

https://arxiv.org/abs/cmp-lg/9706024

PDF

https://arxiv.org/pdf/cmp-lg/9706024


Similar Posts

Comments