Automatic Alignment of Japanese-Chinese Bilingual Texts
スポンサーリンク
概要
- 論文の詳細を見る
Automatic alignment of bilingual texts is useful to example-based machine translation by facilitating the creation of example pairs of translation for the machine. Two main approaches to automatic alignment have been reported in the literature. They are lexical approach and statistical approach. The former looks for relationships between lexical contents of the bilingual texts in order to find alignment pairs, while the latter uses statistical correlation between sentence lengths of the bilingual texts as the basis of matching. This paper describes a combination of the two approaches in aligning Japanese-Chinese bilingual texts by allowing kanji contents and sentence lengths in the texts to work together in achieving an alignment process. Be-cause of the sentential structure differences between Japanese and Chinese, matching at the sentence level may result in frequent matching between a number of sentences en masses. In view of this, the current work also attempts to create shorter align-ment pairs by permitting sentences to be matched with clauses or phrases of the other text if possible. While such matching is more difficult and error-prone, the reliance on kanji contents has proven to be very useful in minimizing the errors. The current research has thus found solutions to problems that are unique to the present work.
- 社団法人電子情報通信学会の論文
- 1995-01-25
著者
-
Nagao Makoto
Faculty of Engineering, Kyoto University
-
Nagao Makoto
Faculty Of Engineering Kyoto University
-
Nagao M
Faculty Of Engineering Kyoto University
-
Tan Chew
Department of Information Systems & Computer Science, National University of Singapore
-
Tan C
National Univ. Singapore Singapore
関連論文
- Hierarchy-Based Networked Organization, Modeling, and Prototyping of Semantic, Statistic, and Numeric Image Information
- Automatic Alignment of Japanese-Chinese Bilingual Texts
- A Practical Structural Representation of a Segmented Image
- A Method of Case Structure Analysis for Japanese Sentences Based on Examples in Case Frame Dictionary (Special Issue on Natural Language Processing and Understanding)