A Reordering Model Using a Source-Side Parse-Tree for Statistical Machine Translation
Abstract
This paper presents a reordering model using a source-side parse-tree for phrase-based statistical machine translation. The proposed model is an extension of IST-ITG (imposing source tree on inversion transduction grammar) constraints. In the proposed method, the target-side word order is obtained by rotating nodes of the source-side parse-tree. We modeled the node rotation, monotone or swap, using word alignments based on a training parallel corpus and source-side parse-trees. The model efficiently suppresses erroneous target word orderings, especially global orderings. Furthermore, the proposed method conducts a probabilistic evaluation of target word reorderings. In English-to-Japanese and English-to-Chinese translation experiments, the proposed method resulted in a 0.49-point improvement (29.31 to 29.80) and a 0.33-point improvement (18.60 to 18.93) in word BLEU-4 compared with IST-ITG constraints, respectively. This indicates the validity of the proposed reordering model.
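The node rotation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Node` class, the toy parse tree, and the probability table `TABLE` (which conditions each rotation only on the node label, with made-up values) are all assumptions introduced here for illustration. The abstract does not say what the real model conditions on or how it is integrated into the decoder.

```python
from dataclasses import dataclass
from itertools import product
from typing import Dict, Iterator, List, Optional, Sequence, Tuple

MONOTONE, SWAP = "monotone", "swap"


@dataclass
class Node:
    """Binary parse-tree node; leaves carry a word, internal nodes carry children."""
    label: str
    word: Optional[str] = None
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def internal_labels(node: Node) -> List[str]:
    """Labels of internal nodes in pre-order (parent, left subtree, right subtree)."""
    if node.word is not None:
        return []
    return [node.label] + internal_labels(node.left) + internal_labels(node.right)


def linearize(node: Node, rotations: Iterator[str]) -> List[str]:
    """Read off the word order, consuming one rotation per internal node in pre-order."""
    if node.word is not None:
        return [node.word]
    rotation = next(rotations)
    left = linearize(node.left, rotations)
    right = linearize(node.right, rotations)
    return left + right if rotation == MONOTONE else right + left


def score(labels: Sequence[str], rotations: Sequence[str],
          table: Dict[str, Dict[str, float]]) -> float:
    """Product of per-node rotation probabilities (hypothetical label-only model)."""
    prob = 1.0
    for label, rotation in zip(labels, rotations):
        prob *= table[label][rotation]
    return prob


def candidate_orders(tree: Node,
                     table: Dict[str, Dict[str, float]]) -> List[Tuple[float, List[str]]]:
    """Enumerate every monotone/swap assignment and rank the resulting orders."""
    labels = internal_labels(tree)
    candidates = []
    for rotations in product((MONOTONE, SWAP), repeat=len(labels)):
        candidates.append((score(labels, rotations, table),
                           linearize(tree, iter(rotations))))
    return sorted(candidates, key=lambda c: c[0], reverse=True)


# Toy source tree for "I ate an apple": S -> PRP VP, VP -> VBD NP, NP -> DT NN.
tree = Node("S",
            left=Node("PRP", word="I"),
            right=Node("VP",
                       left=Node("VBD", word="ate"),
                       right=Node("NP",
                                  left=Node("DT", word="an"),
                                  right=Node("NN", word="apple"))))

# Illustrative probabilities only; a real table would be estimated from
# word-aligned parallel data and source-side parse trees.
TABLE = {
    "S": {MONOTONE: 0.9, SWAP: 0.1},
    "VP": {MONOTONE: 0.2, SWAP: 0.8},
    "NP": {MONOTONE: 0.95, SWAP: 0.05},
}

best_prob, best_order = candidate_orders(tree, TABLE)[0]
print(best_order, best_prob)
```

With three internal nodes, the tree admits only 2^3 = 8 word orders instead of the 4! = 24 unconstrained permutations, which is the sense in which tree-based rotation suppresses implausible global orderings. The per-node probabilities then rank the orders that remain.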
- Paper of the Institute of Electronics, Information and Communication Engineers (IEICE)
- 2009-12-01
Authors
-
TOKUDA Keiichi
Department of Computer Science and Engineering, Nagoya Institute of Technology
-
YAMAMOTO Hirofumi
National Institute of Information and Communications Technology
-
SUMITA Eiichiro
National Institute of Information and Communications Technology
-
Sumita Eiichiro
National Inst. Communications Technol. Kyoto‐fu Jpn
-
HASHIMOTO Kei
Department of Bioproductive Sciences, Utsunomiya University
-
OKUMA Hideo
National Institute of Information and Communications Technology
-
Okuma Hideo
National Institute Of Communications Technology
-
Hashimoto Kei
Department Of Computer Science And Engineering Nagoya Institute Of Technology
-
Yamamoto Hirofumi
Atr Spoken Language Translation Res. Lab. Kyoto‐fu Jpn
-
Yamamoto Hirofumi
National Inst. Information And Communications Technol. Kyoto‐fu Jpn
-
HASHIMOTO Kei
Department of Applied Biological Chemistry, Utsunomiya University
Related papers
- Constraining a Generative Word Alignment Model with Discriminative Output
- The Nitech-NAIST HMM-Based Speech Synthesis System for the Blizzard Challenge 2006
- Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005(Speech and Hearing)
- Antioxidative Effects of Phenolic Acids on Lipid Peroxidation Induced by H₂O₂ in the Presence of Myoglobin
- Determination of Hydrogen Peroxide by High-Performance Liquid Chromatography with a Cation-Exchange Resin Gel Column and Electrochemical Detector
- Absorption and Metabolism of Quercetin in Caco-2 Cells
- Applying Sparse KPCA for Feature Extraction in Speech Recognition(Feature Extraction and Acoustic Modelings, Corpus-Based Speech Technologies)
- On the Use of Kernel PCA for Feature Extraction in Speech Recognition(Speech and Hearing)
- A Hidden Semi-Markov Model-Based Speech Synthesis System(Speech and Hearing)
- State Duration Modeling for HMM-Based Speech Synthesis(Speech and Hearing)
- A Training Method of Average Voice Model for HMM-Based Speech Synthesis(Digital Signal Processing)
- A Context Clustering Technique for Average Voice Models (Special Issue on Speech Information Processing)
- Multi-Space Probability Distribution HMM(Special Issue on the 2000 IEICE Excellent Paper Award)
- A Reordering Model Using a Source-Side Parse-Tree for Statistical Machine Translation
- Splitting Input for Machine Translation Using N-gram Language Model Together with Utterance Similarity(Natural Language Processing)
- Spectral Cosensitization in Organic Solar Cell with Mixed Film of Zinc Porphyrin and Merocyanine
- A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System
- E_019 Achilles : A Chinese Morphological Analyzer
- LMS-Based Algorithms with Multi-Band Decomposition of the Estimation Error Applied to System Identification (Special Section on Digital Signal Processing)
- Multi-Band Decomposition of the Linear Prediction Error Applied to Adaptive AR Spectral Estimation
- Inhibitory Effect of Arphamenine A on Intestinal Dipeptide Transport
- Imposing Constraints from the Source Tree on ITG Constraints for SMT
- Introducing a Translation Dictionary into Phrase-Based SMT
- Adaptive AR Spectral Estimation Based on Wavelet Decomposition of the Linear Prediction Error
- A Covariance-Typing Technique for HMM-Based Speech Synthesis
- Effects of β-Lactoglobulin on the Tight-junctional Stability of Caco-2-SF Monolayer
- Training Set Selection for Building Compact and Efficient Language Models
- Parameter Sharing in Mixture of Factor Analyzers for Speaker Identification(Feature Extraction and Acoustic Modelings, Corpus-Based Speech Technologies)
- Bilingual Cluster Based Models for Statistical Machine Translation
- Statistical Language Model Adaptation with Additional Text Generated by Machine Translation
- Suppression of the Menadione-Induced Cytotoxicity toward Hepa1c1c7 Murine Hepatoma by Quinone Reductase Inducers
- Deterministic Annealing EM Algorithm in Acoustic Modeling for Speaker and Speech Recognition(Feature Extraction and Acoustic Modelings, Corpus-Based Speech Technologies)
- Continuous Speech Recognition Based on General Factor Dependent Acoustic Models(Feature Extraction and Acoustic Modelings, Corpus-Based Speech Technologies)
- Bayesian Context Clustering Using Cross Validation for Speech Recognition
- Speech recognition based on statistical models including multiple phonetic decision trees
- Paraphrase Lattice for Statistical Machine Translation
- An Empirical Comparison of Parsers in Constraining Reordering for E-J Patent Machine Translation
- Japanese Argument Reordering Based on Dependency Structure for Statistical Machine Translation
- A Bayesian Framework Using Multiple Model Structures for Speech Recognition
- Database of Human Evaluations of Machine Translation Systems for Patent Translation
- Speaker interpolation for HMM-based speech synthesis system
- How to Translate Dialects: A Segmentation-Centric Pivot Translation Approach
- Joint Phrase Alignment and Extraction for Statistical Machine Translation
- Inhibitory Effect of Methyl Methanethiosulfinate on β-Glucuronidase Activity