A 16kb/s Wideband CELP-Based Speech Coder Using Mel-Generalized Cepstral Analysis

Abstract

We propose a wideband CELP-type speech coder at 16 kb/s based on a mel-generalized cepstral (MGC) analysis technique. Compared with linear predictive (LP) analysis, MGC analysis gives a more accurate representation of spectral zeros and takes a perceptual frequency scale into account. A major advantage of the proposed coder is that the benefits of the MGC representation of speech spectra can be incorporated into the CELP coding process. Subjective tests show that the proposed coder at 16 kb/s achieves a significant improvement in performance over a conventional 16 kb/s CELP coder under the same coding framework and bit allocation. Moreover, the proposed coder is found to outperform the ITU-T G.722 standard at 64 kb/s.
Paper published by the Institute of Electronics, Information and Communication Engineers (IEICE)
2000-04-25

Authors

Tokuda K
Department Of Computer Science And Engineering Nagoya Institute Of Technology
Tokuda Keiichi
The Authors Are With The Nagoya Institute Of Technology
KOBAYASHI Takao
Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology
KOBAYASHI Takao
The authors are with the Department of Information Processing, Interdisciplinary Graduate School of
Kobayashi Takao
Tokyo Inst. Technol. Yokohama‐shi Jpn
Tokuda Keiichi
The Department Of Computer Science Nagoya Institute Of Technology
Kobayashi T
Interdisciplinary Graduate School Of Science And Engineering Tokyo Institute Of Technology
KOISHIDA Kazuhito
The authors are with Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute o
HIRABAYASHI Gou
The authors are with Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute o
Hirabayashi Gou
The Authors Are With Interdisciplinary Graduate School Of Science And Engineering Tokyo Institute Of
Koishida Kazuhito
The Interdisciplinary Graduate School Of Science And Engineering Tokyo Institute Of Technology:signa
Kobayashi Takao
Department Of Obstetrics And Gynecology Hamamatsu University School Of Medicine
Hirabayashi Go
The authors are with Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology
TOKUDA Keiichi
The author is with the Department of Computer Science, Nagoya Institute of Technology

Related papers

A Style Control Technique for HMM-Based Expressive Speech Synthesis (Speech and Hearing)
A Style Adaptation Technique for Speech Synthesis Using HSMM and Suprasegmental Features (Speech Synthesis, Statistical Modeling for Speech Processing)
Speech Synthesis with Various Emotional Expressions and Speaking Styles by Style Interpolation and Morphing (Life-like Agent and its Communication)
Acoustic Modeling of Speaking Styles and Emotional Expressions in HMM-Based Speech Synthesis (Speech Synthesis and Prosody, Corpus-Based Speech Technologies)
A Hidden Semi-Markov Model-Based Speech Synthesis System (Speech and Hearing)
State Duration Modeling for HMM-Based Speech Synthesis (Speech and Hearing)
A Training Method of Average Voice Model for HMM-Based Speech Synthesis (Digital Signal Processing)
A Context Clustering Technique for Average Voice Models (Special Issue on Speech Information Processing)
Speaker Adaptation of Pitch and Spectrum for HMM-Based Speech Synthesis
Multi-Space Probability Distribution HMM (Special Issue on the 2000 IEICE Excellent Paper Award)
Vector Quantization of Speech Spectral Parameters Using Statistics of Static and Dynamic Features
Text-Independent Speaker Identification Using Gaussian Mixture Models Based on Multi-Space Probability Distribution (Special Issue on Biometric Person Authentication)
A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System
Mixture Density Models Based on Mel-Cepstral Representation of Gaussian Process (Digital Signal Processing)
A 16kb/s Wideband CELP-Based Speech Coder Using Mel-Generalized Cepstral Analysis
Development of Material Management System for Newspapers (Special Issue on New Generation Database Technologies)
Conidiomatal development of Pestalotiopsis guepinii and P. neglecta on leaves of Gardenia jasminoides
Pycnidial development of Phyllosticta harai and Sphaeropsis sp.
LMS-Based Algorithms with Multi-Band Decomposition of the Estimation Error Applied to System Identification (Special Section on Digital Signal Processing)
Multi-Band Decomposition of the Linear Prediction Error Applied to Adaptive AR Spectral Estimation
Robust F_0 Estimation of Speech Signal Using Harmonicity Measure Based on Instantaneous Frequency (Speech and Hearing)
Adaptive AR Spectral Estimation Based on Wavelet Decomposition of the Linear Prediction Error
A Covariance-Typing Technique for HMM-Based Speech Synthesis
An autopsy case of cyclopia with 13 trisomy with special reference to histological abnormalities of the eyeball
Acrania : an autopsy case and review of the literature
Parameter Sharing in Mixture of Factor Analyzers for Speaker Identification (Feature Extraction and Acoustic Modelings, Corpus-Based Speech Technologies)
Bayesian Context Clustering Using Cross Validation for Speech Recognition
Lip Location Normalized Training for Visual Speech Recognition
FOREWORD
An Extension of Separable Lattice 2-D HMMs for Rotational Data Variations
A Bayesian Framework Using Multiple Model Structures for Speech Recognition
