Improved CELP-Based Coding in a Noisy Environment Using a Trained Sparse Conjugate Codebook
スポンサーリンク
概要
- 論文の詳細を見る
A trained sparse conjugate codebook is proposed for improving the speech quality of CELP-based coding in a noisy environment. Although CELP coding provides high quality at a low bit rate in a silent environment (creating clean speech), it cannot provide a satisfactory quality in a noisy environment because the conventional fixed codebook is designed to be suitable for clean speech. The proposed codebook consists of two sub-codebooks; each sub-codebook consists of a random component and a trained component. Each component has excitation vectors consisting of a few pulses. In the random component, pulse position and amplitude are determined randomly. Since the random component does not depend on the speech characteristics, it handles noise better than the trained one. The trained component maintains high quality for clean speech. Since excitation vector is the sum of the two sub-excitation vectors, this codebook handles various speech conditions by selecting a sub-vector from each component. This codebook also reduces the computational complexity of a fixed codebook search and memory requirements compared with the conventional codebook. Subjective testing (absolute category rating (ACR) and degradation category rating (DCR)) indicated that this codebook improves speech quality compared with the conventional trained codebook for noisy speech. The ACR test showed that the quality of the 8 kbit/s CELP coder with this codebook is equivalent to that of the 32 kbit/s ADPCM for clean speech.
- 1996-02-25
著者
-
Hayashi Shinji
Ntt Human Interface Labs.
-
KATAOKA Akitoshi
Faculty of Science and Technology, Ryukoku University
-
Hayashi S
Univ. Electro‐communications Chofu Jpn
-
MORIYA Takehiro
NTT Cyber Space Laboratories NTT Corporation
-
Moriya T
Ntt Cyber Space Laboratories Ntt Corporation
-
Moriya Takehiro
Ntt Human Interface Laboratories
-
KATAOKA Akitoshi
NTT Human Interface Labs.
-
KURIHARA Sachiko
NTT Human Interface Labs.
-
Kataoka A
Faculty Of Science And Technology Ryukoku University
-
Kataoka Akitoshi
Ntt Human Interface Laboratories
-
Hayashi Shinji
Ntt Human Interface Laboratories
関連論文
- Improved CELP-Based Coding in a Noisy Environment Using a Trained Sparse Conjugate Codebook
- Improving Power Spectra Estimation in 2-Dimensional Areas Using Number of Active Sound Sources
- Nitrogen Doping in GaP Microcrystals : A Photoluminescence Study
- Raman Study of Crystal Structure of Gas-Evaporated MoO_3 Microcrystals
- A 6.4-kbit/s Variable-Bit-Rate Extension to the G.729 (CS-ACELP) Speech Coder
- Photoluminescence from Si1-xGex alloy nanocrystals
- Size-dependent near-infrared photoluminescence from Ge nanocrystals embedded in SiO2 matrices
- Photoluminescence of Si-Rich SiO_2 Films : Si Clusters as Luminescent Centers
- Raman Scattering from Acoustic Phonons Confined in Microcrystals : Small Gold and Silver Particles Embedded in SiO_2 Thin Films
- Growth of Ge Microcrystals in SiO_2 Thin Film Matrices : A Raman and Electron Microscopic Study
- Quantum Size Effects in Ge Microcrystals Embedded in SiO_2 Thin Films
- Implications of Broad Raman Spectra of Sputtered SiGe Alloy Films : Condensed Matter
- Quality Evaluation and Improvement of MPEG-4 TwinVQ Scalable Audio Coding under Packet Loss Condition
- Lossless Scalable Audio Coding and Quality Enhancement (Special Issue on Speech Information Processing)
- An Objective Measure Based on an Auditory Model for Assessing Low-Rate Coded Speech
- A Remote Auscultation Support System Using Network
- Comparison of Two Speech and Audio Coders at 8 kb/s from the Viewpoints of Coding Scheme and Quality (Special Issue on Performance and Quality of Service (QoS) of Multimedia Networks
- Robust Frequency Domain Acoustic Echo Cancellation Filter Employing Normalized Residual Echo Enhancement
- Gradient-Limited Affine Projection Algorithm for Double-Talk-Robust and Fast-Converging Acoustic Echo Cancellation(Engineering Acoustics)
- Frequency domain adaptive algorithm with nonlinear function of error-to-reference ratio for double-talk robust echo cancellation
- Enhancement of Sound Sources Located within a Particular Area Using a Pair of Small Microphone Arrays
- Coding of LSP Parameters Using Interframe Moving Average Prediction and Multi-Stage Vector Quantization (Special Section of Letters Selected from the 1993 IEICE Spring Conference)
- Pitch Synchronous Innovation CELP (PSI-CELP) (Special Section of Letters Selected from the 1993 IEICE Spring Conference)
- Below bulk-band-gap photoluminescence at room temperature from heavily P- and B-doped Si nanocrystals
- Control of photoluminescence properties of Si nanocrystals by simultaneously doping n- and p-type impurities
- Photoluminescence from impurity codoped and compensated Si nanocrystals
- Enhanced optical properties of Si1-xGex alloy nanocrystals in a planar microcavity
- Improving Power Spectra Estimation in 2-Dimensional Areas Using Number of Active Sound Sources
- An Approach to Solve Local Minimum Problem in Sound Source and Microphone Localization(Engineering Acoustics)
- FOREWORD (Special Section of Letters Selected from the 1995 Society Conference of IEICE)
- A Blind Source Localization by Using Freely Positioned Microphones(Special Section on Papers Selected from ITC-CSCC 2002)
- An Adaptive Microphone Array Using Multiple Fictitious Sources
- An adaptive microphone array for howling cancellation
- A microphone-array configuration for AMNOR : Adaptive microphone-array system for noise reduction
- Procedure for estimating fluctuation strength from tremolo by irregular plucking of mandolin