A G.711 Embedded Wideband Speech Coding for VoIP Conferences(Speech and Hearing)
スポンサーリンク
概要
- 論文の詳細を見る
This paper proposes a wideband speech coder in which a G.711 bitstream is embedded. This coder has an advantage over conventional coders in that it has a high interoperability with existing terminals so costly transcoding involving decoding and re-encoding can be avoided. We also propose a partial mixing method that effectively reduces the mixing complexity in multiple-point remote conferences. To reduce the complexity, we take advantage of the scalable structure of the bitstream and mix only the lower band of the signal. For the higher band, the main speaker location is selected among remote locations and is redistributed with the mixed lower-band signal. By subjective evaluations, we show that the speech quality can be maintained even when the speech signals are partially mixed.
- 2006-09-01
著者
-
MORI Takeshi
NTT Cyber Space Laboratories NTT Corporation
-
Kataoka Akitoshi
NTT Cyber Space Laboratories
-
Kataoka Akitoshi
Ntt Cyber Space Laboratories Ntt Corporation
-
Hiwasaki Yusuke
Ntt Cyber Space Laboratories Ntt Corporation
-
OHMURO Hitoshi
NTT Cyber Space Laboratories, NTT Corporation
-
KURIHARA Sachiko
NTT Cyber Space Laboratories, NTT Corporation
-
Ohmuro Hitoshi
Ntt Cyber Space Laboratories Ntt Corporation
-
Kurihara Sachiko
Ntt Cyber Space Laboratories Ntt Corporation
関連論文
- New stereo echo canceller operating on single digital signal processor(Applied Systems)
- A G.711 Embedded Wideband Speech Coding for VoIP Conferences(Speech and Hearing)
- Quality Evaluation and Improvement of MPEG-4 TwinVQ Scalable Audio Coding under Packet Loss Condition
- Lossless Scalable Audio Coding and Quality Enhancement (Special Issue on Speech Information Processing)
- Measuring the Perceived Importance of Speech Segments for Transmission over IP Networks( Multimedia QoS Evaluation and Management Technologies)
- Noise Post-Processing for Low Bit-Rate CELP Coders(Speech and Hearing)
- Design of a Robust LSP Quantizer for a High-Quality 4-kbit/s CELP Speech Coder(Speech and Hearing)
- Robust Frequency Domain Acoustic Echo Cancellation Filter Employing Normalized Residual Echo Enhancement
- Gradient-Limited Affine Projection Algorithm for Double-Talk-Robust and Fast-Converging Acoustic Echo Cancellation(Engineering Acoustics)
- Frequency domain adaptive algorithm with nonlinear function of error-to-reference ratio for double-talk robust echo cancellation
- Enhancement of Sound Sources Located within a Particular Area Using a Pair of Small Microphone Arrays
- An Approach to Solve Local Minimum Problem in Sound Source and Microphone Localization(Engineering Acoustics)
- A Blind Source Localization by Using Freely Positioned Microphones(Special Section on Papers Selected from ITC-CSCC 2002)
- An Adaptive Microphone Array Using Multiple Fictitious Sources
- An adaptive microphone array for howling cancellation