Measuring the Perceived Importance of Speech Segments for Transmission over IP Networks(<Special Section> Multimedia QoS Evaluation and Management Technologies)
スポンサーリンク
概要
- 論文の詳細を見る
This paper presents a way of using a linear regression model to produce a single-valued criterion that indicates the perceived importance of each block in a stream of speech blocks. This method is superior to the conventional approach, voice activity detection (VAD), in that it provides a dynamically changing priority value for speech segments with finer granularity. The approach can be used in conjunction with scalable speech coding techniques in the context of IP QoS services to achieve a flexible form of quality control for speech transmission. A simple linear regression model is used to estimate a mean opinion score (MOS) of the various cases of missing speech segments. The estimated MOS is a continuous value that can be mapped to priority levels with arbitrary granularity. Through subjective evaluation, we show the validity of the calculated priority values.
- 社団法人電子情報通信学会の論文
- 2006-02-01
著者
-
Morinaga Toru
Ntt Cyber Space Laboratories Ntt Corporation:presently With Plala Networks Inc.
-
Kataoka Akitoshi
NTT Cyber Space Laboratories
-
HIWASAKI Yusuke
NTT Cyber Space Laboratories, NTT Corporation
-
IKEDO Jotaro
NTT Cyber Space Laboratories, NTT Corporation
-
Kataoka Akitoshi
Ntt Cyber Space Laboratories Ntt Corporation
-
Hiwasaki Yusuke
Ntt Cyber Space Laboratories Ntt Corporation
-
Ikedo Jotaro
Ntt Cyber Space Laboratories Ntt Corporation:presently With Ntt Resonant Inc.
-
KATAOKA Akitoshi
NTT Cyber Space Laboratories, NTT Corporation
関連論文
- New stereo echo canceller operating on single digital signal processor(Applied Systems)
- A G.711 Embedded Wideband Speech Coding for VoIP Conferences(Speech and Hearing)
- Measuring the Perceived Importance of Speech Segments for Transmission over IP Networks( Multimedia QoS Evaluation and Management Technologies)
- Noise Post-Processing for Low Bit-Rate CELP Coders(Speech and Hearing)
- Design of a Robust LSP Quantizer for a High-Quality 4-kbit/s CELP Speech Coder(Speech and Hearing)
- Robust Frequency Domain Acoustic Echo Cancellation Filter Employing Normalized Residual Echo Enhancement
- Gradient-Limited Affine Projection Algorithm for Double-Talk-Robust and Fast-Converging Acoustic Echo Cancellation(Engineering Acoustics)
- Frequency domain adaptive algorithm with nonlinear function of error-to-reference ratio for double-talk robust echo cancellation
- Enhancement of Sound Sources Located within a Particular Area Using a Pair of Small Microphone Arrays
- An Approach to Solve Local Minimum Problem in Sound Source and Microphone Localization(Engineering Acoustics)
- A Blind Source Localization by Using Freely Positioned Microphones(Special Section on Papers Selected from ITC-CSCC 2002)
- An Adaptive Microphone Array Using Multiple Fictitious Sources
- An adaptive microphone array for howling cancellation