Detection of Overlapping Speech in Meetings Using Support Vector Machines and Support Vector Regression(Engineering Acoustics)
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, a method of detecting overlapping speech segments in meetings is proposed. It is known that the eigenvalue distribution of the spatial correlation matrix calculated from a multiple microphone input reflects information on the number and relative power of sound sources. However, in a reverberant sound field, the feature of the number of sources in the eigenvalue distribution is degraded by the room reverberation. In the Support Vector Machines approach, the eigenvalue distribution is classified into two classes (overlapping speech segments and single speech segments). In the Support Vector Regression approach, the relative power of sound sources is estimated by using the eigenvalue distribution, and overlapping speech segments are detected based on the estimated relative power. The salient feature of this approach is that the sensitivity of detecting overlapping speech segments can be controlled simply by changing the threshold value of the relative power. The proposed method was evaluated using recorded data of an actual meeting.
- 社団法人電子情報通信学会の論文
- 2006-08-01
著者
-
Asano F
Graduate School Of Information Science Tohoku University
-
Asano Futoshi
Media Interaction Group Information Technology Research Institute Aist
-
YAMADA Takeshi
Graduate School of Urban Environmental Sciences, Tokyo Metropolitan University
-
KITAWAKI Nobuhiko
Graduate School of Systems and Information Engineering, University of Tsukuba
-
YAMAMOTO Kiyoshi
Graduate School of Systems and Information Engineering, University of Tsukuba
-
Yamada T
University Of Tsukuba
-
Kitawaki N
Graduate School Of Systems And Information Engineering University Of Tsukuba
-
Kitawaki Nobuhiko
Graduate School Of Systems And Information Engineering University Of Tsukuba
-
Yamada Takeshi
Graduate School Of Urban Environmental Sciences Tokyo Metropolitan University
-
Yamamoto Kiyoshi
Graduate School Of Systems And Information Engineering University Of Tsukuba
-
Yamada Takeshi
Graduate School Of Systems And Information Engineering University Of Tsukuba
関連論文
- CENSREC-1-C : An evaluation framework for voice activity detection under noisy environments
- A Speech Enhancement Technique Using Kalman Filter with State Vector of Time-Frequency Patterns(Special Section on Acoustic Signal Processing)
- New Design Method of a Binaural Microphone Array Using Multiple Constraints (Special Section on Advanced Signal Processing Techniques for Analysis of Acoustical and Vibrational Signals)
- Sound Field Reproduction by Controlling the Transfer Functions from the Source to Multiple Points in Close Proximity
- Convergence Characteristics of the Adaptive Array Using RLS Algorithm
- Sound localization in headphone reproduction by simulating transfer functions from the sound source to the external ear
- Objective Estimation of Word Intelligibility for Noise-Reduced Speech
- Non-reference Objective Quality Evaluation for Noise-Reduced Speech Using Overall Quality Estimation Model
- AURORA-2J: An Evaluation Framework for Japanese Noisy Speech Recognition(Speech Corpora and Related Topics, Corpus-Based Speech Technologies)
- A further investigation into the method for active suppression of reflected sound waves based on the state feedback control
- Active Control of Sound Intensity for Suppression of Reflected Sound Waves Based on the State Feedback Control(Special Section on Acoustic Signal Processing)
- A design of adaptive beamformer based on average speech spectrum for noisy speech recognition
- A Microphone Array-Based 3-D N-Best Search Method for Recognizing Multiple Sound Sources
- 25. Effect of Li ions on micro-phase separated structure of amphiphiilc di-block copolymer(poster presentation,Soft Matter as Structured Materials)
- Speech Enhancement Based on Short-Time Spectral Amplitude Estimation with Two-Channel Beamformer
- Information of Loudness in Aural Communication
- Comparison of Two Speech and Audio Coders at 8 kb/s from the Viewpoints of Coding Scheme and Quality (Special Issue on Performance and Quality of Service (QoS) of Multimedia Networks
- Laboratory-GISAXS Measurements of Block Copolymer Films with Highly Ordered and Normally Oriented Nanocylinders
- Thermally Reversible Structural Transformation Involving a C-H…O Hydrogen Bond in a Supramolecular Crystal
- Detection of Overlapping Speech in Meetings Using Support Vector Machines and Support Vector Regression(Engineering Acoustics)
- Comparative Assessment of Test Signals Used for Measuring Residual Echo Characteristics