Underdetermined Blind Separation of Convolutive Mixtures of Speech Using Time-Frequency Mask and Mixing Matrix Estimation(Blind Source Separation, <Special Section>Multi-channel Acoustic Signal Processing)
スポンサーリンク
概要
- 論文の詳細を見る
This paper focuses on the underdetermined blind source separation (BSS) of three speech signals mixed in a real environment from measurements provided by two sensors. To date, solutions to the underdetermined BSS problem have mainly been based on the assumption that the speech signals are sufficiently sparse. They involve designing binary masks that extract signals at time-frequency points where only one signal was assumed to exist. The major issue encountered in previous work relates to the occurrence of distortion, which affects a separated signal with loud musical noise. To overcome this problem, we propose combining sparseness with the use of an estimated mixing matrix. First, we use a geometrical approach to detect when only one source is active and to perform a preliminary separation with a time-frequency mask. This information is then used to estimate the mixing matrix, which allows us to improve our separation. Experimental results show that this combination of time-frequency mask and mixing matrix estimation provides separated signals of better quality (less distortion, less musical noise) than those extracted without using the estimated mixing matrix in reverberant conditions where the reverberant time (TR) was 130ms and 200ms. Furthermore, informal listening tests clearly show that musical noise is deeply lowered by the proposed method comparatively to the classical approaches.
- 社団法人電子情報通信学会の論文
- 2005-07-01
著者
-
Makino Shoji
NTT Communication Science Laboratories
-
Blin Audrey
Ntt Communication Science Laboratories Ntt Corporation:(present Address)universite Du Quebec Inrs Em
-
Makino Shoji
Ntt Communication Science Laboratories Ntt Corporation
-
ARAKI Shoko
NTT Communication Science Laboratories, NTT Corporation
-
Araki Shoko
Ntt Communication Science Laboratories Ntt Corporation
関連論文
- A Fast Projection Algorithm for Adaptive Filtering
- A desing of a hands-free communication unit using loudspeakers and microphones with a flat directional pattern
- Subjective Assessment of the Desired Echo Return Loss for Subband Acoustic Echo Cancellers
- FOREWORD
- Subband-Based Blind Separation for Convolutive Mixtures of Speech(Engineering Acoustics)
- Estimating the number of sources using independent component analysis
- Blind Source Separation of Convolutive Mixtures of Speech in Frequency Domain(Multi-channel Acoustic Signal Processing)
- Blind Source Separation for Moving Speech Signals Using Blockwise ICA and Residual Crosstalk Subtraction(Speech/Acoustic Signal Processing)(Digital Signal Processing)
- Convolutive blind source separation for more than two sources in the frequency domain
- Evaluation of separation and dereverberation performance in frequency domain blind source separation
- Underdetermined Blind Separation of Convolutive Mixtures of Speech Using Time-Frequency Mask and Mixing Matrix Estimation(Blind Source Separation, Multi-channel Acoustic Signal Processing)
- Sparse source separation based on simultaneous clustering of source locational and spectral features
- Stereophonic acoustic echo cancellation : An overview and recent solutions
- Polar Coordinate Based Nonlinear Function for Frequency-Domain Blind Source Separation