Subband-Based Blind Separation for Convolutive Mixtures of Speech(Engineering Acoustics)
スポンサーリンク
概要
- 論文の詳細を見る
We propose utilizing subband-based blind source separation (BSS) for convolutive mixtures of speech. This is motivated by the drawback of frequency-domain BSS, i.e., when a long frame with a fixed long frame-shift is used to cover reverberation, the number of samples in each frequency decreases and the separation performance is degraded. In subband BSS, (1) by using a moderate number of subbands, a sufficient number of samples can be held in each subband, and (2) by using FIR filters in each subband, we can manage long reverberation. We confirm that subband BSS achieves better performance than frequency-domain BSS. Moreover, subband BSS allows us to select a separation method suited to each subband. Using this advantage, we propose efficient separation procedures that consider the frequency characteristics of room reverberation and speech signals (3) by using longer unmixing filters in low frequency bands and (4) by adopting an overlap-blockshift in BSS's batch adaptation in low frequency bands. Consequently, frequency-dependent subband processing is successfully realized with the proposed subband BSS.
- 社団法人電子情報通信学会の論文
- 2005-12-01
著者
-
Saruwatari Hiroshi
Graduate School Of Information Science Nara Institute Of Science And Technology
-
Makino Shoji
NTT Communication Science Laboratories
-
NISHIKAWA Tsuyoki
Graduate School of Information Science, Nara Institute of Science and Technology
-
ARAKI Shoko
NTT Communication Science Laboratories, NTT Corporation
-
AICHNER Robert
University Erlangen-Nuremberg
-
Araki Shoko
Ntt Communication Science Laboratories Ntt Corporation
-
Nishikawa Tsuyoki
Graduate School Of Information Science Nara Institute Of Science And Technology
関連論文
- Building an Effective Speech Corpus by Utilizing Statistical Multidimensional Scaling Method
- Cost Reduction of Acoustic Modeling for Real-Environment Applications Using Unsupervised and Selective Training
- Effect of Central Limit Theorem non-compliance on blind separation of speech by negentropy maximization
- Robots that can hear, understand and talk
- Probability Distribution of Time-Series of Speech Spectral Components(Audio/Speech Coding)(Applications and Implementations of Digital Signal Processing)
- Non-Audible Murmur (NAM) Recognition Exploiting Adaptation Techniques
- Objective sound quality evaluation for combination method of beamforming and spectral subtraction (応用音響)
- Fast Convergence Blind Source Separation Using Frequency Subband Interpolation by Null Beamforming
- Rapid Compensation of Temperature Fluctuation Effect for Multichannel Sound Field Reproduction System
- Development, Long-Term Operation and Portability of a Real-Environment Speech-Oriented Guidance System
- On-Line Relaxation Algorithm Applicable to Acoustic Fluctuation for Inverse Filter in Multichannel Sound Reproduction System(Sound Field Reproduction, Multi-channel Acoustic Signal Processing)
- Iterative Inverse Filter Relaxation Algorithm for Adaptation to Acoustic Fluctuation in Sound Reproduction System
- Sound Reproduction System Including Adaptive Compensation of Temperature Fluctuation Effect for Broad-Band Sound Control(Special Section on Digital Signal Processing)
- Interface for Barge-in Free Spoken Dialogue System Combining Adaptive Sound Field Control and Microphone Array(Speech and Hearing)
- A desing of a hands-free communication unit using loudspeakers and microphones with a flat directional pattern
- A Self-Generator Method for Initial Filters of SIMO-ICA Applied to Blind Separation of Binaural Sound Mixtures(Blind Source Separation, Multi-channel Acoustic Signal Processing)
- Multistage SIMO-Model-Based Blind Source Separation Combining Frequency-Domain ICA and Time-Domain ICA(Adaptive Signal Processing and Its Applications)
- Subjective Assessment of the Desired Echo Return Loss for Subband Acoustic Echo Cancellers
- Direction of Arrival Estimation Using Nonlinear Microphone Array
- Speech Enhancement Using Nonlinear Microphone Array Based on Noise Adaptive Complementary Beamforming
- Speech Enhancement Using Nonlinear Microphone Array Based on Complementary Beamforming (Special Section on Digital Signal Processing)
- Evaluation of Extremely Small Sound Source Signals Used in Speaking-Aid System with Statistical Voice Conversion
- Improvements of the One-to-Many Eigenvoice Conversion System
- Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models
- Adaptive Training for Voice Conversion Based on Eigenvoices
- Blind Separation and Deconvolution for Convolutive Mixture of Speech Combining SIMO-Model-Based ICA and Multichannel Inverse Filtering(Engineering Acoustics)
- High-Fidelity Blind Separation of Acoustic Signals Using SIMO-Model-Based Independent Component Analysis(Engineering Acoustics)
- FOREWORD
- Subband-Based Blind Separation for Convolutive Mixtures of Speech(Engineering Acoustics)
- Estimating the number of sources using independent component analysis
- Blind Source Separation of Convolutive Mixtures of Speech in Frequency Domain(Multi-channel Acoustic Signal Processing)
- Blind Source Separation for Moving Speech Signals Using Blockwise ICA and Residual Crosstalk Subtraction(Speech/Acoustic Signal Processing)(Digital Signal Processing)
- Convolutive blind source separation for more than two sources in the frequency domain
- Evaluation of separation and dereverberation performance in frequency domain blind source separation
- Overdetermined Blind Separation for Real Convolutive Mixtures of Speech Based on Multistage ICA Using Subarray Processing(Speech/Acoustic Signal Processing)(Digital Signal Processing)
- Stable Learning Algorithm for Blind Separation of Temporally Correlated Acoustic Signals Combining Multistage ICA and Linear Prediction(Digital Signal Processing)
- Blind Source Separation of Acoustic Signals Based on Multistage ICA Combining Frequency-Domain ICA and Time-Domain ICA
- Fast-Convergence Algorithm for Blind Source Separation Based on Array Signal Processing
- An Iterative Inverse Filter Design Method for the Multichannel Sound Field Sound Field Reproduction System(Special Section on Acoustic Signal Processing)
- Underdetermined Blind Separation of Convolutive Mixtures of Speech Using Time-Frequency Mask and Mixing Matrix Estimation(Blind Source Separation, Multi-channel Acoustic Signal Processing)
- Sparse source separation based on simultaneous clustering of source locational and spectral features
- Stereophonic acoustic echo cancellation : An overview and recent solutions
- Polar Coordinate Based Nonlinear Function for Frequency-Domain Blind Source Separation
- Semi-Blind Optimization Scheme of Joint Suppression of Background Noise and Late Reverberation