Speech Enhancement Using Nonlinear Microphone Array Based on Complementary Beamforming (Special Section on Digital Signal Processing)
This paper describes a spatial spectral subtraction method that uses a complementary beamforming microphone array to enhance noisy speech signals for speech recognition. Complementary beamforming is based on two types of beamformers designed to obtain directivity patterns that are complementary to each other. It is shown that nonlinear subtraction processing with complementary beamforming results in a kind of spectral subtraction that does not require speech pause detection. An optimization algorithm for the directivity pattern is also described. To evaluate the effectiveness of the method, speech enhancement experiments and speech recognition experiments are performed in computer simulations under both stationary and nonstationary noise conditions. In comparison with the optimized conventional delay-and-sum (DS) array, it is shown that (1) the proposed array improves the signal-to-noise ratio (SNR) of degraded speech by about 2 dB and performs more than 20% better in word recognition rate when white Gaussian noise with an input SNR of -5 or -10 dB is used, and (2) the proposed array performs more than 5% better in word recognition rate under nonstationary noise conditions. It is also shown that these improvements are the same as or superior to those of the conventional spectral subtraction method cascaded with the DS array.
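The subtraction step summarized in the abstract can be illustrated with a minimal sketch. It is one plausible reading of the idea, not the paper's exact formulation: the function name `spatial_spectral_subtraction`, the 1/2 scaling, and the `floor` parameter are assumptions made here for illustration, and the inputs `Yg` and `Yh` are assumed to be short-time spectra already produced by the two complementary beamformers. The sketch also assumes that both beamformers have equal gain in the look direction and that, for each noise direction, at least one of them has a null, so that the sum and difference signals carry equal noise power on average.

```python
import numpy as np

def spatial_spectral_subtraction(Yg, Yh, floor=0.0):
    """Sketch of power-domain subtraction between sum and difference beams.

    Yg, Yh : complex spectra from two (assumed) complementary beamformers.
    floor  : hypothetical flooring factor relative to the primary power.
    """
    Yg = np.asarray(Yg, dtype=complex)
    Yh = np.asarray(Yh, dtype=complex)

    # Primary (sum) signal: keeps the look direction under the assumptions above.
    primary = 0.5 * (Yg + Yh)
    # Reference (difference) signal: cancels the look direction, keeps the noise.
    reference = 0.5 * (Yg - Yh)

    # Subtract the reference power from the primary power.
    power = np.abs(primary) ** 2 - np.abs(reference) ** 2
    # Clip negative estimates, as is usual in spectral subtraction.
    power = np.maximum(power, floor * np.abs(primary) ** 2)

    # Reuse the phase of the primary signal for the enhanced spectrum.
    return np.sqrt(power) * np.exp(1j * np.angle(primary))
```

Because the noise reference comes from the difference beam in the same frame, no speech pause detection is needed to estimate it, which is the property the abstract highlights. The design of the beamformer weights themselves (the directivity pattern optimization) is outside this sketch.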
- Paper published by the Institute of Electronics, Information and Communication Engineers (IEICE)
- 1999-08-25
Department of Nuclear Engineering, School of Engineering, Tokai University
Takeda K
Nagoya Univ. Nagoya Jpn
Takeda Kazuya
Department Of Information Electronics Graduate School Of Engineering Nagoya University
Department of Dermatology, Kagoshima University Graduate School of Medical and Dental Sciences
Kajita S
Center For Information Media Studies Nagoya University
Takeda K
Center For Integrated Acoustic Information Research Graduate School Of Engineering Nagoya University
Graduate School of Information Science, Nara Institute of Science and Technology
Saruwatari Hiroshi
Graduate School Of Information Science Nara Institute Of Science And Technology
Saruwatari Hiroshi
Department Of Dermatology Kagoshima University Graduate School Of Medical And Dental Sciences
Saruwatari H
Graduate School Of Information Science Nara Institute Of Science And Technology
ITAKURA Fumitada
Center for Information Media Studies, Nagoya University
Itakura F
Graduate School Of Information Engineering Meijo University
Itakura Fumitada
Center For Information Media Studies Nagoya University
- Basic Experiments on a Gas Divertor Using Magnetized Sheet Plasma
- CENSREC-1-C : An evaluation framework for voice activity detection under noisy environments
- Genetic analysis of the ferrochelatase gene in eight Japanese patients from seven families with erythropoietic protoporphyria
- Building an Effective Speech Corpus by Utilizing Statistical Multidimensional Scaling Method
- Cost Reduction of Acoustic Modeling for Real-Environment Applications Using Unsupervised and Selective Training
- Effect of Central Limit Theorem non-compliance on blind separation of speech by negentropy maximization
- Robots that can hear, understand and talk
- Probability Distribution of Time-Series of Speech Spectral Components(Audio/Speech Coding)(Applications and Implementations of Digital Signal Processing)
- AURORA-2J: An Evaluation Framework for Japanese Noisy Speech Recognition(Speech Corpora and Related Topics, Corpus-Based Speech Technologies)
- CENSREC-3: An Evaluation Framework for Japanese Speech Recognition in Real Car-Driving Environments(Speech and Hearing)
- Non-Audible Murmur (NAM) Recognition Exploiting Adaptation Techniques
- MC-32 Development of microdrive assembly process
- Multiple Regression of Log Spectra for In-Car Speech Recognition Using Multiple Distributed Microphones(Feature Extraction and Acoustic Modeling, Corpus-Based Speech Technologies)
- Objective sound quality evaluation for combination method of beamforming and spectral subtraction (Applied Acoustics)
- Fast Convergence Blind Source Separation Using Frequency Subband Interpolation by Null Beamforming
- Rapid Compensation of Temperature Fluctuation Effect for Multichannel Sound Field Reproduction System
- Development, Long-Term Operation and Portability of a Real-Environment Speech-Oriented Guidance System
- On-Line Relaxation Algorithm Applicable to Acoustic Fluctuation for Inverse Filter in Multichannel Sound Reproduction System(Sound Field Reproduction, Multi-channel Acoustic Signal Processing)
- Iterative Inverse Filter Relaxation Algorithm for Adaptation to Acoustic Fluctuation in Sound Reproduction System
- Sound Reproduction System Including Adaptive Compensation of Temperature Fluctuation Effect for Broad-Band Sound Control(Special Section on Digital Signal Processing)
- Cutaneous Mycobacterium intracellulare infection in a bone marrow transplantation recipient
- Interface for Barge-in Free Spoken Dialogue System Combining Adaptive Sound Field Control and Microphone Array(Speech and Hearing)
- SNR and sub-band SNR estimation based on Gaussian mixture modeling in the log power domain with application for speech enhancements (6th Spoken Language Symposium)
- Driver's irritation detection using speech recognition results (Speech, 10th Spoken Language Symposium)
- Driver's irritation detection using speech recognition results (Spoken Language Information Processing)
- Driver's irritation detection using speech recognition results (Language Understanding and Communication, 10th Spoken Language Symposium)
- A Self-Generator Method for Initial Filters of SIMO-ICA Applied to Blind Separation of Binaural Sound Mixtures(Blind Source Separation, Multi-channel Acoustic Signal Processing)
- Multistage SIMO-Model-Based Blind Source Separation Combining Frequency-Domain ICA and Time-Domain ICA(Adaptive Signal Processing and Its Applications)
- Estimation of Frequency Components Contained in Subbands Based on Instantaneous Frequency
- Lack of Interaction Between Cefdinir and Calcium Polycarbophil : In vitro and In vivo Studies
- Predicting the Degradation of Speech Recognition Performance from Sub-band Dynamic Ranges (Special Issue: Spoken Language Information Processing and Its Applications)
- A model of perceptual distance for group delays based on ellipsoidal mapping
- The effect of group delay spectrum on timbre
- Direction of Arrival Estimation Using Nonlinear Microphone Array
- Speech Enhancement Using Nonlinear Microphone Array Based on Noise Adaptive Complementary Beamforming
- Speech Enhancement Using Nonlinear Microphone Array Based on Complementary Beamforming (Special Section on Digital Signal Processing)
- Noise Robust Speech Recognition Using Subband-Crosscorrelation Analysis
- An Acoustically Oriented Vocal-Tract Model
- Evaluation of Extremely Small Sound Source Signals Used in Speaking-Aid System with Statistical Voice Conversion
- Adaptive Nonlinear Regression Using Multiple Distributed Microphones for In-Car Speech Recognition(Speech Enhancement, Multi-channel Acoustic Signal Processing)
- Improvements of the One-to-Many Eigenvoice Conversion System
- Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models
- Adaptive Training for Voice Conversion Based on Eigenvoices
- Blind Separation and Deconvolution for Convolutive Mixture of Speech Combining SIMO-Model-Based ICA and Multichannel Inverse Filtering(Engineering Acoustics)
- Subband-Based Blind Separation for Convolutive Mixtures of Speech(Engineering Acoustics)
- Overdetermined Blind Separation for Real Convolutive Mixtures of Speech Based on Multistage ICA Using Subarray Processing(Speech/Acoustic Signal Processing)(Digital Signal Processing)
- Stable Learning Algorithm for Blind Separation of Temporally Correlated Acoustic Signals Combining Multistage ICA and Linear Prediction(Digital Signal Processing)
- Blind Source Separation of Acoustic Signals Based on Multistage ICA Combining Frequency-Domain ICA and Time-Domain ICA
- Fast-Convergence Algorithm for Blind Source Separation Based on Array Signal Processing
- An Iterative Inverse Filter Design Method for the Multichannel Sound Field Reproduction System(Special Section on Acoustic Signal Processing)
- On the use of two-mass vocal cord model in characterizing the stress speech (Speech)
- Particle Size Distribution Measurement of Free-Falling Fine Particles in a Dusty Plasma Experiment
- Semi-Blind Optimization Scheme of Joint Suppression of Background Noise and Late Reverberation
- Relaxation behavior of laser-peening residual stress under tensile loading investigated by X-ray and neutron diffraction