Selective Listening Point Audio Based on Blind Signal Separation and Stereophonic Technology
スポンサーリンク
概要
- 論文の詳細を見る
A sound field reproduction method is proposed that uses blind source separation and a head-related transfer function. In the proposed system, multichannel acoustic signals captured at distant microphones are decomposed to a set of location/signal pairs of virtual sound sources based on frequency-domain independent component analysis. After estimating the locations and the signals of the virtual sources by convolving the controlled acoustic transfer functions with each signal, the spatial sound is constructed at the selected point. In experiments, a sound field made by six sound sources is captured using 48 distant microphones and decomposed into sets of virtual sound sources. Since subjective evaluation shows no significant difference between natural and reconstructed sound when six virtual sources and are used, the effectiveness of the decomposing algorithm as well as the virtual source representation are confirmed.
- 2009-03-01
著者
-
Takeda Kazuya
Graduate School Of Information Science Nagoya University
-
NIWA Kenta
Graduate School of Information Science, Nagoya University
-
NISHINO Takanori
Center for Information Media Studies, Nagoya University
-
Nishino Takanori
Center For Information Media Studies Nagoya University
-
Niwa Kenta
Graduate School Of Information Science Nagoya University
-
Takeda Kazuya
Graduate School Of Information Science At Nagoya University
-
Nishino Takanori
Mie Univ. Tsu‐shi Jpn
-
TAKEDA Kazuya
Graduate School of Engineering, Nagoya University
関連論文
- AN INTEGRATED AUDIO-VISUAL VIEWER FOR A LARGE SCALE MULTIPOINT CAMERAS AND MICROPHONES(International Workshop on Advanced Image Technology 2007)
- AN INTEGRATED AUDIO-VISUAL VIEWER FOR A LARGE SCALE MULTIPOINT CAMERAS AND MICROPHONES
- G_007 Arbitrary Listening-point Generation Using Acoustic Transfer Function Interpolation in A Large Microphone Array
- THE SUB-BAND SOUND WAVE RAY-SPACE REPRESENTATION(International Workshop on Advanced Image Technology 2006)
- A-16-24 3D Sound Wave Field Representation Based on Ray-Space Method(A-16. マルチメディア・仮想環境基礎, 基礎・境界)
- Selective Listening Point Audio Based on Blind Signal Separation and Stereophonic Technology
- Head-Related Transfer Function measurement in sagittal and frontal coordinates
- Evaluation of HRTFs estimated using physical features
- Multiple Regression of Log Spectra for In-Car Speech Recognition Using Multiple Distributed Microphones(Feature Extraction and Acoustic Medelings, Corpus-Based Speech Technologies)
- Evaluation of Combinational Use of Discriminant Analysis-Based Acoustic Feature Transformation and Discriminative Training
- Acoustic Feature Transformation Based on Discriminant Analysis Preserving Local Structure for Speech Recognition
- Multichannel Speech Enhancement Based on Generalized Gamma Prior Distribution with Its Online Adaptive Estimation
- SNR and sub-band SNR estimation based on Gaussian mixture modeling in the log power domain with application for speech enhancements (第6回音声言語シンポジウム)
- SNR and sub-band SNR estimation based on Gaussian mixture modeling in the log power domain with application for speech enhancements (第6回音声言語シンポジウム)
- SNR and sub-band SNR estimation based on Gaussian mixture modeling in the log power domain with application for speech enhancements (第6回音声言語シンポジウム)
- Acoustic Feature Transformation Combining Average and Maximum Classification Error Minimization Criteria
- Driver's irritation detection using speech recognition results (音声・第10回音声言語シンポジウム)
- Driver's irritation detection using speech recognition results (音声言語情報処理)
- Driver's irritation detection using speech recognition results (言語理解とコミュニケーション・第10回音声言語シンポジウム)
- サブバンドに含まれる周波数成分の瞬時周波数に基づく推定
- Predicting the Degradation of Speech Recognition Performance from Sub-band Dynamic Ranges (特集 音声言語情報処理とその応用)
- Comparison of acoustic measures for evaluating speech recognition performance in an automobile
- Estimation of speaker and listener positions in a car using binaural signals
- Sound localization under conditions of covered ears on the horizontal plane
- Adaptive Nonlinear Regression Using Multiple Distributed Microphones for In-Car Speech Recognition(Speech Enhancement, Multi-channel Acoustic Signal Processing)
- Construction and Evaluation of a Large In-Car Speech Corpus(Speech Corpora and Related Topics, Corpus-Based Speech Technologies)
- Blind Source Separation Using Dodecahedral Microphone Array under Reverberant Conditions
- FOREWORD : Spercial Section on Robust Speech Processing in Realistic Environments
- Method for determining sound localization by auditory masking
- Classification of speech under stress by physical modeling
- Classification of speech under stress using physical features based on two-mass model