Common Acoustical Pole Estimation from Multi-Channel Musical Audio Signals (Engineering Acoustics)
Abstract
This paper describes a method for estimating the amplitude characteristics of the poles common to multiple room transfer functions from musical audio signals received by multiple microphones. Because these poles correspond to the room resonances, knowing their characteristics would make audio equalizers easier to adjust. It has been proven that the poles can be estimated precisely when the source signal is white. When the source signal is colored, however, as a musical audio signal is, the estimate is degraded by the frequency characteristics of the source signal itself. In this paper, we regard the amplitude spectrum of a musical audio signal as consisting of an envelope and a fine structure. We assume that musical pieces can be classified into several categories according to their average amplitude spectral envelopes. Under this assumption, the amplitude spectral envelope of the musical audio signal can be obtained from prior knowledge of the average amplitude spectral envelope of the category into which the target piece is classified. The fine structure, in turn, is identified based on its time variance. By removing both the spectral envelope and the fine structure from the amplitude spectrum estimated with the conventional method, the amplitude characteristics of the acoustical poles can be extracted. Simulation results for 20 popular songs showed that the method estimated the amplitude characteristics of the acoustical poles with a spectral distortion of 3.11 dB. In particular, most of the spectral peaks corresponding to the room resonance modes were successfully detected.
- Published by the Institute of Electronics, Information and Communication Engineers (IEICE)
- 2006-01-01
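The removal step and the spectral-distortion figure mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names, the synthetic spectra, and the definition of spectral distortion as the RMS difference between two amplitude spectra on a dB scale are all assumptions made for illustration, and the record does not state which definition the paper uses.

```python
import numpy as np


def spectral_distortion_db(ref_amp, est_amp, eps=1e-12):
    """RMS difference between two amplitude spectra on a dB scale.

    Assumed definition; the paper's exact formula is not given in this record.
    """
    ref_db = 20.0 * np.log10(np.maximum(ref_amp, eps))
    est_db = 20.0 * np.log10(np.maximum(est_amp, eps))
    return float(np.sqrt(np.mean((ref_db - est_db) ** 2)))


def remove_source_coloration(observed_amp, envelope_amp, fine_amp, eps=1e-12):
    """Divide out the source envelope and fine structure.

    Division of amplitudes is the same as subtraction of log spectra.
    """
    return observed_amp / np.maximum(envelope_amp * fine_amp, eps)


# Synthetic toy data (hypothetical, not taken from the paper).
freqs = np.linspace(0.0, 1.0, 256)                           # normalized frequency
pole_amp = 1.0 + 0.8 * np.exp(-((freqs - 0.3) / 0.02) ** 2)  # one resonance peak
envelope = 1.0 / (1.0 + 4.0 * freqs)                         # assumed category envelope
fine = 1.0 + 0.3 * np.cos(2.0 * np.pi * 20.0 * freqs)        # assumed fine structure
observed = pole_amp * envelope * fine                        # colored estimate

recovered = remove_source_coloration(observed, envelope, fine)
print("distortion before removal [dB]:", spectral_distortion_db(pole_amp, observed))
print("distortion after removal  [dB]:", spectral_distortion_db(pole_amp, recovered))
```

In this toy case the envelope and fine structure are known exactly, so the distortion after removal is essentially zero. In the setting the abstract describes, both must be estimated (the envelope from the piece's category, the fine structure from its time variance), which is where the reported residual distortion comes from.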
Authors
-
OKUNO HIROSHI
Departments of Urology, Kyoto University Graduate School of Medicine
-
Okuno Hiroshi (奥乃 博)
Kyoto University
-
Okuno Hiroshi
Graduate School of Informatics, Kyoto University
-
Okuno Hiroshi (奥乃 博)
Graduate School of Informatics, Kyoto University
-
YOSHIOKA Takuya
Department of Applied Physics, Osaka University
-
HIKICHI Takafumi
NTT Communication Science Laboratories, NTT Corporation
-
MIYOSHI Masato
NTT Communication Science Laboratories, NTT Corporation
-
Yoshioka Takuya
Department of Applied Chemistry, Graduate School of Engineering, Tohoku University
-
Yoshioka Takuya
Department of Intelligence Science and Technology, Graduate School of Informatics, Kyoto University
-
Okuno Hiroshi
Department of Orthopaedic Surgery, Tohoku University School of Medicine
-
Okuno Hiroshi
Department of Applied Materials Science, Faculty of Engineering, Osaka Prefecture University
-
Miyoshi Masato
NTT Communication Science Laboratories
-
Hikichi Takafumi
NTT Communication Science Laboratories