Robust Multipitch Analyzer against Initialization based on Latent Harmonic Allocation using Overtone Corpus
Abstract
We present a Bayesian analysis method that estimates the harmonic structure of musical instruments in music signals on the basis of psychoacoustic evidence. Since the main objective of multipitch analysis is joint estimation of the fundamental frequencies and their harmonic structures, the performance of harmonic structure estimation significantly affects fundamental frequency estimation accuracy. Many methods have been proposed for estimating the harmonic structure accurately, but no method has been proposed that satisfies all these requirements: robust against initialization, optimization-free, and psychoacoustically appropriate and thus easy to develop further. Our method satisfies these requirements by explicitly incorporating Terhardt's virtual pitch theory within a Bayesian framework. It does this by automatically learning the valid weight range of the harmonic components using a MIDI synthesizer. The bounds are termed "overtone corpus." Modeling demonstrated that the proposed overtone corpus method can stably estimate the harmonic structure of 40 musical pieces for a wide variety of initial settings.
Authors
- Itoyama Katsutoshi, Graduate School of Informatics, Kyoto University
- Ogata Tetsuya, Graduate School of Informatics, Kyoto University, Yoshida-honmachi, Sakyo-ku, Kyoto 606-8501, Japan
- Okuno Hiroshi, Graduate School of Informatics, Kyoto University
- Sakaue Daichi, Graduate School of Informatics, Kyoto University
Related papers
- 5R-5 A Music Retrieval Approach from Alternative Genres of Query by Adjusting Instrument Volume
- Inter-modality mapping in robot with recurrent neural network
- 4R-3 Probabilistic Classification of Monophonic Instrument Playing Techniques
- Predicting Object Dynamics From Visual Images Through Active Sensing Experiences
- Experience-based imitation using RNNPB
- Drumix: an audio player with real-time drum-part rearrangement functions for active music listening (Special Issue: Principles and Applications of Interaction Technology)
- Instrogram: Probabilistic Representation of Instrument Existence for Polyphonic Music (Special Issue: Convenient and Familiar Music Information Processing)
- Dynamic Communication of Humanoid Robot with Multiple People Based on Interaction Distance (Special Issue of Papers: Information Systems Coexisting with Humans)
- Acquisition of Motion Primitives of Robot in Human-Navigation Task: Towards Human-Robot Interaction based on "Quasi-Symbols"
- Open-end human-robot interaction from the dynamical systems perspective: mutual adaptation and incremental learning
- Dynamic Communication of Humanoid Robot with Multiple People Based on Interaction Distance
- Reinforcement learning of a continuous motor sequence with hidden states
- Dynamic perception after visually guided grasping by a human-like autonomous robot
- 1ZN-2 Score Following by Particle Filtering for Music Robots
- Drumix: An Audio Player with Real-time Drum-part Rearrangement Functions for Active Music Listening
- Common Acoustical Pole Estimation from Multi-Channel Musical Audio Signals (Engineering Acoustics)
- Target Speech Detection and Separation for Communication with Humanoid Robots in Noisy Home Environments
- Self-organization of Dynamic Object Features Based on Bidirectional Training
- Human Tracking System Integrating Sound and Face Localization Using an Expectation-Maximization Algorithm in Real Environments
- Selecting Help Messages by Using Robust Grammar Verification for Handling Out-of-Grammar Utterances in Spoken Dialogue Systems
- Design and Implementation of Robot Audition System 'HARK' - Open Source Software for Listening to Three Simultaneous Speakers
- Automatic Allocation of Training Data for Speech Understanding Based on Multiple Model Combinations
- Robust Multipitch Analyzer against Initialization based on Latent Harmonic Allocation using Overtone Corpus
- A Musical Robot that Synchronizes with a Coplayer Using Non-Verbal Cues
- Classification of Known and Unknown Environmental Sounds Based on Self-Organized Space Using a Recurrent Neural Network