Instrogram: Probabilistic Representation of Instrument Existence for Polyphonic Music (Special Issue: Convenient and Familiar Music Information Processing)
Abstract
This paper presents a new technique for recognizing musical instruments in polyphonic music. Since conventional musical instrument recognition in polyphonic music is performed notewise, i.e., for each note, accurate estimation of the onset time and fundamental frequency (F0) of each note is required. However, these estimations are generally not easy in polyphonic music, and thus estimation errors severely deteriorate the recognition performance. Without these estimations, our technique calculates the temporal trajectory of instrument existence probabilities for every possible F0. The instrument existence probability is defined as the product of a nonspecific instrument existence probability calculated using PreFEst and a conditional instrument existence probability calculated using hidden Markov models. The instrument existence probability is visualized as a spectrogram-like graphical representation called the instrogram and is applied to MPEG-7 annotation and instrumentation-similarity-based music information retrieval. Experimental results from both synthesized music and real performance recordings have shown that instrograms achieved MPEG-7 annotation (instrument identification) with a precision rate of 87.5% for synthesized music and 69.4% for real performances on average and that the instrumentation similarity measure reflected the actual instrumentation better than an MFCC-based measure.
- Paper of the Information Processing Society of Japan (IPSJ)
- 2007-01-15
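The product described in the abstract can be illustrated with a minimal sketch. It only multiplies the two factors for each time frame and F0 bin; it does not implement PreFEst or the hidden Markov models. The function name `instrogram`, the data layout, and all numeric values are assumptions made for illustration and do not come from the paper.

```python
from typing import Dict, List

Matrix = List[List[float]]  # indexed as [time_frame][f0_bin]


def instrogram(nonspecific: Matrix,
               conditional: Dict[str, Matrix]) -> Dict[str, Matrix]:
    """Multiply the two factors named in the abstract, per instrument.

    nonspecific[t][f]       : probability that some instrument sounds at
                              F0 bin f in frame t (PreFEst in the paper;
                              here simply given as numbers).
    conditional[name][t][f] : probability that the sound is instrument
                              `name`, given that something sounds there
                              (HMMs in the paper; here simply given).
    """
    result: Dict[str, Matrix] = {}
    for name, cond in conditional.items():
        result[name] = [
            [p_any * p_inst for p_any, p_inst in zip(row_any, row_inst)]
            for row_any, row_inst in zip(nonspecific, cond)
        ]
    return result


# Toy input: 2 time frames x 3 F0 bins, two instruments (made-up values).
nonspecific = [[0.9, 0.1, 0.0],
               [0.8, 0.5, 0.2]]
conditional = {
    "piano":  [[0.7, 0.5, 0.5], [0.25, 0.5, 0.5]],
    "violin": [[0.3, 0.5, 0.5], [0.75, 0.5, 0.5]],
}
gram = instrogram(nonspecific, conditional)
```

In this toy setup the conditional probabilities sum to one over the instruments at each cell, so summing `gram` over instruments recovers the nonspecific probability. Plotting each `gram[name]` with time on one axis and F0 on the other would give the spectrogram-like picture the abstract refers to.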
Authors
-
Goto Masataka
National Institute of Advanced Industrial Science and Technology (AIST)
-
Ogata Tetsuya
Department Of Mechanical Engineering School Of Science And Engineering Waseda University
-
OKUNO HIROSHI
Departments of Urology, Kyoto University Graduate School of Medicine
-
Ogata Tetsuya
Graduate School Of Informatics Kyoto Univ. Yoshida-honmachi Sakyo-ku 606-8501 Kyoto Jpn
-
Ogata Tetsuya
Kyoto Univ.
-
Okuno Hiroshi
Kyoto Univ.
-
Okuno Hiroshi
Graduate School Of Informatics Kyoto University
-
KOMATANI Kazunori
Department of Intelligence Science and Technology, Graduate School of Informatics, Kyoto University
-
KITAHARA TETSURO
Department of Intelligence Science and Technology, Graduate School of Informatics, Kyoto University
-
Okuno Hiroshi
Graduate School Of Informatics Kyoto University
-
Ogata Tetsuya
Graduate School Of Informatics Kyoto University
-
Kitahara Tetsuro
Department Of Intelligence Science And Technology Graduate School Of Informatics Kyoto University
-
Goto Masataka
National Institute Of Advanced Industrial Science And Technology
-
Okuno Hiroshi
Department Of Orthopaedic Surgery Tohoku University School Of Medicine
-
Okuno Hiroshi
Department Of Applied Materials Science Faculty Of Engineering Osaka Prefecture University
-
OGATA Tetsuya
Department of Intelligence Science and Technology, Graduate School of Informatics, Kyoto University
Related papers
- 5R-5 A Music Retrieval Approach from Alternative Genres of Query by Adjusting Instrument Volume
- Inter-modality mapping in robot with recurrent neural network
- Assessment of a protocol for prophylactic antibiotics to prevent perioperative infection in urological surgery : A preliminary study
- Living related renal transplantation for end-stage renal disease after liver transplantation from a brain-dead donor
- Retroperitoneoscopic ureterocutaneostomy for obstructive uropathy with advanced bladder cancer : A case report
- Immunocytochemical Detection of p53 in Cultures of Exfoliated Cells from Urine of Patients With Urothelial Cancers
- Emotional Communication between Humans and the Autonomous Robot WAMOEBA-2 (Waseda Amoeba) Which has the Emotion Model
- Anaphylaxis following administration of intravenous methylprednisolone sodium succinate in a renal transplant recipient
- Adult-onset idiopathic hypogonadotropic hypogonadism presented with erectile and ejaculatory disorder
- 4R-3 Probabilistic Classification of Monophonic Instrument Playing Techniques
- A spatial cognition model based on sensory and action prediction by a neural network model
- Per-operative frozen section examination of pelvic nodes is unnecessary for the majority of clinically localized prostate cancers in the prostate-specific antigen era
- Predicting Object Dynamics From Visual Images Through Active Sensing Experiences
- Experience-based imitation using RNNPB
- Drumix: an audio player with real-time drum-part rearrangement functions for active music listening (Special Issue: Principles and Applications of Interaction Technology)
- Instrogram: Probabilistic Representation of Instrument Existence for Polyphonic Music (Special Issue: Convenient and Familiar Music Information Processing)
- Dynamic Communication of Humanoid Robot with Multiple People Based on Interaction Distance (Special Issue of Papers: Information Systems in Symbiosis with Humans)
- 6T-7 Robot Musical Accompaniment : Real-time Synchronization using Visual Cue Recognition
- Acquisition of Motion Primitives of Robot in Human-Navigation Task : Towards Human-Robot Interaction based on "Quasi-Symbols"
- Open-end human-robot interaction from the dynamical systems perspective : mutual adaptation and incremental learning
- Dynamic Communication of Humanoid Robot with Multiple People Based on Interaction Distance
- Acoustic Cavitation in Water under Rare Gas Atmosphere
- Obstacle detection and distance estimation by AH using audible sound waves (Spatial Audio and Sound Field Control / General)
- Reinforcement learning of a continuous motor sequence with hidden states
- Dynamic perception after visually guided grasping by a human-like autonomous robot
- 1ZN-2 Score Following by Particle Filtering for Music Robots
- Blind estimation of room sound-field resonance frequencies from musical audio signals
- Drumix: An Audio Player with Real-time Drum-part Rearrangement Functions for Active Music Listening
- Human-robot non-verbal interaction empowered by real-time auditory and visual multiple-talker tracking
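- Acquisition of reflexive behaviors for communication robots using Interactive EC (Evolution, Learning, and Robotics 3)
- Human-robot interaction through motion imitation : Construction of a motion prediction system using hand trajectory data (Emotion, Kansei, and Embodiment)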
- Expression of Apg-1, a member of the Hsp110 family, in the human testis and sperm
- Diagram specific to sacroiliac joint pain site indicated by one-finger test
- Access Control by SPKI Certificate
- Common Acoustical Pole Estimation from Multi-Channel Musical Audio Signals(Engineering Acoustics)
- Target Speech Detection and Separation for Communication with Humanoid Robots in Noisy Home Environments
- Self-organization of Dynamic Object Features Based on Bidirectional Training
- Human Tracking System Integrating Sound and Face Localization Using an Expectation-Maximization Algorithm in Real Environments
- Recent studies on music information processing
- Selecting Help Messages by Using Robust Grammar Verification for Handling Out-of-Grammar Utterances in Spoken Dialogue Systems
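- 1P1-F11 Study of a customizable communication robot that asserts its own morphology
- 1P1-E21 Study of a communication robot with customizable hardware : Verifying the effect on users of the understandability of morphology-assertion behavior
- 1P1-G11 Development of a robot system with a mechanism for accepting or rejecting user customization (Communication Robots)
- 2P1-D18 Adaptation to changes in human subjectivity in a communication robot using IEC
- 2P2-G11 Control-circuit structure formation mechanism of the self-organizing circuit element SONE : Development of control-circuit structure in collision-avoidance learning of a mobile robot (Evolution, Learning, and Robotics)
- 2P1-G06 Motor learning model inspired by the brain's information-processing structure for object manipulation : Parallel learning of a body model and target motion trajectories (Brain Science, Neuroscience, and Robotics)
- 2P1-G05 Development of a neural controller for autonomous agents inspired by neuromodulatory function (Brain Science, Neuroscience, and Robotics)
- 2A1-B10 Control of a tendon-driven robot arm by a neural network : Learning an inverse model from random movements (Mobiligence)
- 2A1-E07 Proposal of a reinforcement learning method in continuous state space using CTRNN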
- Design and Implementation of Robot Audition System 'HARK'-Open Source Software for Listening to Three Simultaneous Speakers
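- 2P1-D19 Stepwise evolution of an autonomous system and emotional expression for human-robot communication
- 2A1-E03 Improving noise robustness in a group of self-organizing network elements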
- Micturitional disturbance due to labial adhesion as a cause of vaginal implantation of bladder urothelial carcinoma
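- 1P1-C19 Development of a self-morphology-asserting customizable robot : Verification of sound-based assertion behavior when requesting customization
- 1P1-C18 Study of a communication robot with customizable hardware : Motion generation based on estimating the user's understanding of the robot's ideal morphology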
- Automatic Allocation of Training Data for Speech Understanding Based on Multiple Model Combinations
- Robust Multipitch Analyzer against Initialization based on Latent Harmonic Allocation using Overtone Corpus
- Poly-β-Amino Acids. I. The Preparation of Phenyl Substituted β-Amino Acid Polymers
- A Musical Robot that Synchronizes with a Coplayer Using Non-Verbal Cues
- Towards Written Text Recognition Based on Handwriting Experiences Using a Recurrent Neural Network
- Parameter Estimation for Harmonic and Inharmonic Models by Using Timbre Feature Distributions
- Instrogram: Probabilistic Representation of Instrument Existence for Polyphonic Music
- Classification of Known and Unknown Environmental Sounds Based on Self-Organized Space Using a Recurrent Neural Network
- 2A1-D09 Development of a self-morphology-asserting customizable robot : Verifying the effect of improved user self-efficacy on interaction (Communication Robots)
- Musicream: Integrated Music-Listening Interface for Active, Flexible, and Unexpected Encounters with Musical Pieces