A Correcting Method for Pitch Extraction Using Neural Networks (Special Section of Papers Selected from JTC-CSCC'93)
スポンサーリンク
概要
- 論文の詳細を見る
Pitch frequency is a basic characteristic of human voice, and pitch extraction is one of the most important studies for speech recognition. This paper describes a simple but effective technique to obtain correct pitch frequency from candidates (pitch candidates) extracted by the short-range auto-correlation function. The correction is performed by a neural network in consideration of the time continuation that is realized by referring to pitch candidates at previous frames. Since the neural network is trained by the back-propagation algorithm with training data, it adapts to any speaker and obtains good correction without sensitive adjustment and tuning. The pitch extraction was performed for 3 male and 3 female announcers, and the proposed method improves the percentage of correct pitch from 58.65% to 89.19%.
- 社団法人電子情報通信学会の論文
- 1994-06-25
著者
-
OGIHARA Akio
College of Engineering, University of Osaka Prefecture
-
Ogihara Akio
College Of Engineering Osaka Prefecture University
-
Ogihara Akio
College Of Engineering University Of Osaka Prefecture
-
Fukunaga Kunio
College Of Engineering University Of Osaka Prefecture
-
Fukunaga Kunio
College Of Engineering Osaka Prefecture University
関連論文
- An Analysis on Minimum Searching Principle of Chaotic Neural Network (Special Section of Selected Papers from the 8th Karuizawa Workshop on Circuits and Systems)
- A Study on Mouth Shape Features Suitable for HMM Speech Recognition Using Fusion of Visual and Auditory Information
- Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information : Special Section of Letters Selected from 1994 IEICE Spring Conference
- Asymmetric Neural Network and Its Application to Knapsack Problem
- An Improvement of the Pseudoinverse Rule with Diagonal Elements (Special Section of Papers Selected from JTC-CSCC'93)
- An Isolated Word Speech Recognition Using Fusion of Auditory and Visual Information (Special Section of Papers Selected from JTC-CSCC'95)
- Binary Neural Network with Negative Self-Feedback and Its Application to N-Queens Problem (Special Issue on Neurocomputing)
- An Extraction Method of Lip Shape for Independent Speaker
- An Isolated Word Speech Recognition Based on Fusion of Visual and Auditory Information Using 30-frame/s and 24-bit Color Image (Special Section on Digital Signal Processing)
- Object Recognition in Image Sequences with Hopfield Neural Network
- A Correcting Method for Pitch Extraction Using Neural Networks (Special Section of Papers Selected from JTC-CSCC'93)
- Object Recognition Using Model Relation Based on Fuzzy Logic