Classification of speech under stress using physical features based on two-mass model
Abstract
We propose methods for classifying speech under stress based on a physical model that characterizes the vocal folds and the vocal tract. We use physical parameters estimated by fitting a two-mass model to real speech, and we propose two types of dynamic parameters to represent the short-term and long-term dynamic changes in physical characteristics. Experimental results show that the proposed features are effective and achieve better classification performance than features derived from traditional methods.
- Paper published by the Institute of Electronics, Information and Communication Engineers (IEICE)
- 2013-02-21
Authors
- Miyajima Chiyomi
  Graduate School of Information Science, Nagoya University
- Kitaoka Norihide
  Graduate School of Information Science, Nagoya University
- Takeda Kazuya
  Graduate School of Engineering, Nagoya University
- Yao Xiao
  Graduate School of Information Science, Nagoya University
- Jitsuhiro Takatoshi
  Graduate School of Information Science, Nagoya University
- Jitsuhiro Takatoshi
  Graduate School of Information Science, Nagoya University; Department of Media Informatics, Aichi University of Technology
Related papers
- AN INTEGRATED AUDIO-VISUAL VIEWER FOR A LARGE SCALE MULTIPOINT CAMERAS AND MICROPHONES(International Workshop on Advanced Image Technology 2007)
- THE SUB-BAND SOUND WAVE RAY-SPACE REPRESENTATION(International Workshop on Advanced Image Technology 2006)
- Selective Listening Point Audio Based on Blind Signal Separation and Stereophonic Technology
- Head-Related Transfer Function measurement in sagittal and frontal coordinates
- Evaluation of HRTFs estimated using physical features
- Multichannel Speech Enhancement Based on Generalized Gamma Prior Distribution with Its Online Adaptive Estimation
- SNR and sub-band SNR estimation based on Gaussian mixture modeling in the log power domain with application for speech enhancements (6th Spoken Language Symposium)
- Driver's irritation detection using speech recognition results (Speech; 10th Spoken Language Symposium)
- Driver's irritation detection using speech recognition results (Spoken Language Information Processing)
- Driver's irritation detection using speech recognition results (Natural Language Understanding and Communication; 10th Spoken Language Symposium)
- Estimation of frequency components contained in sub-bands based on instantaneous frequency
- Predicting the Degradation of Speech Recognition Performance from Sub-band Dynamic Ranges (Special Issue: Spoken Language Information Processing and Its Applications)
- Comparison of acoustic measures for evaluating speech recognition performance in an automobile
- Estimation of speaker and listener positions in a car using binaural signals
- Sound localization under conditions of covered ears on the horizontal plane
- Construction and Evaluation of a Large In-Car Speech Corpus(Speech Corpora and Related Topics, Corpus-Based Speech Technologies)
- Blind Source Separation Using Dodecahedral Microphone Array under Reverberant Conditions
- Method for determining sound localization by auditory masking
- Classification of speech under stress by physical modeling
- Classification of speech under stress using physical features based on two-mass model