Classification of speech under stress using physical features based on two-mass model

概要

論文の詳細を見る
We propose the classification methods of speech under stress based on a physical model, which characterizes the vocal folds and the vocal tract. We use physical parameters estimated by fitting a two-mass model to real speech, and two type of dynamic parameters are proposed to represent the short-term and long-term dynamic changes in physical characteristics. Experimental results show that our proposed features are effective and achieved better classification performance than features derived from traditional methods.
一般社団法人電子情報通信学会の論文
2013-02-21

著者

Miyajima Chiyomi
Graduate School of Information Science, Nagoya University
Kitaoka Norihide
Graduate School of Information Science, Nagoya University
TAKEDA Kazuya
Graduate School of Engineering, Nagoya University
Yao Xiao
Graduate School of Information Science, Nagoya University
Jitsuhiro Takatoshi
Graduate School of Information Science, Nagoya University
Jitsuhiro Takatoshi
Graduate School of Information Science, Nagoya University:Department of Media Informatics, Aichi University of Technology

関連論文

AN INTEGRATED AUDIO-VISUAL VIEWER FOR A LARGE SCALE MULTIPOINT CAMERAS AND MICROPHONES(International Workshop on Advanced Image Technology 2007)
THE SUB-BAND SOUND WAVE RAY-SPACE REPRESENTATION(International Workshop on Advanced Image Technology 2006)
Selective Listening Point Audio Based on Blind Signal Separation and Stereophonic Technology
Head-Related Transfer Function measurement in sagittal and frontal coordinates
Evaluation of HRTFs estimated using physical features
Multichannel Speech Enhancement Based on Generalized Gamma Prior Distribution with Its Online Adaptive Estimation
SNR and sub-band SNR estimation based on Gaussian mixture modeling in the log power domain with application for speech enhancements (第6回音声言語シンポジウム)
SNR and sub-band SNR estimation based on Gaussian mixture modeling in the log power domain with application for speech enhancements (第6回音声言語シンポジウム)
SNR and sub-band SNR estimation based on Gaussian mixture modeling in the log power domain with application for speech enhancements (第6回音声言語シンポジウム)
Driver's irritation detection using speech recognition results (音声・第10回音声言語シンポジウム)
Driver's irritation detection using speech recognition results (音声言語情報処理)
Driver's irritation detection using speech recognition results (言語理解とコミュニケーション・第10回音声言語シンポジウム)
サブバンドに含まれる周波数成分の瞬時周波数に基づく推定
Predicting the Degradation of Speech Recognition Performance from Sub-band Dynamic Ranges (特集音声言語情報処理とその応用)
Comparison of acoustic measures for evaluating speech recognition performance in an automobile
Estimation of speaker and listener positions in a car using binaural signals
Sound localization under conditions of covered ears on the horizontal plane
Construction and Evaluation of a Large In-Car Speech Corpus(Speech Corpora and Related Topics, Corpus-Based Speech Technologies)
Blind Source Separation Using Dodecahedral Microphone Array under Reverberant Conditions
Method for determining sound localization by auditory masking
Classification of speech under stress by physical modeling
Classification of speech under stress using physical features based on two-mass model

Classification of speech under stress using physical features based on two-mass model

スポンサーリンク

概要

著者

関連論文

スポンサーリンク