Cascaded Subband Energy-Based Emotion Classification
スポンサーリンク
概要
- 論文の詳細を見る
Since the earliest studies of human behavior, emotions have attracted attention of researchers in many disciplines, including psychology, neuroscience, and lately computer science. Speech is considered a salient conveyor of emotional cues, and can be used as an important source for emotional studies. Speech is modulated for different emotions by varying frequency- and energy-related acoustic parameters such as pitch, energy, and formants. In this paper, we explore analyzing inter- and intra-subband energy variations to differentiate six emotions. The emotions considered are anger, disgust, fear, happiness, neutral, and sadness. In this research, Two-Layered Cascaded Subband Cepstral Coefficients (TLCS-CC) analysis was introduced to study energy variations within low and high arousal emotions as a novel approach for emotion classification. The new approach was compared with Mel frequency cepstral coefficients (MFCC) and log frequency power coefficients (LFPC). Experiments were conducted on the Berlin Emotional Data Corpus (BECD). With energy-related features, we could achieve average accuracy of 73.9% and 80.1% for speaker-independent and -dependent emotion classification respectively.
- 電気学会 ; 1972-の論文
著者
-
COHEN Michael
Spatial Media Group at the University of Aizu
-
Silva Liyanage
Faculty of Science, University of Brunei Darussalam
-
Nwe Tin
Institute for Infocomm Research
-
Amarakeerthi Senaka
Spatial Media Group, University of Aizu
-
Morikawa Chamin
Interfaculty Initiative in Information Studies The University of Tokyo
関連論文
- Mobile narrowcasting control and display of spatial sound (マルチメディア通信と分散処理)
- Audio Narrowcasting and Privacy for Multipresent Avatars on Workstations and Mobile Phones(Artificial Reality and Telexistence)
- Cascaded Subband Energy-Based Emotion Classification
- Cascaded Subband Energy-Based Emotion Classification