A flexible temporal decomposition-based spectral modification method using asymmetric Gaussian mixture model (音声)
スポンサーリンク
概要
- 論文の詳細を見る
This paper presents a new spectral modification method to deal with two drawbacks of standard spectral modification methods, insufficient smoothness of the modified spectra between frames and ineffective spectral modification. A speech analysis technique called Temporal Decomposition (TD), which decomposes speech into event targets and event functions, is used to effectively model the spectral evolution. Instead of modifying the speech spectra frame by frame, we only need to modify event targets and event functions. We then proposed a new method to model and modify an event target by using asymmetric Gaussian mixture model (AGMM). Experimental results show that the effectiveness of the proposed spectral modification method is verified in terms of the smoothness of modified speech and effective spectral modification.
- 社団法人電子情報通信学会の論文
- 2007-07-19
著者
-
Akagi Masato
School of Information Science, Japan Advanced Institute of Science and Technology
-
Nguyen Binh
School Of Information Sci. Japan Advanced Inst. Of Sci. And Technol.
-
Akagi Masato
School Of Information Science Japan Advanced Institute Of Science And Technology
-
Akagi Masato
School Of Information Sci. Japan Advanced Inst. Of Sci. And Technol. (jaist) 1-1 Asahidai Nomi Ishik
-
Nguyen Binh
School Of Information Science Japan Advanced Institute Of Science And Technology
-
赤木 正人
School Of Information Science Japan Advanced Institute Of Science And Technology
-
Akagi Masato
School Of Information Sci. Japan Advanced Inst. Of Sci. And Technol.
関連論文
- A DOA estimation algorithm based on equalization-cancellation theory (応用音響)
- A study on the LP-based blind model in restoring bone-conducted speech (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- An LP-based blind restoration method for improving intelligibility of bone-conducted speech (音声)
- A flexible spectral modification method based on temporal decomposition and Gaussian mixture model
- Limited error based event localizing temporal decomposition and its application to variable-rate speech coding
- 加法的に付加された雑音により生じた歪を評価するための聴覚特性を考慮したスペクトル歪
- A speech dereverberation method based on the MTF concept in power envelope restoration
- An improved method based on the MTF concept for restoring the power envelope from a reverberant signal
- A DOA estimation algorithm based on equalization-cancellation theory (応用音響)
- Effects of single-channel speech enhancement algorithms on Mandarin speech intelligibility (応用音響)
- Improvement of robustness using selective sound segregation for automatic speech recognition systems in noisy environments (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- LP-baesd method of blind restoration to improve intelligibility of bone-conducted speech
- A Noise Reduction System in Localized and Non-Localized Noise Environments
- Noise reduction method based on generalized subtractive beamformer
- Fundamental frequency estimation for noisy speech based on instantaneous amplitude and frequency
- A Noise Reduction Method Based on a Generalized Subtractive Beamformer
- Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems
- Sub-Band Temporal Envelope Restoration for ASR in Reverberation Environment (国際ワークショップ Frontiers in Speech and Hearing Research)
- A study on expressive speech and perception of semantic primitives: comparison between Taiwanese and Japanese (音声)
- A flexible temporal decomposition-based spectral modification method using asymmetric Gaussian mixture model (音声)
- A Study on Restoration of Bone-Conducted Speech with LPC-Based Model (国際ワークショップ Frontiers in Speech and Hearing Research)
- スペクトル包絡における個人性について
- A computational model of co-modulation masking release
- A method of signal extraction from noisy signal based on auditory scene analysis
- Modified Restricted Temporal Decomposition and Its Application to Low Rate Speech Coding
- Foreword to the special issue on "Applied Systems"
- Evaluations of TS-BASE for speech enhancement and binaural benefits preservation (応用音響)
- Adaptive β-order Generalized Spectral Subtraction for Speech Enhancement
- A Two-Microphone Noise Reduction Method in Highly Non-stationary Multiple-Noise-Source Environments
- A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
- 会長就任にあたって : 新たな四半世紀に向けて計画から実行へ
- 基本周波数パターンに含まれる個人性とその制御
- Adaptive equalization-cancellation model and its application to sound localization in noisy reverberant environments