Voice Activity Detection Based on Generalized Normal-Laplace Distribution Incorporating Conditional MAP
Abstract
In this paper, we propose a novel voice activity detection (VAD) algorithm based on the generalized normal-Laplace (GNL) distribution to provide enhanced performance in adverse noise environments. Specifically, the probability density function (PDF) of a noisy speech signal is represented by the GNL distribution, and the variances of the speech and noise components of the GNL distribution are estimated using higher-order moments. From an in-depth analysis of the estimated variances, a feature that is useful for discriminating between speech and noise at low signal-to-noise ratios (SNRs) is derived and compared to a threshold to detect speech activity. To account for the inter-frame correlation of speech activity, the decision made for the previous frame is employed in the decision rule of the proposed VAD algorithm. The performance of the proposed VAD algorithm is evaluated in terms of receiver operating characteristics (ROC) and detection accuracy. The results show that the proposed method outperforms conventional VAD algorithms.
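The decision rule described in the abstract, a per-frame feature compared to a threshold with the previous frame's result feeding into the decision, can be illustrated with a minimal sketch. This is not the paper's implementation: the function name, the two threshold parameters, and the assumption that a larger feature value indicates speech are all hypothetical, and the GNL-based feature itself is not reproduced here.

```python
from typing import List, Sequence


def conditional_threshold_vad(
    features: Sequence[float],
    thr_after_noise: float,
    thr_after_speech: float,
) -> List[int]:
    """Return one decision per frame: 1 for speech, 0 for noise.

    Assumptions (not taken from the paper):
    - a larger feature value means speech is more likely;
    - the threshold is chosen according to the previous frame's decision;
    - the first frame is treated as if it followed a noise frame.
    """
    decisions: List[int] = []
    previous = 0  # assumed initial state: noise
    for value in features:
        # Pick the threshold conditioned on the previous decision.
        threshold = thr_after_speech if previous == 1 else thr_after_noise
        previous = 1 if value > threshold else 0
        decisions.append(previous)
    return decisions
```

Setting `thr_after_speech` below `thr_after_noise` makes the detector more willing to stay in the speech state once speech has been detected, which is one simple way to reflect inter-frame correlation.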
Authors
- Lee Sangmin
  Department Of Anesthesiology And Pain Medicine Samsung Medical Center Sungkyunkwan University School
- SONG Ji-Hyun
  Department of Electronic Engineering, Inha University