Supervised Single-Channel Speech Separation via Sparse Decomposition Using Periodic Signal Models
Abstract
In this paper, we propose a method for supervised single-channel speech separation through sparse decomposition using periodic signal models. The method decomposes a signal into a set of periodic signals under a sparsity penalty. To achieve separation, each decomposed periodic signal must then be assigned to its source. For this assignment, we cluster the decomposed periodic signals with the K-means algorithm into as many clusters as there are speakers. After clustering, each cluster is assigned to its speaker using codebooks learned in advance. In separation experiments, we compare our method with MaxVQ, which performs separation in the frequency spectrum domain. The results, measured by signal-to-distortion ratio, show that the proposed sparse decomposition method is comparable to the frequency-domain approach and has a lower computational cost for assigning speech components.
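The clustering and assignment steps described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say which features describe each periodic component, which distance is used, or how clusters are matched to codebooks. Here each component is assumed to be summarized by a hypothetical two-dimensional feature (period in samples, amplitude), Euclidean distance is assumed, and clusters are matched one-to-one to speakers by minimizing the total quantization distortion over all permutations. The names `kmeans`, `distortion`, and `assign_clusters` are hypothetical.

```python
from itertools import permutations

import numpy as np


def nearest(points, codewords):
    """Index of the closest codeword (Euclidean) for every point."""
    d = np.linalg.norm(points[:, None, :] - codewords[None, :, :], axis=2)
    return d.argmin(axis=1)


def kmeans(features, k, n_iter=50):
    """Plain Lloyd's K-means with a deterministic farthest-point start."""
    features = np.asarray(features, dtype=float)
    centroids = [features[0]]
    for _ in range(1, k):
        # Next seed: the point farthest from all seeds chosen so far.
        d = np.min([np.linalg.norm(features - c, axis=1) for c in centroids], axis=0)
        centroids.append(features[d.argmax()])
    centroids = np.array(centroids)
    for _ in range(n_iter):
        labels = nearest(features, centroids)
        for j in range(k):
            members = features[labels == j]
            if len(members):
                centroids[j] = members.mean(axis=0)
    return nearest(features, centroids), centroids


def distortion(points, codebook):
    """Mean distance from each point to its nearest codeword."""
    if len(points) == 0:
        return 0.0
    d = np.linalg.norm(points[:, None, :] - codebook[None, :, :], axis=2)
    return float(d.min(axis=1).mean())


def assign_clusters(features, labels, codebooks):
    """Map cluster index -> speaker index (assumed one-to-one matching)."""
    k = len(codebooks)
    cost = np.array([[distortion(features[labels == c], codebooks[s])
                      for s in range(k)] for c in range(k)])
    best = min(permutations(range(k)),
               key=lambda p: sum(cost[c, p[c]] for c in range(k)))
    return dict(enumerate(best))


# Toy data (made up for illustration): six periodic components, two speakers.
feats = np.array([[80.0, 1.0], [82.0, 0.9], [79.0, 1.1],
                  [40.0, 0.5], [41.0, 0.6], [39.0, 0.4]])
# Hypothetical codebooks, one per speaker, standing in for trained ones.
codebooks = [np.array([[81.0, 1.0], [78.0, 1.0]]),
             np.array([[40.0, 0.5], [42.0, 0.5]])]

labels, _ = kmeans(feats, k=len(codebooks))
mapping = assign_clusters(feats, labels, codebooks)
speaker_of_component = np.array([mapping[int(c)] for c in labels])
print(speaker_of_component)
```

Each speaker's signal would then be rebuilt by summing the periodic components assigned to that speaker. Note that only one codebook comparison per cluster and speaker is needed in this sketch, rather than one per time-frequency bin, which is in the spirit of the lower assignment cost the abstract reports.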
Authors
- Iiguni Youji, Graduate School of Engineering Science, Osaka University
- Nakashizuka Makoto, Graduate School of Bio-application and Systems Engineering, Tokyo University of Agriculture and Techn
- Okumura Hiroyuki, Graduate School of Engineering Science, Osaka University
Related papers
- Improvement of Speech Quality in Distance-Based Howling Canceller
- Moment Invariants of the Weighted Image
- Speech Enhancement Based on MAP Estimation Using a Variable Speech Distribution(Papers Selected from the 21st Symposium on Signal Processing)
- Image Enlargement by Nonlinear Frequency Extrapolation with Morphological Operators
- Shift-Invariant Sparse Image Representations Using Tree-Structured Dictionaries
- Edge Enhancement by the Wavelet Maxima and Its Application to Image Enlargement
- A Sparse Decomposition Method for Periodic Signal Mixtures
- Image Contour Clustering by Vector Quantization on Multiscale Gradient Planes and Its Application to Image Coding(Special Section on Digital Signal Processing)
- A High Speech Quality Distance-Based Howling Canceller with Adaptive Cascade Notch Filter and Silent Pilot Signal
- Convergence Vectors in System Identification with an NLMS Algorithm for Sinusoidal Inputs
- Convergence vector of normalized least-mean-square algorithm for predicting deterministic sinusoidal signals
- Supervised Single-Channel Speech Separation via Sparse Decomposition Using Periodic Signal Models
- Stationary and Non-stationary Wide-Band Noise Reduction Using Zero Phase Signal
- Single channel blind source separation of deterministic sinusoidal signals with independent component analysis
- An Adaptation Method for Morphological Opening Filters with a Smoothness Penalty on Structuring Elements
- A Comb Filter with Adaptive Notch Gain and Bandwidth
- An Adaptive Comb Filter with Flexible Notch Gain