تعداد نشریات | 418 |
تعداد شمارهها | 10,005 |
تعداد مقالات | 83,629 |
تعداد مشاهده مقاله | 78,555,193 |
تعداد دریافت فایل اصل مقاله | 55,727,791 |
Voiced-Unvoiced-Silence Detection of Speech Signal using Combined Spectro-Temporal Features | ||
Fuzzy Optimization and Modeling Journal | ||
مقاله 1، دوره 3، شماره 2، تیر 2022، صفحه 1-14 اصل مقاله (1.12 M) | ||
نوع مقاله: Original Article | ||
شناسه دیجیتال (DOI): 10.30495/fomj.2022.1961029.1071 | ||
نویسنده | ||
Nafiseh Esfandian* | ||
Department of Electrical Engineering, Qaemshahr Branch, Islamic Azad University, Qaemshahr, Iran | ||
چکیده | ||
This paper presents a new method for classification of voiced, unvoiced and silence segments of speech signal. In the proposed method, combination of spectro-temporal features is used for speech segmentation. Combined features are extracted using clustering in spectro-temporal domain. Multi-dimensional output of auditory model is clustered using weighted Gaussian mixture model. In this method, after extracting the main clusters for each frame, combined spectro-temporal features such as cluster’s energy, energy difference of clusters and minimum value of normalized cross-correlation between clusters are used for detection of voiced, unvoiced and silence regions of speech. In the proposed algorithm, speech segmentation is performed by comparing each class of features with the appropriate threshold value. Combined spectro-temporal features are used for speech segmentation in noisy conditions. The results demonstrate performance of the proposed algorithm comparing to the other features for speech segmentation. | ||
کلیدواژهها | ||
Weighted Gaussian Mixture Model؛ Clustering؛ Speech Segmentation؛ Spectro-temporal Features | ||
آمار تعداد مشاهده مقاله: 38 تعداد دریافت فایل اصل مقاله: 219 |