Voiced-Unvoiced-Silence Detection of Speech Signal using Combined Spectro-Temporal Features

Esfandian, Nafiseh

doi:10.30495/fomj.2022.1961029.1071

Fuzzy Optimization and Modeling Journal
Article 1, Volume 3, Issue 2, Tir (June-July) 2022, Pages 1-14
Article type: Original Article
DOI: 10.30495/fomj.2022.1961029.1071
Author
Nafiseh Esfandian*
Department of Electrical Engineering, Qaemshahr Branch, Islamic Azad University, Qaemshahr, Iran
Abstract
This paper presents a new method for classifying the voiced, unvoiced, and silence segments of a speech signal. The proposed method segments speech using a combination of spectro-temporal features, which are extracted by clustering in the spectro-temporal domain: the multi-dimensional output of an auditory model is clustered using a weighted Gaussian mixture model. After the main clusters are extracted for each frame, combined spectro-temporal features such as cluster energy, the energy difference between clusters, and the minimum value of the normalized cross-correlation between clusters are used to detect the voiced, unvoiced, and silence regions of speech. Segmentation is performed by comparing each class of features with an appropriate threshold value. The combined spectro-temporal features are also used for speech segmentation in noisy conditions. The results demonstrate the performance of the proposed algorithm in comparison with other features for speech segmentation.
Keywords
Weighted Gaussian Mixture Model; Clustering; Speech Segmentation; Spectro-temporal Features
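
The threshold comparison described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not give the decision rule, the threshold values, or how clusters are represented. The sketch assumes each cluster of a frame is available as a plain vector, takes the normalized cross-correlation at zero lag, and uses hypothetical thresholds (`silence_thr`, `diff_thr`, `ncc_thr`) and a hypothetical rule ordering chosen only for illustration. All function names are likewise assumptions.

```python
import math
from itertools import combinations


def normalized_cross_correlation(x, y):
    """Zero-lag normalized cross-correlation of two equal-length vectors."""
    num = sum(a * b for a, b in zip(x, y))
    den = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    return num / den if den > 0 else 0.0


def frame_features(clusters):
    """Compute the three combined features for one frame.

    `clusters` is a list of at least two equal-length vectors, one per
    cluster (an assumed representation, not taken from the paper).
    """
    energies = [sum(v * v for v in c) for c in clusters]
    total_energy = sum(energies)
    energy_diff = max(energies) - min(energies)
    min_ncc = min(
        normalized_cross_correlation(a, b)
        for a, b in combinations(clusters, 2)
    )
    return total_energy, energy_diff, min_ncc


def classify_frame(clusters, silence_thr=0.01, diff_thr=0.5, ncc_thr=0.5):
    """Label a frame by comparing each feature with a threshold.

    The thresholds and the direction of each comparison are hypothetical.
    """
    total_energy, energy_diff, min_ncc = frame_features(clusters)
    if total_energy < silence_thr:
        return "silence"
    if energy_diff > diff_thr and min_ncc < ncc_thr:
        return "voiced"
    return "unvoiced"
```

In practice the thresholds would have to be tuned on labeled speech, and the cluster vectors would come from the weighted Gaussian mixture clustering of the auditory model output rather than being supplied by hand.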

