SEMI-SYLLABLE UNITS FOR ROBUST INDEPENDENT SPEAKER IDENTIFICATION
Publish Year: 1395
Document type: Conference paper
Language: English
This paper is 11 pages long and is available in PDF format.
National scientific document ID:
DSCONF03_034
Indexing date: 19 Khordad 1396
Abstract:
In this study, robust text-independent speaker identification is investigated. Syllable and semi-syllable boundaries are automatically detected in Farsi continuous speech utterances using the short-term energy contour and the discrete wavelet transform (DWT). While the entire syllable is taken as the unit for prosody, wavelet entropy coefficients are extracted from two overlapping semi-syllables that distinctly reflect the consonant/vowel (CV) and/or vowel/consonant (VC) transitions. The feature vector consists of long-term prosodic features, i.e. rational syllable nuclei duration, mean energy, pitch frequency, and four formants, together with the concatenated wavelet entropy coefficients at depth four. Classification is performed by a feed-forward perceptron neural network (FFPNN) with two hidden layers. Experiments conducted on the Farsi speech dataset (FarsDat) confirm that the proposed method improves speaker identification accuracy at different signal-to-noise ratios compared with conventional methods.
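As a rough illustration of two building blocks named in the abstract, the following Python sketch computes a short-term energy contour and depth-four wavelet entropy values for a speech segment. It is not the author's implementation: the abstract does not state the wavelet family, the frame sizes, or the exact entropy definition, so the Haar wavelet, the Shannon entropy of normalized squared coefficients, and all function names here are assumptions made only for this example.

```python
import math

def short_term_energy(signal, frame_len, hop):
    """Energy contour: sum of squared samples in each frame."""
    return [
        sum(x * x for x in signal[start:start + frame_len])
        for start in range(0, len(signal) - frame_len + 1, hop)
    ]

def haar_dwt(signal, depth):
    """Haar DWT (assumed wavelet): [detail_1, ..., detail_depth, approximation]."""
    approx = list(signal)
    bands = []
    for _ in range(depth):
        if len(approx) % 2:  # pad odd-length input by repeating the last sample
            approx.append(approx[-1])
        pairs = list(zip(approx[0::2], approx[1::2]))
        bands.append([(a - b) / math.sqrt(2) for a, b in pairs])
        approx = [(a + b) / math.sqrt(2) for a, b in pairs]
    bands.append(approx)
    return bands

def shannon_entropy(coeffs):
    """Shannon entropy of the normalized squared coefficients of one subband."""
    energies = [c * c for c in coeffs]
    total = sum(energies)
    if total == 0:
        return 0.0
    return -sum((e / total) * math.log(e / total) for e in energies if e > 0)

def wavelet_entropy_features(segment, depth=4):
    """One entropy value per subband (assumed feature layout)."""
    return [shannon_entropy(band) for band in haar_dwt(segment, depth)]
```

Under these assumptions, calling `wavelet_entropy_features` on each of the two overlapping semi-syllables and concatenating the results would give the wavelet part of the feature vector; the prosodic features and the neural network classifier are outside the scope of this sketch.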
Keywords:
Authors
Hamed Aghili
Department of Computer and Information Technology (Robotic Engineering), Payame Noor University (PNU), Iran