Robust Speech Recognition Based on Mixed Histogram Transform and Asymmetric Noise Suppression

Hassan Farsi; Samana Kuhimoghadam

Robust Speech Recognition Based on Mixed Histogram Transform and Asymmetric Noise Suppression

Publish place: majlesi Journal of Electrical Engineering، Vol: 7، Issue: 2

Publish Year: 1392

نوع سند: مقاله ژورنالی

زبان: English

This Paper With 11 Page And PDF Format Ready To Download

دریافت فایل کامل Paper

Certificate
من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

https://civilica.com/doc/1795322

شناسه ملی سند علمی:

JR_MJEE-7-2_001

تاریخ نمایه سازی: 3 آبان 1402

Abstract:

This paper proposes a new feature extraction algorithm which is robust against noise using histogram compensation and asymmetric filter. Temporal masking would be provided to improve ASR systems specifically in matched and multistyle training conditions. Nonlinear filtering and temporal masking are used in this algorithm. By matching the power histograms of the input in each frequency band to those obtained over clean training data, and then mixing together the processed and unprocessed spectra can be increased appropriately speech recognition accuracy. Obtaining results show that recognition accuracy in compare with MFCC, PLP and PNCC has been improved in various training conditions.

Keywords:

en

Authors

Hassan Farsi

University of Birjand

Samana Kuhimoghadam

Department of Engineering, University of payam noor, Mashhaad

مراجع و منابع این Paper:

لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :

B. Atal, “Effectiveness of linear prediction characteristics of the speech ...
P. Jain and H. Hermansky, “Improved mean and variance normalization ...
X. Huang, A. Acero, and H-W Won, “Spoken Language Processing: ...
Y. Obuchi, N. Hataoka, and R. M. Stern, “Normalization of ...
C.Kim and R.M Stern,“Feature extraction for robust speech recognition based ...
S. Dharanipragada and M. Padmanabhan, “A nonlinear unsupervised adaptation technique ...
A. de la Torre et al., “Non-linear transformations of the ...
F. Hilger‚ “Quantile based histogram equalization for noise robust speech ...
H. Hermansky and N. Morgan, “RASTA processing of speech,” IEEE. ...
B. E. D. Kingsbury, N. Morgan, and, S. Greenberg, “Robust ...
H. G. Hirsch and C. Ehrlicher, “Noise estimation techniques or ...
C. Kim and R. M. Stern, “Nonlinear enhancement of onset ...
S. F. Boll, “Suppression of acoustic noise in speech using ...
C. Kim and R. M. Stern, “Power function-based power distribution ...
R.C. Gonzalez and R.E.Woods,“ Digital Image Processing, Pearson Prentice Hall, ...
C. Kim, K. Kumar, and R.M. Stern, “Robust speech recognition ...
M. Bijankhan and J. Sheikhzadegan, “FARSDAT – The Speech Database ...
SPIB, SPIB noise data. Available from: ...

نمایش کامل مراجع