CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Filtering and Compression of Mel Sub-Band Energies for Speech Recognition in Noise

عنوان مقاله: Filtering and Compression of Mel Sub-Band Energies for Speech Recognition in Noise
شناسه ملی مقاله: ICEE15_132
منتشر شده در پانزدهیمن کنفرانس مهندسی برق ایران در سال 1386
مشخصات نویسندگان مقاله:

Babak Nasersharif - Computer Engineering Department, Iran University of Science and Technology
Ahmad Akbari - Computer Engineering Department, Iran University of Science and Technology
Mohantnmd Mehdi Honzayouttpour - Computer Engineering and IT department, Amirkabir University of Technology

خلاصه مقاله:
The Mel-frequency cepstral cofficients (MFCC) are commonly used in speech recognition systems. Bu they are high sensitive to presence of external noise. In this paper, we propose a noise compensation method for Mel filter bank energies and so MFCC features. This compensation method is performed in two stages: Mel sub-band filtering and then compression of Mel-sub-band energies. In the compression step, we propose a sub-bqnd SNRdependent compression function. We use this function in place of logarithm function in conventional MFCC feature extraction in presence of additive noise. Results show that the proposed nethod significantly improves MFCC features performance in noisy conditions where it decreases average word error rate up to 3094 for isolated word recognition on three test sets of Aurora 2 database.

کلمات کلیدی:
Mel sub-bands, Mel sub-band filter, SNR-dependent compression, MFCC

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/25201/