سیویلیکا را در شبکه های اجتماعی دنبال نمایید.

Feature mapping using deep belief networks for robust speech recognition

Publish Year: 1393
Type: Journal paper
Language: English
View: 35

This Paper With 7 Page And PDF Format Ready To Download

Export:

Link to this Paper:

Document National Code:

JR_MJEEMO-14-3_003

Index date: 11 March 2025

Feature mapping using deep belief networks for robust speech recognition abstract

Performance of automatic speech recognition (ASR) systems degrades in noisy conditions due to mismatch between training and test environments. Many methods have been proposed for reducing this mismatch in ASR systems. In recent years, deep neural networks (DNNs) have been widely used in ASR systems and also robust speech recognition and feature extraction. In this paper, we propose to use deep belief network (DBN) as a post-processing method for de-noising Mel frequency cepstral coefficients (MFCCs). In addition, we use deep belief network for extracting tandem features (posterior probability of phones occurrence) from de-noised MFCCs (obtained from previous stage) to obtain more robust and discriminative features. The final robust feature vector consists of de-noised MFCCs concatenated to mentioned tandem features. Evaluation results on Aurora2 database show that the proposed feature vector performs better than similar and conventional techniques, where it increases recognition accuracy in average by 28% in comparison to MFCCs.

Feature mapping using deep belief networks for robust speech recognition authors

مجتبی غلامی پور

MSc in Artificial Intelligence from the School of Computer Engineering University of Technology, Tusi

بابک ناصرشریف

Assistant Professor Department of Computer Engineering, K.N.Toosi University of Technology.