Persian Phoneme Recognition using Long Short-Term Memory Neural Network

Publish Year: 1395
نوع سند: مقاله کنفرانسی
زبان: English
View: 705

This Paper With 5 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

این Paper در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

ICIKT08_046

تاریخ نمایه سازی: 5 بهمن 1395

Abstract:

Recently Recurrent Neural Networks (RNNs) have shown impressive performance in sequence classification tasks. In this paper we apply Long Short-Term Memory (LSTM) network on Persian phoneme recognition. For years Hidden Markov Model (HMM) was the dominant technique in speech recognition system but after introducing LSTM, RNNs outperformed HHM-based methods. We apply LSTM and deep LSTM on FARSDAT speech database and find that both LSTM and deep LSTM outperforms HMM in Persian phoneme recognition. Our evaluation show that deep LSTM achieves 17.55% error in FARSDAT phoneme recognition on test set which to our knowledge is the best recorded result.

Keywords:

Long Short-Term Memory (LSTM) , Farsdat , Persian Speech Recognition , Connectionist Temporal Classification (CTC)

Authors

Mohammad Daneshvar

Faculty of New Sciences and Technologies University of Tehran Tehran, Iran

Hadi Veisi

Faculty of New Sciences and Technologies University of Tehran Tehran, Iran