N-gram Adaptation Using Dirichlet Class Language Model Based on Part-of-Speech for Speech Recognition

Ali Hatami; Ahmad Akbari; Babak Nasersharif

N-gram Adaptation Using Dirichlet Class Language Model Based on Part-of-Speech for Speech Recognition

Publish place: 21th Iranian Conference on Electric Engineering

Publish Year: 1392

نوع سند: مقاله کنفرانسی

زبان: English

This Paper With 5 Page And PDF Format Ready To Download

دریافت فایل کامل Paper

Certificate
من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

https://civilica.com/doc/208377

شناسه ملی سند علمی:

ICEE21_320

تاریخ نمایه سازی: 27 مرداد 1392

Abstract:

Language model plays an important role in automatic speech recognition (ASR) systems. Performance of this model depends on its adaptation to the linguistic features.Accordingly, adaptation methods endeavour to apply syntactic and semantic characteristics of the language for languagemodeling. The previous adaptation methods such as family ofDirichlet class language model (DCLM) extract class of history words. These methods due to lake of syntactic information arenot suitable for high morphology languages such as Farsi. This work proposes an idea for using syntactic information such aspart-of-speech (POS) in DCLM for combining with an n-gram language model. In our proposed approach, word clustering isbased on POS of previous words and history words. The performance of language models are evaluated on BijanKhan corpus using a hidden Markov model based ASR system. Our experiments show that using POS information along with history words and class of history words mproves language model, and decreases the perplexity on our corpus. Exploiting POS information along with DCLM, the word error rate of the ASR system decreases by 1% in comparison to DCLM.

Keywords:

speech recognition , language model adaptation , part-of-speech , perplexity , word error rate

Authors

Ali Hatami

Computer Engineering Department, Iran University of Science and Technology, Tehran, Iran

Ahmad Akbari

Computer Engineering Department, Iran University of Science and Technology, Tehran, Iran

Babak Nasersharif