CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Persian Phoneme and Syllable Recognition using Recurrent Neural Networks for Phonological Awareness Assessment

عنوان مقاله: Persian Phoneme and Syllable Recognition using Recurrent Neural Networks for Phonological Awareness Assessment
شناسه ملی مقاله: JR_JADM-10-1_010
منتشر شده در در سال 1401
مشخصات نویسندگان مقاله:

M. Khanzadi - Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran.
H. Veisi - Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran.
R. Alinaghizade - Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran.
Z. Soleymani - Department of Speech Therapy, School of Rehabilitation, Tehran University of Medical Sciences, Tehran, Iran.

خلاصه مقاله:
One of the main problems in children with learning difficulties is the weakness of phonological awareness (PA) skills. In this regard, PA tests are used to evaluate this skill. Currently, this assessment is paper-based for the Persian language. To accelerate the process of the assessments and make it engaging for children, we propose a computer-based solution that is a comprehensive Persian phonological awareness assessment system implementing expressive and pointing tasks. For the expressive tasks, the solution is powered by recurrent neural network-based speech recognition systems. To this end, various recognition modules are implemented, including a phoneme recognition system for the phoneme segmentation task, a syllable recognition system for the syllable segmentation task, and a sub-word recognition system for three types of phoneme deletion tasks, including initial, middle, and final phoneme deletion. The recognition systems use bidirectional long short-term memory neural networks to construct acoustic models. To implement the recognition systems, we designed and collected Persian Kid’s Speech Corpus that is the largest in Persian for children’s speech. The accuracy rate for phoneme recognition was ۸۵.۵%, and for syllable recognition was ۸۹.۴%. The accuracy rates of the initial, middle, and final phoneme deletion were ۹۶.۷۶%, ۹۸.۲۱%, and ۹۵.۹%, respectively.

کلمات کلیدی:
Speech Therapy, Phonological Awareness Assessment, Kid’s Speech Recognition, Persian Phoneme and Syllable Recognition, Long Short-Term Memory Neural Network

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1424967/