Persian Phoneme and Syllable Recognition using Recurrent Neural Networks for Phonological Awareness Assessment
Publish Year: 1401
نوع سند: مقاله ژورنالی
زبان: English
View: 199
This Paper With 11 Page And PDF Format Ready To Download
- Certificate
- من نویسنده این مقاله هستم
این Paper در بخشهای موضوعی زیر دسته بندی شده است:
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
JR_JADM-10-1_010
تاریخ نمایه سازی: 21 فروردین 1401
Abstract:
One of the main problems in children with learning difficulties is the weakness of phonological awareness (PA) skills. In this regard, PA tests are used to evaluate this skill. Currently, this assessment is paper-based for the Persian language. To accelerate the process of the assessments and make it engaging for children, we propose a computer-based solution that is a comprehensive Persian phonological awareness assessment system implementing expressive and pointing tasks. For the expressive tasks, the solution is powered by recurrent neural network-based speech recognition systems. To this end, various recognition modules are implemented, including a phoneme recognition system for the phoneme segmentation task, a syllable recognition system for the syllable segmentation task, and a sub-word recognition system for three types of phoneme deletion tasks, including initial, middle, and final phoneme deletion. The recognition systems use bidirectional long short-term memory neural networks to construct acoustic models. To implement the recognition systems, we designed and collected Persian Kid’s Speech Corpus that is the largest in Persian for children’s speech. The accuracy rate for phoneme recognition was ۸۵.۵%, and for syllable recognition was ۸۹.۴%. The accuracy rates of the initial, middle, and final phoneme deletion were ۹۶.۷۶%, ۹۸.۲۱%, and ۹۵.۹%, respectively.
Keywords:
Speech Therapy , Phonological Awareness Assessment , Kid’s Speech Recognition , Persian Phoneme and Syllable Recognition , Long Short-Term Memory Neural Network
Authors
M. Khanzadi
Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran.
H. Veisi
Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran.
R. Alinaghizade
Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran.
Z. Soleymani
Department of Speech Therapy, School of Rehabilitation, Tehran University of Medical Sciences, Tehran, Iran.
مراجع و منابع این Paper:
لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :