Speech Recognition System Based on Machine Learning in Persian Language
Publish Year: 1401
نوع سند: مقاله ژورنالی
زبان: English
View: 222
This Paper With 12 Page And PDF Format Ready To Download
- Certificate
- من نویسنده این مقاله هستم
این Paper در بخشهای موضوعی زیر دسته بندی شده است:
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
JR_CAND-1-2_003
تاریخ نمایه سازی: 28 دی 1401
Abstract:
In today's world, where speech recognition has become an integral part of our daily lives, the need for systems equipped with this technology has increased dramatically in the past few years. This research aims to locate the two selected Persian words in any given audio file. For this purpose, two standard and native datasets were prepared for this model one for train and the other for the test. Both datasets were converted into images of audio waveforms. Using the object detection technique, the model could extract different bounding boxes for each test audio, and then each box image goes through a CNN classifier and returns a corresponding label. Finally, a threshold is set so that only boxes with high accuracy are displayed as output. The results showed ۹۳% accuracy for the CNN classifier and ۵۰% accuracy for testing the model with object detection.
Keywords:
Authors
Shahed Mohammadi
Department of Computer Since and Systems Engineering, Ayandegan Institute of Higher Education, Tonekabon, Iran.
Niloufar Hemati
Department of Computer Science, Islamic Azad University Central Tehran Branch, Tehran, Iran.
Neda Mohammadi
Department of Industrial Engineering, Sadra University, Tehran, Iran.
مراجع و منابع این Paper:
لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :