An End-to-End Deep Learning Model to Recognize Farsi Speech from Raw Input

Sina Alisamir; Seyed Mohammad Ahadi; Sanaz Seyedin

سیویلیکا را در شبکه های اجتماعی دنبال نمایید.

Advanced search Thesis

Papers Conferences Journals

An End-to-End Deep Learning Model to Recognize Farsi Speech from Raw Input

Publish place: 4TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS

Publish Year: 1397

Type: Conference paper

Language: English

این Paper فقط به صورت چکیده توسط دبیرخانه ارسال شده است و فایل کامل قابل دریافت نیست. برای یافتن Papers دارای فایل کامل، از بخش [جستجوی مقالات فارسی] اقدام فرمایید.

نسخه کامل این Paper ارائه نشده است و در دسترس نمی باشد

Certificate
I'm the author of the paper

این Paper در بخشهای موضوعی زیر دسته بندی شده است:

هوش مصنوعی > یادگیری عمیق

Export:

Link to this Paper:

https://civilica.com/doc/842943

Document National Code:

SPIS04_028

Index date: 6 May 2019

An End-to-End Deep Learning Model to Recognize Farsi Speech from Raw Input abstract

Automatic speech recognition systems usually solve the problem of recognizing speech by dividing the problem into different independent stages. First, they extract speech features and then use an acoustic model to reach the phoneme probabilities and from those probabilities, they reach sequence of recognized words. Recent advances in technology, especially in the area of deep neural networks in combination with speech recognition, shows that this division is not necessary and we can reach sequence of alphabet letters straight from the raw signal. In this work, we implemented and tested an endto- end convolutional neural network system with raw input for Farsi speech recognition and then compared its performance to another system that uses MFCC features. We show that using an end-to-end system with our configuration,which reaches series of phonemes from raw speech works better for Farsi speech as well as for English.

An End-to-End Deep Learning Model to Recognize Farsi Speech from Raw Input authors

Sina Alisamir

Seyed Mohammad Ahadi

Sanaz Seyedin

Certificate
I'm the author of the paper

این Paper در بخشهای موضوعی زیر دسته بندی شده است:

هوش مصنوعی > یادگیری عمیق

Export:

Link to this Paper:

https://civilica.com/doc/842943

Document National Code:

SPIS04_028

Index date: 6 May 2019

How to cite:

If you want to refer to this Paper in your research work, you can simply use the following phrase in the references section:

Alisamir, Sina and Ahadi, Seyed Mohammad and Seyedin, Sanaz,1397,An End-to-End Deep Learning Model to Recognize Farsi Speech from Raw Input,4TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS,Tehran,https://civilica.com/doc/842943

در داخل متن نیز هر جا که به عبارت و یا دستاوردی از این Paper اشاره شود پس از ذکر مطلب، در داخل پارانتز، مشخصات زیر نوشته می شود.
برای بار اول: (1397, Alisamir, Sina؛ Seyed Mohammad Ahadi and Sanaz Seyedin)
برای بار دوم به بعد: (1397, Alisamir؛ Ahadi and Seyedin)
برای آشنایی کامل با نحوه مرجع نویسی لطفا بخش راهنمای سیویلیکا (مرجع دهی) را ملاحظه نمایید.