Extract features from Persian speech signal for emotion recognition

Publish Year: 1400
نوع سند: مقاله کنفرانسی
زبان: English
View: 432

This Paper With 6 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

این Paper در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

ITCT12_106

تاریخ نمایه سازی: 7 شهریور 1400

Abstract:

Establishing verbal communication between man and machine, as well as understanding the human emotions by the machine and providing an appropriate response to it requires understanding the speech and emotions of the speaker, in this regard, research in the field of emotion recognition in speech signal is necessary. In this paper, an attempt is made to design and implement a system to determine and detect emotions of anger and happiness in the Persian speech signal. Research on recognizing some emotions has been done in most languages, but due to the difficulty of creating a speech database, so far little research has been done to identify emotions in Persian speech. In this article, due to the lack of a suitable database in Persian to detect emotions, at first, a database for moods of happiness and anger and neutral (without any emotion) in Persian, consisting of ۷۲۰ sentences was created. Then the frequency characteristics of speech signals obtained from Fourier transform such as maximum, minimum, median and mean as well as LPC coefficients were extracted. Then, the MLP neural network was used to detect emotions of happiness and anger with an average accuracy of ۸۷.۷۴%.

Keywords:

Authors

Seyed Mehdi Hoseini

Department of Computer Science, University of Mazandaran, Babolsar, Iran