A x-vector based Speaker Recognition in Persian
Publish Year: 1403
Type: Journal paper
Language: English
View: 64
This Paper With 13 Page And PDF Format Ready To Download
- Certificate
- I'm the author of the paper
Export:
Document National Code:
JR_EAR-1-2_007
Index date: 10 December 2024
A x-vector based Speaker Recognition in Persian abstract
In this paper, a text-independent speaker recognition system in Persian is implemented by deep neural networks. The x-vector technique based on Time Delay Neural Network (TDNN) is used to extract the embeddings from speech signals. This method attracts researcher’s attention due to noise robustness and high performance. Data augmentation and noise addition are used to improve system performance. The PLDA classifier is used to recognize the speaker. Previous research in the field of “speaker recognition in Persian” is limited. In this work, the network is trained on the Persian part of the CommonVoice dataset. According to the error analysis, non-speech parts of an utterance decrease the accuracy of speaker recognition. So, the non-speech parts are removed by a Convolutional Recurrent Deep Neural Networks (CRDNN). The accuracy of speaker recognition and verification in CommonVoice is 95.24% and 95.56%, respectively. The Equal Error Rate (EER) evaluation metric of the speaker verification system is 4.72%. The attendance monitoring system was developed as one of the applications of the speaker recognition system. System accuracy for 12 and 15 seconds of collected data(includes 16 women and 12 men) is 98.92% and 100%, respectivly.
A x-vector based Speaker Recognition in Persian Keywords:
A x-vector based Speaker Recognition in Persian authors
fatemeh shahbakhti
Department of Electrical and Computer Engineering, Faculty of Shariaty, Skill National University (nus), Tehran, Iran
maryam Moradi-Shabestari
Electrical and Computer Engineering Department, Tehran University, Tehran, Iran
Zeinab Ghasemi-Naraghi
Computer Engineering Department, AmirKabir University of Technology, Tehran, Iran
مراجع و منابع این Paper:
لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :