Image to Image Translation based on Convolutional Neural Network Approach for Speech Declipping

Clipping, a common nonlinear distortion, often occurs because of the limited dynamic range of audio recorders. It degrades speech quality and intelligibility and adversely affects the performance of speech and speaker recognition systems. In this paper, we focus on the enhancement of clipped speech using a fully convolutional neural network, U-Net. Motivated by the idea of image-to-image translation, we propose a declipping approach, the U-Net declipper, in which the magnitude spectrum images of clipped signals are translated to the corresponding images of clean signals. The experimental results show that the proposed approach outperforms other declipping methods in terms of both quality and intelligibility measures, especially in severe clipping cases. Moreover, the superior performance of the U-Net declipper over well-known declipping methods is also verified under additive Gaussian noise conditions.
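
To make the two ingredients named in the abstract concrete, the following minimal sketch shows hard clipping of a signal and the construction of a magnitude spectrum "image" (frames by frequency bins) from it. It is only an illustration under stated assumptions: the function names, the 440 Hz test tone, the 8 kHz sample rate, the clipping threshold of 0.3, the Hann window, and the frame and hop sizes are all hypothetical choices, not values taken from the paper, and the U-Net itself is not shown.

```python
import cmath
import math


def hard_clip(signal, threshold):
    """Limit every sample to the range [-threshold, threshold]."""
    return [max(-threshold, min(threshold, s)) for s in signal]


def magnitude_spectrum(frame):
    """Magnitude of the DFT of one frame (naive O(N^2) DFT, bins 0..N/2)."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(frame[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        mags.append(abs(acc))
    return mags


def spectrogram(signal, frame_len=64, hop=32):
    """Stack per-frame magnitude spectra into a 2-D image (frames x bins)."""
    # Periodic Hann window (an assumed choice, not specified by the source).
    window = [0.5 - 0.5 * math.cos(2 * math.pi * t / frame_len) for t in range(frame_len)]
    image = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + t] * window[t] for t in range(frame_len)]
        image.append(magnitude_spectrum(frame))
    return image


# Toy "clean" signal: a 440 Hz tone sampled at 8 kHz (illustrative values only).
sample_rate = 8000
clean = [math.sin(2 * math.pi * 440 * t / sample_rate) for t in range(512)]
clipped = hard_clip(clean, threshold=0.3)

# In an image-to-image setup, clipped_image would be the network input
# and clean_image the training target.
clean_image = spectrogram(clean)
clipped_image = spectrogram(clipped)
```

In this toy case the clipped tone gains energy around its third harmonic (near 1320 Hz) that the clean tone lacks. That kind of added spectral content is what a declipping model working on spectrum images would have to remove.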

Keywords:

speech clipping, image-to-image translation, U-Net declipper, spectrum image

Authors

Hamidreza Baradaran Kashani

Electrical Engineering Faculty, Amirkabir University of Technology, Tehran, Iran

Ata Jodeiri

School of Electrical & Computer Engineering, University of Tehran, Tehran, Iran

Mohammad Mohsen Goodarzi

Department of Biomedical Engineering, Buein Zahra Technical University, Buein Zahra, Qazvin, Iran

Shabnam Gholamdokht Firooz

School of Electrical & Computer Engineering, University of Tehran, Tehran, Iran

This paper is categorized under the following subject areas:

Artificial Intelligence > Neural Networks

Permanent link to this paper:

https://civilica.com/doc/989081

National scientific document ID:

ETECH04_066

Indexing date: 27 Bahman 1398 (16 February 2020)

How to Cite This Paper:

To cite this paper in your research, you can use the following entry in your reference list:

Baradaran Kashani, Hamidreza and Jodeiri, Ata and Goodarzi, Mohammad Mohsen and Gholamdokht Firooz, Shabnam, 1398, Image to Image Translation based on Convolutional Neural Network Approach for Speech Declipping, Fourth National Conference on Electrical and Computer Engineering, Tehran, https://civilica.com/doc/989081

Scientometrics

Details of the institution that published this paper:

Ranking of AmirKabir University

Type of center: Public university

Paper count: 25,285

The scientometrics section of CIVILICA shows the scientific ranking of Iranian academic and research centers, based on statistics of indexed articles.