Speech Enhancement using Greedy Dictionary Learning and Sparse Recovery

Publish Year: 1402
نوع سند: مقاله ژورنالی
زبان: English
View: 115

This Paper With 13 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

JR_JITM-15-5_008

تاریخ نمایه سازی: 1 آبان 1401

Abstract:

Most real-time speech signals are frequently disrupted by noise such as traffic, babbling, and background noises, among other things. The goal of speech denoising is to extract the clean speech signal from as many distorted components as possible. For speech denoising, many researchers worked on sparse representation and dictionary learning algorithms. These algorithms, however, have many disadvantages, including being overcomplete, computationally expensive, and susceptible to orthogonality restrictions, as well as a lack of arithmetic precision due to the usage of double-precision. We propose a greedy technique for dictionary learning with sparse representation to overcome these concerns. In this technique, the input signal's singular value decomposition is used to exploit orthogonality, and here the ℓ۱-ℓ۲ norm is employed to obtain sparsity to learn the dictionary. It improves dictionary learning by overcoming the orthogonality constraint, the three-sigma rule-based number of iterations, and the overcomplete nature. And this technique has resulted in improved performance as well as reduced computing complexity. With a bit-precision of Q۷ fixed-point arithmetic, this approach is also used in resource-constrained embedded systems, and the performance is considerably better than other algorithms. The greedy approach outperforms the other two in terms of SNR, Short-Time Objective Intelligibility, and computing time.

Authors

Srinivas

Research Scholar, ECE Department, JNTUK, Kakinada, India.

Santhi Prabha

Professor, ECE Department, JNTUK, Kakinada, India.

Venugopala Rao

Professor, ECE Department, K. L. University, Guntur, India.

مراجع و منابع این Paper:

لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :
  • Aharon, M., Elad, M., & Bruckstein, A. (۲۰۰۶). K-SVD: An ...
  • Beheshti, H., Daei, S., & Haddadi, F. (۲۰۱۸). Adaptive Recovery ...
  • Tang, H., Liu, H., Xiao, W., & Sebe, N. (۲۰۲۱). ...
  • Zhai, Y., Yang, Z., Liao, Z., Wright, J., & Ma, ...
  • Zhang, Y., Kuo, H. W., & Wright, J. (۲۰۲۰). Structured ...
  • نمایش کامل مراجع