ارائه یک مدل پارامتریک تطبیقی جهت کشف و رده بندی وقایع صوتی در سیگنال های محیطی

Publish Year: 1398
نوع سند: مقاله ژورنالی
زبان: English
View: 303

This Paper With 12 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

JR_TJEE-49-2_009

تاریخ نمایه سازی: 20 آذر 1398

Abstract:

Audio event detection (AED) is a modern way to collect data about human activities in the workplace or in other life environments. We proposed a novel adaptable model based on using two parameters, α and ᵦ to detect all audio events that may be present in a given record accompanied by their time limits in which they occur. After feature extraction and setting the values of the two key parameters, alpha and beta, the audio sequence will be sent into two distinct sub-systems for event detection. The outputs from the two sub-classifiers are then combined and necessary refinements are made on the event time limits. The final detected events are sent to the KNN classifier. The parameters serve as a trade-off tool between precision and recall expectation in the detection process. In the tests, 16 different audio events of an office room were detected, some being similar to each other and some have very similar characteristics to those of the background noise. At frame-based (FB) level, the precision rate was 70.1%, the rate of recall was 75.8%, and F1-measure was 72.8%. The F1-measure has increased by 10.8% suggesting promising applications of the model.

Authors

M. Derakhshan

Computer and IT Engineering Department, Shahrood University of Technology, Shahrood, Iran

H. Marvi

۲- Computer and IT Engineering Department, Shahrood University of Technology, Shahrood, Iran

H. Hassan poor

Computer and IT Engineering Department, Shahrood University of Technology, Shahrood, Iran

مراجع و منابع این Paper:

لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :
  • مجتبی حاجی آبادی، عباس ابراهیمی مقدم و حسین خوش بین، ... [مقاله ژورنالی]
  • مسعود گراوانچی زاده و ساناز قائمی سردرودی، بهبود کیفیت گفتار ... [مقاله ژورنالی]
  • F. Aurino, M. Folla, F. Gargiulo, V. Moscato, A. Picariello, ...
  • V. Carletti, P. Foggia, G. Percannella, A. Saggese, N. Strisciuglio, ...
  • R. Maher, Acoustical modeling of gunshots including directional information and ...
  • R. Cai, L. Lu, and A. Hanjalic, Co-clustering for Auditory ...
  • Y. Ohishi, D. Mochihashi, T. Matsui, M. Nakano, H. Kameoka, ...
  • E. Benetos, G. Lafay, M. Lagrange, and M. Plumbley, Detection ...
  • D. Stowell, D. Giannoulis, E. Benetos, M. Lagrange, and M. ...
  • R. Togneri and D. Pullella, An overview of speaker identification: ...
  • S. Pancoast and M. Akbacak, Bag-of-audio-words approach for multimedia event ...
  • A. Plinge, R. Grzeszick, and G. Fink, A Bag-of-Features approach ...
  • T. Heittola, A. Mesaros, T. Virtanen, and A. Eronen, Sound ...
  • R. Hennequin, R. Badeau and B. David, NMF with Time–Frequency ...
  • T. Komatsu, Y. Senda, and R. Kondo, Acoustic event detection ...
  • X. Lu, Y. Tsao, S. Matsuda and C. Hori, Sparse ...
  • IEEE DCASE 2016 Challenge, http://www.cs.tut.fi/sgn/arg/dcase2016/, 2016. ...
  • I. Choi, K. Kwon, S. Hyun Bae, and N. Soo ...
  • T. Hayashi, S. Watanabe, T. Toda, T. Hori, J. Le ...
  • J. Kurby, R. Grzeszick, A. Plinge, and G A. Fink, ...
  • M. Zohrer, and F. Pernkopf, Gated recurrent networks applied to ...
  • X. Zhuang, X. Zhou, M. Hasegawa-Johnson, and T. S. Huang, ...
  • E. Miquel, F. Masakiyo, S. Daisuke, O. Nobutaka, and S. ...
  • L. Vuegen, B. Van Den Broeck, P. Karsmakers, J. F. ...
  • T. Fawcett, ROC Graphs: Notes and Practical Considerations for Researchers, ...
  • J. T. Geiger, B. Schuller, and G. Rigoll, Recognizing acoustic ...
  • D. Li, J. Tam, and D. Toub, Auditory scene classification ...
  • X. Zhou, X. Zhuang, M. Liu, H. Tang, M. Hasegawa-Johnson, ...
  • A. Mesaros, T. Heittola, A. Eronen, and T. Virtanen, Acoustic ...
  • W. Nogueira, G. Roma, and P. Herrera, Automatic event classification ...
  • M. E. Niessen, T. L. M. V. Kasteren, and A. ...
  • J. F. Gemmeke, L. Vuegen, P. Karsmakers, B. Vanrumste, and ...
  • L. Vuegena, B. V. D. Broeck, P. Karsmakers, J. F. ...
  • J. Schröder, B. Cauchi, M. R. Schädler, N. Moritz, K. ...
  • A. Diment, T. Heittola, and T. Virtanen, Sound event detection ...
  • J. Schroder, S. Goetze, and J. Anemuller, Spectro-Temporal Gabor Filterbank ...
  • نمایش کامل مراجع