ارائه یک مدل پارامتریک تطبیقی جهت کشف و رده بندی وقایع صوتی در سیگنال های محیطی

M. Derakhshan; H. Marvi; H. Hassan poor

ارائه یک مدل پارامتریک تطبیقی جهت کشف و رده بندی وقایع صوتی در سیگنال های محیطی

Publish place: Tabriz Journal of Electrical Engineering، Vol: 49، Issue: 2

Publish Year: 1398

نوع سند: مقاله ژورنالی

زبان: English

This Paper With 12 Page And PDF Format Ready To Download

دریافت فایل کامل Paper

Certificate
من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

https://civilica.com/doc/966358

شناسه ملی سند علمی:

JR_TJEE-49-2_009

تاریخ نمایه سازی: 20 آذر 1398

Abstract:

Audio event detection (AED) is a modern way to collect data about human activities in the workplace or in other life environments. We proposed a novel adaptable model based on using two parameters, α and ᵦ to detect all audio events that may be present in a given record accompanied by their time limits in which they occur. After feature extraction and setting the values of the two key parameters, alpha and beta, the audio sequence will be sent into two distinct sub-systems for event detection. The outputs from the two sub-classifiers are then combined and necessary refinements are made on the event time limits. The final detected events are sent to the KNN classifier. The parameters serve as a trade-off tool between precision and recall expectation in the detection process. In the tests, 16 different audio events of an office room were detected, some being similar to each other and some have very similar characteristics to those of the background noise. At frame-based (FB) level, the precision rate was 70.1%, the rate of recall was 75.8%, and F1-measure was 72.8%. The F1-measure has increased by 10.8% suggesting promising applications of the model.

Keywords:

Audio event detection (AED) , environmental sounds , unsupervised learning , adaptable modeling systems , audio monitoring systems , audio-based acquisition systems

Authors

M. Derakhshan

Computer and IT Engineering Department, Shahrood University of Technology, Shahrood, Iran

H. Marvi

۲- Computer and IT Engineering Department, Shahrood University of Technology, Shahrood, Iran

H. Hassan poor

Computer and IT Engineering Department, Shahrood University of Technology, Shahrood, Iran

مراجع و منابع این Paper:

لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :

مجتبی حاجی آبادی، عباس ابراهیمی مقدم و حسین خوش بین، ... [مقاله ژورنالی]
مسعود گراوانچی زاده و ساناز قائمی سردرودی، بهبود کیفیت گفتار ... [مقاله ژورنالی]
F. Aurino, M. Folla, F. Gargiulo, V. Moscato, A. Picariello, ...
V. Carletti, P. Foggia, G. Percannella, A. Saggese, N. Strisciuglio, ...
R. Maher, Acoustical modeling of gunshots including directional information and ...
R. Cai, L. Lu, and A. Hanjalic, Co-clustering for Auditory ...
Y. Ohishi, D. Mochihashi, T. Matsui, M. Nakano, H. Kameoka, ...
E. Benetos, G. Lafay, M. Lagrange, and M. Plumbley, Detection ...
D. Stowell, D. Giannoulis, E. Benetos, M. Lagrange, and M. ...
R. Togneri and D. Pullella, An overview of speaker identification: ...
S. Pancoast and M. Akbacak, Bag-of-audio-words approach for multimedia event ...
A. Plinge, R. Grzeszick, and G. Fink, A Bag-of-Features approach ...
T. Heittola, A. Mesaros, T. Virtanen, and A. Eronen, Sound ...
R. Hennequin, R. Badeau and B. David, NMF with Time–Frequency ...
T. Komatsu, Y. Senda, and R. Kondo, Acoustic event detection ...
X. Lu, Y. Tsao, S. Matsuda and C. Hori, Sparse ...
IEEE DCASE 2016 Challenge, http://www.cs.tut.fi/sgn/arg/dcase2016/, 2016. ...
I. Choi, K. Kwon, S. Hyun Bae, and N. Soo ...
T. Hayashi, S. Watanabe, T. Toda, T. Hori, J. Le ...
J. Kurby, R. Grzeszick, A. Plinge, and G A. Fink, ...
M. Zohrer, and F. Pernkopf, Gated recurrent networks applied to ...
X. Zhuang, X. Zhou, M. Hasegawa-Johnson, and T. S. Huang, ...
E. Miquel, F. Masakiyo, S. Daisuke, O. Nobutaka, and S. ...
L. Vuegen, B. Van Den Broeck, P. Karsmakers, J. F. ...
T. Fawcett, ROC Graphs: Notes and Practical Considerations for Researchers, ...
J. T. Geiger, B. Schuller, and G. Rigoll, Recognizing acoustic ...
D. Li, J. Tam, and D. Toub, Auditory scene classification ...
X. Zhou, X. Zhuang, M. Liu, H. Tang, M. Hasegawa-Johnson, ...
A. Mesaros, T. Heittola, A. Eronen, and T. Virtanen, Acoustic ...
W. Nogueira, G. Roma, and P. Herrera, Automatic event classification ...
M. E. Niessen, T. L. M. V. Kasteren, and A. ...
J. F. Gemmeke, L. Vuegen, P. Karsmakers, B. Vanrumste, and ...
L. Vuegena, B. V. D. Broeck, P. Karsmakers, J. F. ...
J. Schröder, B. Cauchi, M. R. Schädler, N. Moritz, K. ...
A. Diment, T. Heittola, and T. Virtanen, Sound event detection ...
J. Schroder, S. Goetze, and J. Anemuller, Spectro-Temporal Gabor Filterbank ...

نمایش کامل مراجع