An Enhanced SMOTE Algorithm Using Entropy and Clustering for Imbalanced Accident Data

Publish Year: 1393
نوع سند: مقاله کنفرانسی
زبان: English
View: 547

This Paper With 6 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

CITCONF02_513

تاریخ نمایه سازی: 19 اردیبهشت 1395

Abstract:

Over the course of the century, many real-world applications of imbalanced data are emerged. One of its implication which is first considered in this context, is imbalanced accident data. In this paper, the data of transportation and accidents in Tehran-Bazargan highway between 2010 and 2015 is considered. In the pre-processing step, SMOTE is considered as one of the most important over-sampling technique that effectively balance the imbalanced data. However, it brings noise and other problems and a great need is felt for improving this method. To solve these problems, several techniques have been proposed in this study such as combination of dynamic selected, weighted attribute and distance weighted techniques along with mixture of classification and clustering techniques. Performance of the proposed algorithm is measured by f-measure and ROC curve and the results are compared by Weka’s SMOTE with different algorithms.

Authors

Sima Sharifirad

Master student of computer science, AmirKabir University

Azra Nazari

Graduate student of master of computer science, AmirKabir University

Mahdi Ghatee

Assistant professor of computer science, AmirKabir University

مراجع و منابع این Paper:

لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :
  • J. Laurikkala, (2001), "Improving Identification of Difficult Small Classes by ...
  • Chawla, N.V., K. Bowyer, Hall.l and Kegelmeyer , W.(2002), SMOTE: ...
  • Lopez, V. Fernandez, A and Garcfa, S.I, (2013) _ insight ...
  • European Transport Safety Council, (2013). Back on track to reach ...
  • World Health Report: Making a difference. Geneva, (1999), World Health ...
  • .Raj aNews.com, (1 393). ...
  • Wu, J., S.C. Brubaker, M.D. Mullin and J.M. Rehg, (2008)." ...
  • He, H.B. and E.A. Garcia, 2009. Learning from imibalanced data. ...
  • Ying, ..(201 2), "Imbalanced classification based On Active Learning SMOTE, ...
  • Lewis, D. and W. Gale, (1998). "Training text classifiere by ...
  • Ling, C. and Li, C. (1998). "Data Mining for Direct ...
  • Japkowicz, N. (Ed.). (200). Proceedings of the AAAI 200) Workshop ...
  • Nitesh V.C et al, (2002), ;" SMOTE: Synthetic Minority Oversampling ...
  • H. Han, W.Y. Wang, and B.H. Mao, (2005) _ orderline- ...
  • M. Kubat and S. Matwin, (1997) "Addressing the Curse of ...
  • G.E.A.P.A. Batista, R.C. Prati, and M.C. Monard, (204), "A Study ...
  • X. Xiao, and H. Ding, (2012), "Enhancemet of K-nearest Neighbor ...
  • L. Jiang, Z. Cai, D. Wang, and S. Jiang, (2007)Survey ...
  • J. Wu, Z. Cai and Z. Gao, (20 10), "Dynamic ...
  • O.Kwon, W.Rhee and Y.Yoon, (201 5), "Application of classification algorithms ...
  • نمایش کامل مراجع