Selective Sampling via Similarity Factor for Sequential Imbalanced Data

Publish Year: 1400
نوع سند: مقاله کنفرانسی
زبان: English
View: 413

This Paper With 7 Page And PDF and WORD Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

TECCONF05_037

تاریخ نمایه سازی: 11 مهر 1400

Abstract:

Considering labeling problems in some online processes, sampling strategies with limited budget would save both time and cost for the operator; while keeping results in an acceptable range. Selective sampling has made supervised learning possible for some expensive or inapplicable online processes. A good sampling strategy give this opportunity to label prediction system to have labels that will improve system’s accuracy. An ideal sampling strategywould query labels of data points that have been classified in wrong class. Sampling problem has common characteristics with anomaly detection problem. This paper proposes a selective sampling method based on principal curves. Proposed algorithm returns a similarity factor between input stream and existing structures of predictor system in each step. No randomness is involved in the proposed algorithm to make it applicable in sensitive real-world problems.

Authors

Aref Hakimzadeh

School of Electrical and Computer Engineering Shiraz University Shiraz, Fars

Koorush Ziarati

School of Electrical and Computer Engineering Shiraz University Shiraz, Fars