Selective Sampling via Similarity Factor for Sequential Imbalanced Data

Considering labeling problems in some online processes, sampling strategies with limited budget would save both time and cost for the operator; while keeping results in an acceptable range. Selective sampling has made supervised learning possible for some expensive or inapplicable online processes. A good sampling strategy give this opportunity to label prediction system to have labels that will improve system’s accuracy. An ideal sampling strategywould query labels of data points that have been classified in wrong class. Sampling problem has common characteristics with anomaly detection problem. This paper proposes a selective sampling method based on principal curves. Proposed algorithm returns a similarity factor between input stream and existing structures of predictor system in each step. No randomness is involved in the proposed algorithm to make it applicable in sensitive real-world problems.

Keywords:

selective sampling , query budget , principal curves

Authors

Aref Hakimzadeh

School of Electrical and Computer Engineering Shiraz University Shiraz, Fars

Koorush Ziarati

School of Electrical and Computer Engineering Shiraz University Shiraz, Fars

Certificate
من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

https://civilica.com/doc/1281547

شناسه ملی سند علمی:

TECCONF05_037

تاریخ نمایه سازی: 11 مهر 1400

How to Cite to This Paper:

If you want to refer to this Paper in your research work, you can simply use the following phrase in the resources section:

Hakimzadeh, Aref and Ziarati, Koorush,1400,Selective Sampling via Similarity Factor for Sequential Imbalanced Data,Fifth National Conference on Computer Engineering,https://civilica.com/doc/1281547

Scientometrics

The specifications of the publisher center of this Paper are as follows:

Ranking of Shiraz University

Type of center: دانشگاه دولتی

Paper count: 27,386

In the scientometrics section of CIVILICA, you can see the scientific ranking of the Iranian academic and research centers based on the statistics of indexed articles.

مقالات مرتبط جدید