A Precise SVM Classification Model for Predictions with Missing Data

Publish Year: 1397
نوع سند: مقاله کنفرانسی
زبان: English
View: 693

This Paper With 14 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

این Paper در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

EMCE04_328

تاریخ نمایه سازی: 21 خرداد 1398

Abstract:

In a well-studied and controlled research work, missing data are the common occurrence can have the significant influence on the results and accuracy of the study. Missing data can cause biased estimates and leads to the wrong conclusions. This paper develops a model to classify a two-class problem and report the classification accuracy over the stratified 10 folds cross-validation based on the provided data with missing values. The dataset contains 14 features and two classes, in which there are missing data without any expressions with an arbitrary pattern. To classify the data and predicate the missing values, the data is preprocessed in the first step, and it has been sent to the proposed SVM (Support Vector Machine) model for further processes. In order to improve the accuracy of classification, the metaheuristic methods such as GSM (Grid Search Method), PSO (Particle Swarm Optimization), and GA (Genetic Algorithm) have been used to extract the best parameters, C (penalty parameter) and g (kernel function parameter) of the SVM, and then, their F-Measures have been calculated to choose the best model.

Authors

Chuyi Zheng

Department, Centre for Engineering Innovation (CEI), University of Windsor, ON N۹B ۱K۳, Canad