CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Comparison of the performance of machine learning algorithms in predicting heart disease

عنوان مقاله: Comparison of the performance of machine learning algorithms in predicting heart disease
شناسه ملی مقاله: JR_IJIMI-10-1_049
منتشر شده در در سال 1400
مشخصات نویسندگان مقاله:

Sajad Yousefi - Department of Electrical Engineering, Technical and Vocational University (TVU), Tehran, Iran

خلاصه مقاله:
Introduction: Heart disease is often associated with conditions such as clogged arteries due to the sediment accumulation which causes chest pain and heart attack. Many people die due to the heart disease annually. Most countries have a shortage of cardiovascular specialists and thus, a significant percentage of misdiagnosis occurs. Hence, predicting this disease is a serious issue. Using machine learning models performed on multidimensional dataset, this article aims to find the most efficient and accurate machine learning models for disease prediction.Material and Methods: Several algorithms were utilized to predict heart disease among which Decision Tree, Random Forest and KNN supervised machine learning are highly mentioned. The algorithms are applied to the dataset taken from the UCI repository including ۲۹۴ samples. The dataset includes heart disease features. To enhance the algorithm performance, these features are analyzed, the feature importance scores and cross validation are considered.Results:The algorithm performance is compared with each other, so that performance based on ROC curve and some criteria such as accuracy, precision, sensitivity and F۱ score were evaluated for each model. As a result of evaluation, Accuracy, AUC ROC are ۸۳% and ۹۹% respectively for Decision Tree algorithm. Logistic Regression algorithm with accuracy and AUC ROC are ۸۸% and ۹۱% respectively has better performance than other algorithms. Therefore, these techniques can be useful for physicians to predict heart disease patients and prescribe them correctly.Conclusion: Machine learning technique can be used in medicine for analyzing the related data collections to a disease and its prediction. The area under the ROC curve and evaluating criteria related to a number of classifying algorithms of machine learning to evaluate heart disease and indeed, the prediction of heart disease is compared to determine the most appropriate classification. As a result of evaluation, better performance was observed in both Decision Tree and Logistic Regression models.

کلمات کلیدی:
Machine Learning, Heart Disease, Dataset, Decision Tree, Logistic Regression

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1500458/