A Comparative Study of Multilayer Neural Network and C۴.۵Decision Tree Models for Predicting the Risk of Breast Cancer

Publish Year: 1397
نوع سند: مقاله ژورنالی
زبان: English
View: 136

متن کامل این Paper منتشر نشده است و فقط به صورت چکیده یا چکیده مبسوط در پایگاه موجود می باشد.
توضیح: معمولا کلیه مقالاتی که کمتر از ۵ صفحه باشند در پایگاه سیویلیکا اصل Paper (فول تکست) محسوب نمی شوند و فقط کاربران عضو بدون کسر اعتبار می توانند فایل آنها را دریافت نمایند.

  • Certificate
  • من نویسنده این مقاله هستم

این Paper در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

JR_ARCHB-5-1_001

تاریخ نمایه سازی: 24 خرداد 1400

Abstract:

Background: Diagnosing breast cancer at an early stage can have a great impacton cancer mortality. One of the fundamental problems in cancer treatment is the lackof a proper method for early detection, which may lead to diagnostic errors. Usingdata analysis techniques can significantly help in early diagnosis of the disease. Thepurpose of this study was to evaluate and compare the efficacy of two data miningtechniques, i.e., multilayer neural network and C۴.۵, in early diagnosis of breastcancer. Methods: A data set from Motamed Cancer Institute's breast cancer researchclinic, Tehran, containing ۲۸۶۰ records related to breast cancer risk factors wereused. Of the records, ۱۱۴۱ (۴۰%) were related to malignant changes and breastcancer and ۱۷۱۹ (۶۰%) to benign tumors. The data set was analyzed usingperceptron neural network and decision tree algorithms, and was split into two atraining data set (۷۰%) and a testing data set (۳۰%) using Rapid Miner ۵.۲. Results: For neural networks, accuracy was ۸۰.۵۲%, precision ۸۸.۹۱%, andsensitivity ۹۰.۸۸%; and for decision tree, accuracy was ۸۰.۹۸%, precision ۸۰.۹۷%,and sensitivity ۸۹.۳۲%. Results indicated that both algorithms have acceptablecapabilities for analyzing breast cancer data. Conclusion: Although both models provided good results, neural networkshowed more reliable diagnosis for positive cases. Data set type and analysismethod affect results. On the other hand, information about more powerful riskfactors of breast cancer, such as genetic mutations, can provide models with highcoverage.

Authors

Soolmaz Sohrabi

Shahid Beheshti University of Medical Sciences, Department of Medical Informatics, Tehran, Iran

Alireza Atashi

Department of E-Health, Virtual School, Tehran University of Medical Sciences, Tehran, Iran- Medical Informatics Department, Breast Cancer Research Center, Motamed cancer institute (ACECR), Tehran, Iran

Ali Dadashi

Mashhad University of Medical Sciences, Department Of Medical Informatics, Mashhad, Iran

Sina Marashi

Department of E-Health, Virtual School, Tehran University of Medical Sciences, Tehran, Iran