Implementation of Fuzzy K - means clustering algorithm and comparison of its performance on different bioinformatics data sets (gene expression)

Publish Year: 1399
نوع سند: مقاله کنفرانسی
زبان: English
View: 362

This Paper With 6 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

ITCT11_049

تاریخ نمایه سازی: 18 اردیبهشت 1400

Abstract:

Clustering or cluster analysis in statistics and machine learning is one of the branches of uncontrolled learning. In this paper, the implementation and comparison of the performance of K - means fuzzy clustering algorithm on four different bioinformatics datasets using MATLAB software is discussed. Since the performance of each algorithm varies according to the problem we are dealing with, the performance of the algorithm was compared to the biological data set published by KAGGLE. To evaluate this algorithm, the Silhouette Value was used and the best value for K (number of clusters) was obtained. The results show the efficiency of this algorithm for different values of the number of clusters according to the characteristics of each data set to the optimal value according to Specify and use input data.

Authors

Hassan Nosrati Nahook

Department of Computer Engineering, Higher Education Complex of Saravan, Saravan, Iran

Shayesteh Tabatabaei

Department of Computer Engineering, Higher Education Complex of Saravan, Saravan, Iran