New Method Of Feature Selection For Persian Text MiningBased On Evolutionary Algorithms

Publish Year: 1394
نوع سند: مقاله ژورنالی
زبان: English
View: 476

This Paper With 5 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

JR_ACSIJ-4-6_007

تاریخ نمایه سازی: 4 خرداد 1395

Abstract:

Today, with the increasingly growing volume of textinformation, text classification methods seem to be essential.Also, increase in the volume of Persian text resources adds tothe importance of this issue. However, classification workswhich have been especially done in Persian are not still asextensive as those of Latin, Chinese, etc. In this paper, a systemfor Persian text classification is presented. This system is able toimprove the standards of accuracy, retrieval and total efficiency.To achieve this goal, in this system, after texts preprocessingand feature extraction, a new improved method of featureselection based on Particle Swarm Optimization algorithm(PSO) is innovated for reducing dimension of feature vector.Eventually, the classification methods are applied in the reducedfeature vector. To evaluate feature selection methods in theproposed classification system, classifiers of support vectormachine (SVM), Naive Bayes, K nearest neighbor (KNN) andDecision Tree are employed. Results of the tests obtained fromthe implementation of the proposed system on a set ofHamshahri texts indicated its improved precision, recall, andoverall efficiency. Also, SVM classification method had betterperformance in this paper.

Authors

Akram Roshdi

Department of Computer, Islamic Azad University, Khoy Branch,Iran