COI code: TETSCONF02_029
Paper Language: English
Title: Improving classification accuracy of imbalanced data by Forest Algorithm
Authors:
Zahra Vahedinia - Department of Computer Engineering, University of Tabriz, Tabriz, Iran
Mohammad-Reza Feizi-Derakhshi - Department of Computer Engineering, University of Tabriz, Tabriz, Iran
Abstract: Imbalanced data are data in which the two classes are not equally represented, so that one class has fewer samples than the other. Unfortunately, most real-world datasets used to train systems such as adult-content filters, disease diagnosis and intrusion detection are imbalanced. Such data degrade the training quality of supervised learning methods. The Forest algorithm is a recently proposed optimization method that has not previously been applied to data balancing. In this paper, the Forest algorithm is used to balance data. The proposed method was evaluated with four classifiers: Naive Bayes, artificial neural networks, decision tree and nearest neighbor. It was also compared with other data balancing methods, namely RS, SRAND, BRC and BRC+RS. According to the results, the average detection rate of the proposed method was 5.7% higher than with the original imbalanced data, 3.1% higher than RS, 3.6% higher than SRAND, 5.3% higher than BRC and 2.4% higher than BRC+RS. The highest detection rate, 98%, was achieved with the Naive Bayes classifier.
Keywords: Imbalanced data, Forest algorithm
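To make the notion of class imbalance in the abstract concrete, the following minimal Python sketch measures the imbalance of a label set and then balances it by plain random oversampling. This is a generic baseline chosen for illustration; it is not the Forest-algorithm method of the paper, and the source does not say whether it corresponds to any of the compared methods (RS, SRAND, BRC, BRC+RS). The function names and the toy data are hypothetical.

```python
import random
from collections import Counter


def class_counts(labels):
    """Return a mapping from class label to number of samples."""
    return Counter(labels)


def imbalance_ratio(labels):
    """Majority-class size divided by minority-class size (1.0 means balanced)."""
    counts = class_counts(labels)
    return max(counts.values()) / min(counts.values())


def random_oversample(samples, labels, seed=0):
    """Duplicate randomly chosen minority samples until all classes are equal in size.

    Generic baseline for illustration only; NOT the paper's Forest-algorithm method.
    """
    rng = random.Random(seed)
    counts = class_counts(labels)
    target = max(counts.values())
    out_samples, out_labels = list(samples), list(labels)
    for cls, n in counts.items():
        # Pool of original samples belonging to this class.
        pool = [s for s, y in zip(samples, labels) if y == cls]
        for _ in range(target - n):
            out_samples.append(rng.choice(pool))
            out_labels.append(cls)
    return out_samples, out_labels


# Hypothetical toy data: 95 "normal" samples and 5 "intrusion" samples.
X = [[float(i)] for i in range(100)]
y = ["normal"] * 95 + ["intrusion"] * 5

X_bal, y_bal = random_oversample(X, y)
print(imbalance_ratio(y))      # 19.0
print(imbalance_ratio(y_bal))  # 1.0
```

Oversampling only duplicates existing minority samples, so it adds no new information. It should be applied to the training split only, so that duplicated samples do not leak into the test set.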
How to cite this paper: To refer to this article in your research, you can use the following entry in the references section:
Vahedinia, Zahra & Mohammad-Reza Feizi-Derakhshi, 2020, Improving classification accuracy of imbalanced data by Forest Algorithm, 2nd International Conference on Innovative Technologies in Science, Engineering and Technology, Munich, Germany, Arvin Alborz Conference Company, https://www.civilica.com/Paper-TETSCONF02-TETSCONF02_029.html
Wherever the article or one of its findings is mentioned in the text, give the following details in parentheses after the mention:
First time: (Vahedinia, Zahra & Mohammad-Reza Feizi-Derakhshi, 2020)
Second and subsequent times: (Vahedinia & Feizi-Derakhshi, 2020)