Using synthetic data and dimensionality reduction in high-dimensional classification via logistic regression

Publish Year: 1398
نوع سند: مقاله ژورنالی
زبان: English
View: 107

This Paper With 9 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

JR_CMDE-7-4_013

تاریخ نمایه سازی: 15 بهمن 1401

Abstract:

Traditional logistic regression  is plugged with degenerates and violent behavior in high-dimensional classification, because of   the problem of non-invertible matrices in estimating model parameters. In this paper, to overcome the high-dimensionality of data,  we introduce  two new algorithms. First, we  improve the efficiency of finite population Bayesian bootstrapping logistic regression classifier by using the rule of  majority vote.  Second, using simple random sampling without replacement to select a smaller number of covariates rather than the sample size and applying traditional logistic regression, we introduce the other new algorithm  for high-dimensional binary classification.   We compare the proposed algorithms with the regularized logistic regression  models and two other  classification algorithms, i.e., naive Bayes and K-nearest neighbors using both simulated and real data.

Authors

- -

Department of Statistics, Faculty of Science, University of Kurdistan, Sanandaj, Iran

- -

Department of Statistics, Faculty of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), Tehran, Iran