The Predictability of Tree-based Machine Learning Algorithms in the Big Data Context

Publish Year: 1400
نوع سند: مقاله ژورنالی
زبان: English
View: 336

This Paper With 8 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

این Paper در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

JR_IJE-34-1_010

تاریخ نمایه سازی: 6 اردیبهشت 1400

Abstract:

This research work is concerned with the predictability of ensemble and singular tree-based machine learning algorithms during the recession and prosperity of the two companies listed in the Tehran Stock Exchange in the context of big data. In this regard, the main issue is that economic managers and the academic community require predicting models with more accuracy and reduced execution time; moreover, the prediction of the companies recession in the stock market is highly significant. Machine learning algorithms must be able to appropriately predict the stock return sign during the market downturn and boom days. Addressing the stated challenge will upgrade the quality of stock purchases and, subsequently, will increase profitability. In this article, the proposed solution relies on the utilization of tree-based machine learning algorithms in the context of big data. The proposed solution exploits the decision tree algorithm, which is a traditional and singular tree-based learning algorithm. Furthermore, two modern and ensemble tree-based learning algorithms, random forest and gradient boosted tree, has been utilized for predicting the stock return sign during recession and prosperity. The mentioned cases were implemented by applying the machine learning tools in python programming language and PYSPARK library that is used explicitly for the big data context. The utilized research data of the current study are the shares information of two companies of the Tehran Stock Exchange. The obtained results reveal that the applied ensemble learning algorithms have performed better than the singular learning algorithms. Additionally, adding ۲۳ technical features to the initial data and subsequent applying of the PCA feature reduction method have demonstrated the best performance among other modes. In the meantime, it has been concluded that the initial data do not possess the proper resolution or generalizability, either during prosperity or recession.

Keywords:

Stock Market Big Data Prediction Machine Learning Tree , based Algorithms Ensemble Algorithms

Authors

F. Qolipour

Computer Engineering Department, Yazd University, Yazd, Iran

M. Ghasemzadeh

Computer Engineering Department, Yazd University, Yazd, Iran

N. Mohammad-Karimi

Computer Engineering Department, Yazd University, Yazd, Iran

مراجع و منابع این Paper:

لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :
  • Khedmati. M, Seifi. F, Azizi. M.J, “Time Series Forecasting of ...
  • Hemati. H.R., Ghasemzadeh. M,  Meinel. C, “A Hybrid Machine Learning ...
  • of Engineering, Transactions C: Aspects, Vol. 29, No. 9, (2016), ...
  • Liu, J, and Kemp. A, “Forecasting the sign of U.S. ...
  • Jiang. M, Liu. J, Zhang. L, and Liu. C, “An ...
  • Begenau. J, Farboodi. M, and Veldkamp. L, “Big data in ...
  • Breiman. L, “Bagging Predictors”, Machine Learning Archive, Vol. 24, No. ...
  • Freund. Y and Schapire. R.E, “Experiments with a New Boosting ...
  • Tsai. C.-F, Lin. Y.-C, Yen. D.C, and Chen. Y.M, “Predicting ...
  • Ballings. M, Van den Poel. D, Hespeels. N, and Gryp. ...
  • Basak. S, Kar. S, Saha. S, Khaidem. L, and Dey. ...
  • Weng. B, Martinez. W.G, Tsai. Y, Li. C, Lu. L, ...
  • نمایش کامل مراجع