Sentimental Categorization of Persian News Headlines using Three Machine Learning Techniques Versus Human Categorization

Publish Year: 1398
نوع سند: مقاله ژورنالی
زبان: English
View: 187

This Paper With 10 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

این Paper در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

JR_JACR-10-4_007

تاریخ نمایه سازی: 13 اردیبهشت 1400

Abstract:

The aim of this paper is to elaborate on an attempt to classify Persian news headlines using machine learning techniques rather than human-based analysis. Three major techniques namely Naïve Bayes, Maximum Entropy and Support Vector Machine were introduced and applied to Persian news headlines. Results were compared with each other as well as the human analysis. It is concluded that these techniques outperform human analysis and one technique (Naïve Bayes) is superior to all the techniques mentioned. It can be concluded from this study that the inclusion of discourse analysis is necessary in order to attain better results since the whole is not necessarily the sum of the parts. It means that what you see in the headline does not necessarily reflect what is mentioned in the news itself. So it is recommended that in future studies, elements from discourse analysis be introduced into these algorithms so that better results can be achieved.

Authors

Vahid Mirzaeian

ELT Department, Alzahra University, Tehran, Iran