CIVILICA We Respect the Science
(Specialized publisher of the country's conferences / Publishing license number from the Ministry of Culture and Islamic Guidance: 8971)

Automatic Classification for Vietnamese News

Paper title: Automatic Classification for Vietnamese News
National paper ID: JR_ACSIJ-4-4_018
Published in issue 4, volume 4, July, year 1394 (Solar Hijri calendar)
Authors:

Phan Thi Ha - Posts and Telecommunications Institute of Technology, Hanoi, Vietnam
Nguyen Quynh Chi - Posts and Telecommunications Institute of Technology, Hanoi, Vietnam

Abstract:
This paper proposes an automatic framework for classifying Vietnamese news from news sites on the Internet. In the proposed framework, the main content of each Vietnamese news item is extracted automatically by applying the improved performance extraction method from [1]. The extracted content is then classified using two machine learning methods: the support vector machine and the naïve Bayesian method. Our experiments, carried out on Vietnamese news extracted from several sites, showed that the proposed classification framework gives acceptable results with rather high accuracy, which supports applying it to real information systems.
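
To make the classification step concrete, the following is a minimal sketch of one of the two methods named above, a multinomial naïve Bayes text classifier with add-one smoothing. It is not the authors' implementation: the abstract does not specify features, preprocessing, or categories, so the whitespace tokenizer, the class name `NaiveBayesClassifier`, the toy sentences, and the labels `sports` and `business` are all assumptions made for illustration. The support vector machine variant is not sketched here.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: plain whitespace tokenization. Real Vietnamese text
    # usually needs a word segmenter, which the abstract does not describe.
    return text.lower().split()


class NaiveBayesClassifier:
    """Multinomial naive Bayes with add-one (Laplace) smoothing."""

    def __init__(self):
        self.class_doc_counts = Counter()        # documents seen per class
        self.word_counts = defaultdict(Counter)  # token counts per class
        self.vocabulary = set()                  # all tokens seen in training

    def fit(self, documents, labels):
        for text, label in zip(documents, labels):
            self.class_doc_counts[label] += 1
            for token in tokenize(text):
                self.word_counts[label][token] += 1
                self.vocabulary.add(token)
        return self

    def predict(self, text):
        total_docs = sum(self.class_doc_counts.values())
        vocab_size = len(self.vocabulary)
        tokens = [t for t in tokenize(text) if t in self.vocabulary]
        best_label, best_score = None, -math.inf
        for label, doc_count in self.class_doc_counts.items():
            # Start from the log prior of the class.
            score = math.log(doc_count / total_docs)
            total_words = sum(self.word_counts[label].values())
            # Add the smoothed log likelihood of each known token.
            for token in tokens:
                count = self.word_counts[label][token]
                score += math.log((count + 1) / (total_words + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy corpus, for illustration only (not data from the paper).
train_docs = [
    "đội tuyển bóng đá thắng trận",
    "cầu thủ ghi bàn trong trận đấu",
    "ngân hàng tăng lãi suất",
    "thị trường chứng khoán giảm điểm",
]
train_labels = ["sports", "sports", "business", "business"]

model = NaiveBayesClassifier().fit(train_docs, train_labels)
predicted = model.predict("trận đấu bóng đá")
print(predicted)
```

In a setup like the one the abstract describes, the training documents would be the automatically extracted main content of news pages, and the labels would be the news categories of the source sites.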

Keywords:
news classification; automatic extraction; support vector machine; naïve Bayesian networks

Paper page and full-text download: https://civilica.com/doc/405236/