Improving the Classification of Unknown Documents by Concept Graph

Publication Year: 1388
Document Type: Conference paper
Language: English

This paper is 6 pages long and is available for download in PDF format.

National Scientific Document ID:

CSICC14_095

Indexing Date: 24 Khordad 1388

Abstract:

A concept graph represents the relationships between the concepts of a language. In this structure, the relationship between any two words is represented by a weighted edge, and the weight is interpreted as the degree of relevance between the two words. Given such a graph, we can retrieve the words most relevant to a given term. In this paper, we propose a method that uses a concept graph to improve the classification of documents from unknown sources. In our method, features are first selected from a training set using a well-known feature selection algorithm. Then, by extracting the most relevant words for each class from the concept graph, a more effective feature set is produced. Our experimental results show improvements of 1% and 8% in precision and recall, respectively.
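
The feature-expansion idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how the concept graph is stored or how the most relevant words are chosen per class. The sketch assumes the graph is a dictionary of weighted neighbours, that each class already has a set of selected features, and that each feature is expanded with its top-k neighbours. The names `most_relevant`, `expand_features`, the parameter `k`, and the toy data are all hypothetical.

```python
from typing import Dict, List, Set

# Assumed representation: word -> {neighbouring word: relevance weight}.
ConceptGraph = Dict[str, Dict[str, float]]


def most_relevant(graph: ConceptGraph, term: str, k: int) -> List[str]:
    """Return up to k neighbours of `term`, highest edge weight first."""
    neighbours = graph.get(term, {})
    # Sort by descending weight; break ties alphabetically for determinism.
    ranked = sorted(neighbours.items(), key=lambda item: (-item[1], item[0]))
    return [word for word, _ in ranked[:k]]


def expand_features(
    graph: ConceptGraph, class_features: Dict[str, Set[str]], k: int
) -> Dict[str, Set[str]]:
    """Add the top-k related words of every selected feature to its class."""
    expanded: Dict[str, Set[str]] = {}
    for label, features in class_features.items():
        extra: Set[str] = set()
        for term in features:
            extra.update(most_relevant(graph, term, k))
        expanded[label] = set(features) | extra
    return expanded


# Toy data, invented purely for illustration.
graph: ConceptGraph = {
    "goal": {"football": 0.9, "match": 0.7, "target": 0.2},
    "election": {"vote": 0.8, "parliament": 0.6},
}
# Stand-in for the output of a feature selection step on the training set.
selected = {"sports": {"goal"}, "politics": {"election"}}
expanded = expand_features(graph, selected, k=2)
```

In this sketch, a document from an unknown source that mentions "football" but never "goal" can still match the sports class. This is one plausible way an expanded feature set could raise recall, although the abstract does not describe the mechanism.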

Authors

Morteza Mohaqeqi

ECE Department, University of Tehran, Tehran, Iran

Reza Soltanpoor

Computer Department, Islamic Azad University of Tehran North branch, Tehran, Iran

Azadeh Shakery

ECE Department, University of Tehran, Tehran, Iran