SGF (Semantic Graphs Fusion ): A Knowledge based Representation of Textual Resources for Text Mining Applications

Publish Year: 1398
نوع سند: مقاله ژورنالی
زبان: English
View: 206

This Paper With 15 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

این Paper در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

JR_JIST-7-2_001

تاریخ نمایه سازی: 23 دی 1399

Abstract:

The proper representation of textual documents has been the greatest challenge in text mining applications. In this paper, a knowledge-based representation model for text analysis applications is introduced. The proposed functionalities of the system are achieved by integrating structured knowledge in the core components of the system. The semantic, lexical, syntactical and structural features are identified by the pre-processing module. The enrichment module is introduced to identify contextually similar concepts and concept maps for improving the representation. The information content of documents and the enriched contents are then fused (merged) into the graphical structure of a semantic network to form a unified and comprehensive representation of documents. The ۲۰Newsgroup and Reuters-۲۱۵۷۸ datasets are used for evaluation. The evaluation results suggest that the proposed method exhibits a high level of accuracy, recall and precision. The results also indicate that even when a small portion of the information content is available, the proposed method performs well in standard text mining applications.

Authors

Morteza Jaderyan

Department of Computer Engineering, Bu Ali Sina University, Hamedan, Iran

Hassan Khotanlou

Department of Computer Engineering, Bu Ali Sina University, Hamedan, Iran