An Auto-Indexing Method for Persian Text

Publish Year: 1396
نوع سند: مقاله کنفرانسی
زبان: English
View: 322

This Paper With 12 Page And PDF and WORD Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

ITCOMI01_025

تاریخ نمایه سازی: 24 شهریور 1397

Abstract:

This paper studies an approach to automatic indexing Persian context based on Persian grammatical rules in order to produce back-of-the-book index. Automatic indexing means automatically extract or select words from a document to create index. In this work, in order to present an approach for automatic indexing, SVM (Support Vector Machine) has been used to produce an intelligent system. The corpus has been applied is Bijankhan corpus which is a manually tagged Persian text collection. To evaluate proposed system, a book entitled Natural Low was considered as test set, while the index section of this book was done manually by human agent and compared with the automatic system. In this study, achieved precision and recall, were 53% and 90%, respectively.

Authors

Maryam Moasheri

Department of Computer, Arak Branch, Islamic Azad University, Arak, Iran