An Auto-Indexing Method for Persian Text
Publish place: همایش جامع بین المللی کامپیوتر، فناوری اطلاعات و مهندسی برق
Publish Year: 1396
نوع سند: مقاله کنفرانسی
زبان: English
View: 322
This Paper With 12 Page And PDF and WORD Format Ready To Download
- Certificate
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
ITCOMI01_025
تاریخ نمایه سازی: 24 شهریور 1397
Abstract:
This paper studies an approach to automatic indexing Persian context based on Persian grammatical rules in order to produce back-of-the-book index. Automatic indexing means automatically extract or select words from a document to create index. In this work, in order to present an approach for automatic indexing, SVM (Support Vector Machine) has been used to produce an intelligent system. The corpus has been applied is Bijankhan corpus which is a manually tagged Persian text collection. To evaluate proposed system, a book entitled Natural Low was considered as test set, while the index section of this book was done manually by human agent and compared with the automatic system. In this study, achieved precision and recall, were 53% and 90%, respectively.
Keywords:
Authors
Maryam Moasheri
Department of Computer, Arak Branch, Islamic Azad University, Arak, Iran