A Distant Supervised Approach for Relation Extraction in Farsi Texts

Publication year: 1399 (Solar Hijri)
Document type: Journal article
Language: English

Length: 8 pages (PDF)

National scientific document ID:

JR_IJWR-3-2_001

Indexing date: 23 Khordad 1400

Abstract:

The volume of Farsi information on the Internet has grown in recent years, but most of it takes the form of unstructured or semi-structured free text. Quick and accurate access to the knowledge in these texts requires information extraction methods that can populate knowledge bases. Relation extraction, a sub-task of information extraction, has received much attention in recent years. Most such systems, however, have been developed for English and other widely studied languages, and information extraction for Farsi has received less attention from researchers. This study presents a system for semi-automatic relation extraction that uses Persian Wikipedia articles as a reliable, semi-structured source. The system extracts relations with the help of patterns obtained automatically through distant supervision. To apply distant supervision, the large Wikidata knowledge base, which is closely synchronized with Wikipedia, serves as the source of known facts. The results show an average precision of 76.81% across all relations, which indicates an improvement in precision over other methods for Farsi.
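
The idea of pattern learning by distant supervision can be illustrated with a minimal sketch. It is not the authors' implementation: the toy knowledge base, the English sentences, the relation names, and the helper functions `learn_patterns`, `select_patterns`, and `extract` are all assumptions made for illustration, standing in for Wikidata triples and Persian Wikipedia text. Sentences that mention both entities of a known triple are assumed to express that relation, the text between the two entities is collected as a candidate pattern, and rare patterns are discarded as likely noise.

```python
import re
from collections import Counter, defaultdict

# Hypothetical toy triples standing in for Wikidata facts.
KB = [
    ("Hafez", "place_of_birth", "Shiraz"),
    ("Saadi", "place_of_birth", "Shiraz"),
    ("Tehran", "capital_of", "Iran"),
    ("Paris", "capital_of", "France"),
]

# Hypothetical toy sentences standing in for Wikipedia text.
SENTENCES = [
    "Hafez was born in Shiraz",
    "Saadi was born in Shiraz",
    "Tehran is the capital of Iran",
    "Paris is the capital of France",
    "Hafez wrote poems about Shiraz",  # matches a triple but is a noisy label
]


def learn_patterns(kb, sentences):
    """Count the text found between subject and object of each known triple."""
    patterns = defaultdict(Counter)
    for subj, rel, obj in kb:
        regex = re.escape(subj) + r"\s+(.+?)\s+" + re.escape(obj)
        for sentence in sentences:
            match = re.search(regex, sentence)
            if match:
                patterns[rel][match.group(1)] += 1
    return patterns


def select_patterns(patterns, min_count=2):
    """Keep only patterns seen at least min_count times for a relation."""
    return {
        rel: [p for p, count in counts.items() if count >= min_count]
        for rel, counts in patterns.items()
    }


def extract(sentence, selected):
    """Apply the selected patterns to a new sentence and return triples."""
    triples = []
    for rel, pats in selected.items():
        for pattern in pats:
            regex = r"^(.+?)\s+" + re.escape(pattern) + r"\s+(.+)$"
            match = re.match(regex, sentence)
            if match:
                triples.append((match.group(1), rel, match.group(2)))
    return triples


selected = select_patterns(learn_patterns(KB, SENTENCES))
new_triples = extract("Ferdowsi was born in Tus", selected)
```

In this sketch the sentence "Hafez wrote poems about Shiraz" is wrongly labeled as a birthplace mention, which is the typical noise of distant supervision; the frequency threshold removes it. A real system would need entity linking and language-specific preprocessing rather than plain string matching.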

Authors

Shireen Atarod

Department of Computer Engineering, Science and Research Branch, Azad University, Tehran, Iran.

Alireza Yari

Assistant Professor, ICT Research Institute (ITRC), Tehran, Iran