CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

A Graph-based Approach for Persian Entity Linking

عنوان مقاله: A Graph-based Approach for Persian Entity Linking
شناسه ملی مقاله: JR_ITRC-12-3_006
منتشر شده در در سال 1399
مشخصات نویسندگان مقاله:

Majid Asgari-Bidhendi - Computer Engineering School Iran University of Science and Technology Tehran, Iran
Farzane Fakhrian - Computer Engineering School Iran University of Science and Technology Tehran, Iran
Behrouz Minaei-Bidgoli - Computer Engineering School Iran University of Science and Technology Tehran, Iran

خلاصه مقاله:
Most of the data on the web is in the form of natural language, but natural language is highly ambiguous, especially when it comes to the frequent occurrence of entities. The goal of entity linking is to find entity mentions and link them to their corresponding entities in an external knowledge base. Recently, FarsBase was introduced as the first Persian knowledge base with nearly ۷۵۰,۰۰۰ entities. This research suggested one of the first end-to-end unsupervised entity linking systems specifically for Persian, using context and graph-based features to rank candidate entities. To evaluate the proposed method, we used the first Persian entity-linking dataset created by crawling social media text from some popular Telegram channels. The ParsEL results show that the F-Score of the input data set is ۸۷.۱% and is comparable to any other entity-linking system that supports Persian.

کلمات کلیدی:
Unsupervised Entity Linking, Entity Disambiguation, Persian Language, FarsBase, Knowledge Graph, Social Media Corpus

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1422197/