CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

FarsWikiKG: an Automatically Constructed Knowledge Graph for Persian

عنوان مقاله: FarsWikiKG: an Automatically Constructed Knowledge Graph for Persian
شناسه ملی مقاله: JR_IJWR-4-2_004
منتشر شده در در سال 1400
مشخصات نویسندگان مقاله:

Farhad Shirmardi - Amirkabir University of Technology
Seyed Mohammad Hadi Hosseini - Amirkabir University of Technology
Saeedeh Momtazi - Amirkabir University of Technology

خلاصه مقاله:
We present FarsWikiKG, a Persian knowledge graph extracted from Wikipedia. Wikipedia infoboxes have been used as a valuable resource for building knowledge graphs in recent years. FarsWikiKG consists of more than ۲ million entities, as well as ۵.۷ million facts about the entities. Using Wikidata, we constructed an ontology with more than ۶۰۰۰ classes representing entity types. As the second Persian knowledge graph, which has the ability of self-update, FarsWikiKG shows improvement on NLP tasks, especially question answering systems. Although FarsWikiKG is a dynamic knowledge graph, our evaluation shows a coverage of ۹۰% on Persian Wikipedia pages. As Wikipedia information is constantly changing, a fixed knowledge graph can provide unstable data to the user. The proposed system, in addition to solving the problem of unstable data, reduces the need for experts to extract and construct knowledge graphs manually. Storing information in RDF as a standard method of storing knowledge graph information, FarsWikiKG allows NLP systems to run SPARQL queries on it.

کلمات کلیدی:
Knowledge Graph, Wikipedia, RDF, Persian

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1505670/