CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Persian Wordnet Construction using Supervised Learning

عنوان مقاله: Persian Wordnet Construction using Supervised Learning
شناسه ملی مقاله: JR_ITRC-9-2_005
منتشر شده در در سال 1396
مشخصات نویسندگان مقاله:

Zahra Mousavi
Heshaam Faili
Marzieh Fadaee

خلاصه مقاله:
This paper presents an automated supervised method for Persian wordnet construction. Using a Persian corpus and a bi-lingual dictionary, the initial links between Persian words and Princeton WordNet synsets have been generated. These links will be discriminated later as correct or incorrect by employing seven features in a trained classification system. The whole method is just a classification system which has been trained on a train set containing a pre-existing Persian wordnet, FarsNet, as a set of correct instances. A set of some sophisticated distributional and semantic features is proposed to be used in the classification system. Furthermore, a set of randomly selected links have been added to training data as incorrect instances. The links classified as correct are collected to be included in the final wordnet. State of the art results on the automatically derived Persian wordnet is achieved. The resulted wordnet with a precision of 91.18% includes more than 16,000 words and 22,000 synsets.

کلمات کلیدی:
wordnet, ontology, supervised, Persian language

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1165917/