Ontology Creation and Population for Natural Language Processing Domain

Publish Year: 1397
Document type: Journal article
Language: English

Length: 10 pages (PDF)

National scientific document ID:

JR_IJWR-1-2_007

Indexing date: 21 Ordibehesht 1399

Abstract:

In this paper, we describe our proposed methodology for constructing an ontology of natural language processing (NLP). We use a semi-automatic method, a combination of rule-based and machine learning techniques, to construct and populate an ontology with bilingual (English-Persian) concept labels (a lexicon), and we evaluate it manually. The methodology yields a complete ontology of the NLP domain with 1333 classes (covering concepts, tools, applications, etc.), 88 object properties, and 2437 annotation assertions for different classes. The ontology is populated with about 428K NLP-related papers and 38K authors, along with about 5M "is Related to" relations between papers and ontology classes and 1M "is Author of" relations between papers and authors. The evaluation results show that the ontology achieved a good result. The instantiation enables applications to find experts, publications, and institutions (such as universities or research laboratories) related to various topics in the NLP field.
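To make the kind of structure the abstract describes more concrete, the following is a minimal sketch, not the authors' implementation. It assumes a toy in-memory model in which each ontology class carries English and Persian labels, papers are linked to classes by an "is Related to" relation, and authors are linked to papers by an "is Author of" relation. All names here (`TinyOntology`, `OntologyClass`, `experts_for`, the sample classes, labels, papers, and authors) are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class OntologyClass:
    """A domain concept with bilingual labels (hypothetical structure)."""
    name: str
    labels: dict = field(default_factory=dict)  # language code -> label


class TinyOntology:
    """Toy in-memory stand-in for a populated ontology (an assumption,
    not the representation used in the paper)."""

    def __init__(self):
        self.classes = {}                    # class name -> OntologyClass
        self.related_to = defaultdict(set)   # paper id -> class names
        self.author_of = defaultdict(set)    # author name -> paper ids

    def add_class(self, name, en, fa):
        self.classes[name] = OntologyClass(name, {"en": en, "fa": fa})

    def add_paper(self, paper_id, authors, topics):
        # Validate all topics first so a bad call leaves no partial state.
        for topic in topics:
            if topic not in self.classes:
                raise KeyError(f"unknown ontology class: {topic}")
        for topic in topics:
            self.related_to[paper_id].add(topic)
        for author in authors:
            self.author_of[author].add(paper_id)

    def experts_for(self, topic):
        """Rank authors by how many of their papers relate to the topic."""
        counts = {}
        for author, papers in self.author_of.items():
            n = sum(1 for p in papers if topic in self.related_to.get(p, ()))
            if n:
                counts[author] = n
        # Most papers first; ties broken alphabetically by author name.
        return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


# Illustrative data only; these are not entries from the actual ontology.
onto = TinyOntology()
onto.add_class("MachineTranslation", "machine translation", "ترجمه ماشینی")
onto.add_class("Parsing", "parsing", "تجزیه")
onto.add_paper("p1", ["A. Author", "B. Author"], ["MachineTranslation"])
onto.add_paper("p2", ["A. Author"], ["MachineTranslation", "Parsing"])

experts = onto.experts_for("MachineTranslation")
print(experts)
```

Under these assumptions, an expert-finding query reduces to joining the two relations: count, for each author, the papers that are related to the requested class. A real system at the scale reported in the abstract would presumably use an ontology store and query language rather than Python dictionaries.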

Authors

Niloofar Naderian

Computer Science and Engineering Faculty, Shahid Beheshti University, Tehran, Iran

Mehrnoush Shamsfard

Faculty of Computer Science and Engineering, Shahid Beheshti University of Technology, Tehran, Iran

Razieh Adelkhah

Faculty of Computer Science and Engineering, Shahid Beheshti University, Tehran, Iran