سیویلیکا را در شبکه های اجتماعی دنبال نمایید.

A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features

Publish Year: 1399
Type: Journal paper
Language: English
View: 447

This Paper With 10 Page And PDF Format Ready To Download

این Paper در بخشهای موضوعی زیر دسته بندی شده است:

Export:

Link to this Paper:

Document National Code:

JR_JADM-8-2_007

Index date: 22 July 2020

A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features abstract

Named Entity Recognition is an information extraction technique that identifies name entities in a text. Three popular methods have been conventionally used namely: rule-based, machine-learning-based and hybrid of them to extract named entities from a text. Machine-learning-based methods have good performance in the Persian language if they are trained with good features. To get good performance in Conditional Random Field-based Persian Named Entity Recognition, a several syntactic features based on dependency grammar along with some morphological and language-independent features have been designed in order to extract suitable features for the learning phase. In this implementation, designed features have been applied to Conditional Random Field to build our model. To evaluate our system, the Persian syntactic dependency Treebank with about 30,000 sentences, prepared in NOOR Islamic science computer research center, has been implemented. This Treebank has Named-Entity tags, such as Person, Organization and location. The result of this study showed that our approach achieved 86.86% precision, 80.29% recall and 83.44% F-measure which are relatively higher than those values reported for other Persian NER methods.

A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features Keywords:

A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features authors

L. Jafar Tafreshi

Computer Research Center of Islamic Sciences (CRCIS), Tehran, Iran.

F. Soltanzadeh

General Linguistics Department, Allameh Tabatabaei University, Tehran, Iran.