A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features
Publish Year: 1399
Type: Journal paper
Language: English
Pages: 10 (PDF)
Document National Code:
JR_JADM-8-2_007
Index date: 22 July 2020
Abstract
Named Entity Recognition (NER) is an information extraction technique that identifies named entities in a text. Three approaches have conventionally been used to extract named entities: rule-based methods, machine-learning-based methods, and hybrids of the two. Machine-learning-based methods perform well on Persian when they are trained with good features. To achieve good performance in Conditional Random Field (CRF)-based Persian NER, several syntactic features based on dependency grammar, along with some morphological and language-independent features, were designed for the learning phase. In this implementation, the designed features were applied to a Conditional Random Field to build the model. To evaluate the system, the Persian syntactic dependency Treebank, which contains about 30,000 sentences and was prepared at the NOOR Islamic science computer research center, was used. This Treebank has named-entity tags such as Person, Organization, and Location. The results showed that the approach achieved 86.86% precision, 80.29% recall, and 83.44% F-measure, which are relatively higher than the values reported for other Persian NER methods.
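To make the idea of combining syntactic, morphological, and language-independent features more concrete, the following is a minimal sketch of per-token feature extraction for a CRF tagger. It is not the authors' feature set: the `Token` layout, the feature names, the dependency labels, and the transliterated example sentence are all assumptions chosen for illustration. Lists of feature dictionaries of this shape are the kind of input that CRF toolkits such as sklearn-crfsuite accept.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical token record: (word, POS tag, dependency relation, head index).
# The head index is None for the root of the sentence.
Token = Tuple[str, str, str, Optional[int]]


def token_features(sentence: List[Token], i: int) -> Dict[str, str]:
    """Build a feature dict for token i (illustrative features, not the paper's)."""
    word, pos, dep_rel, head = sentence[i]
    return {
        "word": word,
        "pos": pos,
        # Morphological cues: short prefix and suffix of the surface form.
        "prefix2": word[:2],
        "suffix2": word[-2:],
        # Syntactic cues taken from the dependency parse.
        "dep_rel": dep_rel,
        "head_word": sentence[head][0] if head is not None else "<ROOT>",
        # Language-independent context cues: neighbouring words.
        "prev_word": sentence[i - 1][0] if i > 0 else "<BOS>",
        "next_word": sentence[i + 1][0] if i < len(sentence) - 1 else "<EOS>",
    }


def sentence_features(sentence: List[Token]) -> List[Dict[str, str]]:
    """Extract one feature dict per token in the sentence."""
    return [token_features(sentence, i) for i in range(len(sentence))]


# Toy transliterated sentence ("Ali went to Tehran"); tags and labels are made up.
EXAMPLE: List[Token] = [
    ("Ali", "N", "SBJ", 3),
    ("be", "PREP", "VPP", 3),
    ("Tehran", "N", "POSDEP", 1),
    ("raft", "V", "ROOT", None),
]

if __name__ == "__main__":
    for feats in sentence_features(EXAMPLE):
        print(feats)
```

Each sentence becomes a list of feature dictionaries paired with a list of labels (for example Person, Organization, Location, or a non-entity tag), which is the form a linear-chain CRF is trained on.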
Authors
L. Jafar Tafreshi
Computer Research Center of Islamic Sciences (CRCIS), Tehran, Iran.
F. Soltanzadeh
General Linguistics Department, Allameh Tabatabaei University, Tehran, Iran.