CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

An improved joint model: POS tagging and dependency parsing

عنوان مقاله: An improved joint model: POS tagging and dependency parsing
شناسه ملی مقاله: JR_JADM-4-1_001
منتشر شده در شماره 1 دوره 4 فصل در سال 1395
مشخصات نویسندگان مقاله:

A. Pakzad - Department of Computer Engineering, Iran University of Science & Technology, Tehran, Iran.
B. Minaei Bidgoli - Department of Computer Engineering, Iran University of Science & Technology, Tehran, Iran.

خلاصه مقاله:
Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipeline models, a tagging error propagates, but the model is not able to apply useful syntactic information. The goal of joint models simultaneously reduce errors of POS tagging and dependency parsing tasks. In this research, we attempted to utilize the joint model on the Persian and English language using Corbit software. We optimized the model s features and improved its accuracy concurrently. Corbit software is an implementation of a transition-based approach for word segmentation, POS tagging and dependency parsing. In this research, the joint accuracy of POS tagging and dependency parsing over the test data on Persian, reached 85.59% for coarse-grained and 84.24% for fine-grained POS. Also, we attained 76.01% for coarse-grained and 74.34% for fine-grained POS on English.

کلمات کلیدی:
Joint model, Part-Of-Speech, Dependency Parsing, Persian Language

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/894162/