PTokenizer: POS Tagger Tokenizer

Publish Year: 1394
نوع سند: مقاله کنفرانسی
زبان: English
View: 460

This Paper With 5 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

KBEI02_247

تاریخ نمایه سازی: 5 بهمن 1395

Abstract:

By the advent of new information sources and the expansion of text data, natural language processing (NLP) has become one of the key parts of all the systems dealing with human written texts, and part of speech (POS) tagging is an inseparable part of all NLP tasks. As a result, it is of the paramount importance to enhance the accuracy of POS tagging. In this paper, applying language model and statistical information, we introduce a new approach to tokenize sentences and prepare them to be labeled by POS taggers. An evaluation shows that the proposed method yields a precision of 98 percent for tokenizing, and applying it to a Maximum Likelihood and TnT POS taggers achieve improvement in the accuracy of Persian POS tagging

Keywords:

Tokenizer , Part of Speech Tagging , Probabilistic Model , Compound Tokens

Authors

Saeed Rahmani

Department of Computer and IT Engineering Shiraz University Shiraz, Iran

Seyyed Mostafa Fakhrahmad

Department of Computer and IT Engineering Shiraz University Shiraz, Iran

Mohammad Hadi Sadredini

Department of Computer and IT Engineering Shiraz University Shiraz, Iran

مراجع و منابع این Paper:

لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :
  • _ _ _ natural language ...
  • Brants, Thorsten. "TnT: a statistical part-of-speech tagger." Proceedings _ _ ...
  • Brill, Eric. "A simple rule-based part of speech tagger." Proceedings ...
  • Daelemans, Walter, et al. "MBT: A memory-based part of speech ...
  • Schmid, Helmut. "Probabilistic part-of-speech tagging using decision trees ."Proceedings of ...
  • Ratnaparkhi, Adwait. "A maximum entropy model for part-of-Sspeech tagging. "Proceedings ...
  • Halacsy, Peter, Andras Kornai, and Csaba Oravec, "HunPos: an open ...
  • Zavrel, Jakub, and Walter Daelemans. "Evaluatie van part-of-5speech taggers voor ...
  • _ _ _ _ _ _ _ _ _ _ ...
  • Oroumchian, Farhad, et al. "Creating a feasible corpus for Persian ...
  • Azimizadeh, Ali, Mohammad Mehdi Arab, and Saeid Rahati Quchani. "Persian ...
  • Amiri, Hadi, et al. "A Survey of part of speech ...
  • Sagot, Benoit, et al. "A new morphological lexicon and a ...
  • Shamsfard, Mehmoush, and Hakimeh Fadaei. "A Hybrid Morphology- Based POS ...
  • Copyright Notice is: 9 78-1 -4673-6506-2/ _ 00 C2015 IEEE ...
  • نمایش کامل مراجع