The XMLization of a Dependency Treebank in CoNLL Format for Evaluating Linguistic Queries using XQuery

Publication year: 1394 (Solar Hijri calendar)
Document type: Conference paper
Language: English

Length: 5 pages (PDF)


National scientific document ID: KBEI02_280

Indexing date: 5 Bahman 1395 (Solar Hijri calendar)

Abstract:

Treebanks are essential resources for both data-driven approaches to natural language processing (NLP) and empirical linguistic research. Developing these resources is time-consuming and costly, and requires specialized expertise. Therefore, they should be designed to be reused for different purposes. Currently, several languages have dependency treebanks annotated in CoNLL format. For some languages, such as Persian, these treebanks are among the few available linguistic resources. The CoNLL format is well suited as input to data-driven parsers, but querying linguistic data in it is not easy. In recent years, XML has been widely used for formatting treebanks, and various tools are available for querying and annotating a linguistic corpus in this format. In this paper, we present a tool for converting a dependency treebank in CoNLL format to an appropriate XML format. We designed the XML schema to be particularly suitable for writing linguistic queries in XQuery syntax.
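
The abstract does not spell out the XML schema, so the following is only a minimal sketch of the kind of conversion it describes, not the authors' tool. It assumes the standard ten-column CoNLL-X layout and a hypothetical schema in which each `word` element is nested under its head, so that dependency relations become parent/child steps in a path expression. The element and attribute names (`treebank`, `sentence`, `word`, `postag`, `deprel`) and the sample sentence are illustrative assumptions.

```python
import xml.etree.ElementTree as ET

# CoNLL-X column names (10 tab-separated fields per token line).
COLUMNS = ["id", "form", "lemma", "cpostag", "postag",
           "feats", "head", "deprel", "phead", "pdeprel"]


def sentence_to_xml(block):
    """Turn one CoNLL-X sentence (a list of token lines) into a <sentence> element."""
    tokens = [dict(zip(COLUMNS, line.split("\t"))) for line in block]
    sentence = ET.Element("sentence")
    # Create one <word> element per token, keyed by token id.
    # The element and attribute names are hypothetical, not the paper's schema.
    nodes = {}
    for tok in tokens:
        attrs = {k: tok[k] for k in ("id", "form", "lemma", "postag", "deprel")}
        nodes[tok["id"]] = ET.Element("word", attrs)
    # Attach each word under its head; head "0" marks the sentence root.
    for tok in tokens:
        parent = sentence if tok["head"] == "0" else nodes[tok["head"]]
        parent.append(nodes[tok["id"]])
    return sentence


def conll_to_xml(text):
    """Convert CoNLL-X text (sentences separated by blank lines) to a <treebank> element."""
    treebank = ET.Element("treebank")
    block = []
    # The extra empty string flushes the last sentence if the text has no trailing blank line.
    for line in text.splitlines() + [""]:
        if line.strip():
            block.append(line)
        elif block:
            treebank.append(sentence_to_xml(block))
            block = []
    return treebank


# Illustrative sample sentence (not taken from any treebank).
SAMPLE = "\n".join([
    "1\tShe\tshe\tPRON\tPRP\t_\t2\tnsubj\t_\t_",
    "2\treads\tread\tVERB\tVBZ\t_\t0\troot\t_\t_",
    "3\tbooks\tbook\tNOUN\tNNS\t_\t2\tobj\t_\t_",
])

treebank = conll_to_xml(SAMPLE)

# Words attached as "obj" directly under a word tagged VBZ.
query = ".//word[@postag='VBZ']/word[@deprel='obj']"
objects = [w.get("form") for w in treebank.findall(query)]
print(objects)
```

With this nesting, the same question can be put to an XQuery processor as the path expression `//word[@postag='VBZ']/word[@deprel='obj']`. Nesting discards surface word order in the tree structure, which is why the sketch keeps the original token `id` as an attribute.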

Authors

Ahmad Pouramini

Department of Computer Engineering, Sirjan University of Technology, Sirjan, Iran

Amine Naseri

Department of Computer Engineering, Sirjan University of Technology, Sirjan, Iran

Copyright notice: 978-1-4673-6506-2/… ©2015 IEEE