CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

A Survey on Modern Automatic Text Subject Recognition Methods

عنوان مقاله: A Survey on Modern Automatic Text Subject Recognition Methods
شناسه ملی مقاله: COMCONF07_058
منتشر شده در هفتمین کنگره ملی تازه یافته های مهندسی برق ایران در سال 1399
مشخصات نویسندگان مقاله:

Ali Naserasadi - Computer Group, Zarand Higher Education Complex, Zarand, Kerman, Iran,
Majid Estilayee - Technical and Engineering, Payam-e Nour, Tehran, Iran,

خلاصه مقاله:
The automatic identification of text subject is the basis for a variety of information analysis mechanisms, such as document classification, retrieval, and frontier identification of fields. Therefore, the research on automatic identification methods of text subject is significant. In This paper the key technologies of current text subject recognition are systematically investigated, including the method of obtaining topic words, the calculation of the correlation strength of knowledge units, and the subject analysis methods and practices for multi-relationship fusion. On the basis of summing up the shortcomings of the current text subject recognition methods, the paper proposes a comprehensive method for acquiring topic words, which is used in combination with the extraction range and the grammatical and semantic levels.

کلمات کلیدی:
Topic Recognition; Text Analysis; Subject Mining; Semantic Analysis

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1037702/