CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Thai Insult Detection System Based on Linguistic Features Analysis

عنوان مقاله: Thai Insult Detection System Based on Linguistic Features Analysis
شناسه ملی مقاله: SASTECH06_080
منتشر شده در ششمین کنفرانس بین المللی پیشرفتهای علوم و تکنولوژی در سال 1391
مشخصات نویسندگان مقاله:

Tanasanti Jirapon - Technology of Information System Management, Faculty of Engineering, Mahidol University, Nakorn Pathom, Thailand
Phokharatkul Pisit - Dept. of Computer Engineering, Faculty of Engineering, Mahidol University, Nakorn Pathom, Thailand
Buntilov Vladimir
Kanoksilpatham Budsaba - English Department, Faculty of Arts, Silpakorn University, Nakorn Pathom ۷۳۰۰۰, Thailand

خلاصه مقاله:
Verbal insults often appear in online communities during textual communication between users. Current automatic prevention algorithms which employ regular expression techniques for word filtering tend to result in high false-positive errors. This paper presents an alternative method for detecting insults in Thai textual conversations based on the analysis of linguistic features. The performance of the presented algorithms was compared with the regular expression based algorithms, in terms of precision and recall scores. The results of the experiments showed that the inaccuracies in the employed third-party natural language processing procedures affected the performance of the proposed insult detection method. Once the problematic NLP procedures were improved, the proposed method outperforms regular expression based algorithms, showing lower false-positive error rate.

کلمات کلیدی:
Regular Expression, Word Filter, Natural Language Processing, Online Communities, Linguistic Features

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/158966/