Effective Learning to Rank for the Persian Web Content

Keyhanipour

Effective Learning to Rank for the Persian Web Content

Publish place: Journal of Information Technology Management، Vol: 11، Issue: 4

Publish Year: 1398

نوع سند: مقاله ژورنالی

زبان: English

This Paper With 18 Page And PDF Format Ready To Download

دریافت فایل کامل Paper

Certificate
من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

https://civilica.com/doc/1400834

شناسه ملی سند علمی:

JR_JITM-11-4_007

تاریخ نمایه سازی: 26 بهمن 1400

Abstract:

Persian language is one of the most widely used languages in the Web environment. Hence, the Persian Web includes invaluable information that is required to be retrieved effectively. Similar to other languages, ranking algorithms for the Persian Web content, deal with different challenges, such as applicability issues in real-world situations as well as the lack of user modeling. CF-Rank, as a recently proposed learning to rank data, aims to deal with such issues by the classifier fusion idea. CF-Rank generates a few click-through features, which provide a compact representation of a given primitive dataset. By constructing the primitive classifiers on each category of click-through features and aggregating their decisions by the use of information fusion techniques, CF-Rank has become a successful ranking algorithm in English datasets. In this paper, CF-Rank is customized for the Persian Web content. Evaluation results of this algorithm on the dotIR dataset indicate that the customized CF-Rank outperforms baseline rankings. Especially, the improvement is more noticeable at the top of ranked lists, which are observed most of the time by the Web users. According to the NDCG@۱ and MAP evaluation criteria, comparing the CF-Rank with the preeminent baseline algorithm on the dotIR dataset indicates an improvement of ۳۰ percent and ۱۶.۵ percent, respectively.

Keywords:

Learning to rank , Persian language , CF-Rank algorithm , dotIR dataset , Information fusion

Authors

Keyhanipour

Assistant Professor, Computer Engineering Department, Faculty of Engineering, College of Farabi, University of Tehran, Iran.

مراجع و منابع این Paper:

لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :

AdvancedWebRanking. (۲۰۱۹, July). Google Organic CTR History, Fresh CTR averages ...
Busa-Fekete, R., Kégl, B., Éltető, T., & Szarvas, G. (۲۰۱۳). ...
Cen, R., Liu, Y., Zhang, M., Zhou, B., Ru, L., ...
Chapelle, O., & Chang, Y. (۲۰۱۱). Yahoo! Learning to Rank ...
Darrudi, E., Hashemi, H. B., AleAhmad, A., Zare Bidoki, A., ...
Derhami, V., Paksima, J., & Khajeh, H. (۲۰۱۹). RRLUFF: Ranking ...
Filev, D., & Yager, R. R. (۱۹۹۴). Learning OWA operator ...
Hashemi, H. B., Yazdani, N., Shakery, A., & Naeini, M. ...
Holte, R. C. (۱۹۹۳). Very simple classification rules perform well ...
Joachims, T. (۲۰۰۲). Optimizing search engine using clickthrough data. The ...
Keyhanipour, A., Moshiri, B., & Rahgozar, M. (۲۰۱۵). CF-Rank: Learning ...
Khodadadian, E., Ghasemzadeh, M., Derhami, V., & Mirsoleimani, A. (۲۰۱۲). ...
Liu, T.-Y. (۲۰۱۱). Learning to rank for information retrieval. Springer-Verlag ...
Macdonald, C., Santos, R. L., & Ounis, I. (۲۰۱۲). On ...
Makvana, K., Patel, J., Shah, P., & Thakkar, A. (۲۰۱۸). ...
Manning, C. D., Raghavan, P., & Schütze, H. (۲۰۰۸). An ...
Mitchell, T. (۱۹۹۷). Machine Learning. McGraw Hill ...
Qin, T., & Liu, T.-Y. (۲۰۱۳). Introducing LETOR ۴.۰ datasets. ...
W۳Techs. (۲۰۱۹, July). Usage of content languages for websites. W۳Techs. ...

نمایش کامل مراجع