Quantitative Similarity-based Evaluation of Text Retrieval Algorithms

Publish Year: 1388
نوع سند: مقاله کنفرانسی
زبان: English
View: 2,082

This Paper With 6 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

CSICC14_051

تاریخ نمایه سازی: 24 خرداد 1388

Abstract:

Text retrieval engines, such as search engines,always return a list of documents in response to a given query. Existing evaluations of text retrieval algorithms mostly use Precision and Recall of the returned list of documents as main quality measures of a search engine. In this paper, we propose a novel approach for comparing different algorithms adopted by different search engines and evaluate their performance. In our approach, the results of each algorithm is treated as an inter-related set of documents and the effectiveness of the algorithm is evaluated based on the degree of relation in the set of documents. After verifying the correctness of the evaluation measure by examining the results of the two retrieval algorithms, BM25 and pivoted normalization, and comparing these results with an ideal ranking, we compare the results of these algorithms and investigate the impact of certain major factors like stemming on the results of the suggested algorithm. The effectiveness of our proposed method is justified through obtained xperimental results

Authors

Parastoo Didari

School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran

Behrad Babai

School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran

Azadeh Shakery

School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran