CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Web Page Streams and Relevance Propagation for Topic Distillation

عنوان مقاله: Web Page Streams and Relevance Propagation for Topic Distillation
شناسه ملی مقاله: JR_ITRC-6-1_005
منتشر شده در در سال 1392
مشخصات نویسندگان مقاله:

Mohammad Amin Golshani
Ali Mohammad Zareh Bidoki

خلاصه مقاله:
Over the past decade, several studies in field of relevance propagation models have been proposed to improve quality of web search, which include hyperlink-based score propagation, hyperlinkbased term propagation and popularity-based relevance propagation models; however, all of them have used low precision content similarity functions in the propagation process and their throughputs are not entirely satisfactory. In this paper, two stream-based content similarity functions that could be used to derive new relevance propagation models were introduced. In the proposed content similarity functions, the web page was split to different streams with different degrees of importance and the text of each web page was divided between these streams. To evaluate the proposed relevance propagation models, Letor ۳.۰ (including two standard web test collections) was used in the experiments. It was concluded that splitting web pages as different streams could provide significant improvement in relevance propagation models.

کلمات کلیدی:
Web page streams, relevance propagation, topic distillation, information retrieval, search engine, web page ranking

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1425784/