Job Failure Prediction in Grid Environment Based on Workload Characteristics

Publication year: 1388
Document type: Conference paper
Language: English

This paper is 6 pages long and is available for download in PDF format.

National scientific document ID: CSICC14_006

Indexing date: 24 Khordad 1388

Abstract:

The power of grid technology to aggregate autonomous resources owned by several organizations into a single virtual system has made it popular for compute-intensive and data-intensive applications. The complex and dynamic nature of the grid makes the failure of users' jobs fairly probable. Furthermore, traditional methods of job failure recovery have proven costly, so such systems need to shift toward proactive and predictive management strategies. In this paper, an innovative effort is made to predict the outcome of jobs submitted to a production grid environment (AuverGrid). Grid workload traces were analyzed to extract patterns describing common failure characteristics, and the success or failure status of jobs during 6 months of AuverGrid activity was predicted with around 96% accuracy. The quality of service on the grid can be improved by integrating the results of this work into management services such as scheduling and monitoring.

Authors

Hamid Fadishei

Parallel and Distributed Processing Lab, Ferdowsi University of Mashhad, Iran

Hamid Saadatfar

Parallel and Distributed Processing Lab, Ferdowsi University of Mashhad, Iran

Hossein Deldari

Parallel and Distributed Processing Lab, Ferdowsi University of Mashhad, Iran