A New High Performance GPU-based Approach to Prime Numbers Generation

Publish Year: 1393
نوع سند: مقاله کنفرانسی
زبان: English
View: 1,298

This Paper With 8 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

INDMATH01_066

تاریخ نمایه سازی: 10 شهریور 1393

Abstract:

SIMD Parallelization is one of the most useful ways of decreasing the computation time and increases the performance of computation intensive algorithms. To do such process, we could execute some processes on several machines by using different platforms like MPI, OpenMP and distribute the workload by using message passing and shared memory. One of the most popular and high performance methods is using an array of graphical processors (GPU) which is used in this paper to present a new technique to save data and do computation by overclocking sieve algorithm make use of CUDA coding. This method shows a good performance upgrade in computation time and memory usage on generating prime numbers in compare with CPU handling.

Authors

Amin Nezarat

Department of Computer, Payame Noor University, I.R.Iran

M.M Raja

Computer Department, Shiraz University, I.R.Iran

Gh Datghaibifard

Computer Department, Shiraz University, I.R.Iran

مراجع و منابع این Paper:

لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :
  • Message Passing Interface Forum (MPIF) MPI: A Message- Passing Interface ...
  • David R. Butenhof (1997). Programming with POSIX Threads. Addi son-Wesley, ...
  • Electronic Frontier Foundation (EFF) Cooperative Computing Awards. (March 1999) http ...
  • H. N. Gabow (2006). Introduction o Algorithms. University of Colorado, ...
  • H. Halberstam and H.E. Richert (1974). Sieve Method, Academic Press, ...
  • T. H. Myer and I. E. Sutherland, "On the design ...
  • J. D. Owens, M. Houston, D. Luebke, S. Green, J. ...
  • K. Datta, M. Murphy, V. Volkov, S. Williams, J. Carter, ...
  • USA: IEEE Press, 2008, pp. 1-12. [Online]. Available: http ://www.eecs ...
  • C. Huang, O. S. Lawlor, and L. V. Kal e, ...
  • O. S. Lawlor, M. Page, and J. Genetti, "MPI: Powerwall ...
  • Z. Fan, F. Qiu, and A. Kaufman, "ZippyGPU: ...
  • Programming toolkit for general-purpose computation on GPU clusters, " in ...
  • http : /gpgpu _ org/static/s c2006/works hop/SBU ZippyGPUAb stract.pdf ...
  • texture Distributedء [12] A. Moerschell and J. D. Owens, memory ...
  • _ .idav.ucdavis _ pub?pub ...
  • J. A. Stuart and J. D. Owens, "Message passing on ...
  • D. A. Patterso. "Latency lags bandwith, " Commun. ACM, vol. ...
  • _ Carter, The Game Asset Pipeline .Charles River Media, 2O4. ...
  • P. Micikevicius, _ finite difference computation on GPUs using CUDA, ...
  • Processing Units. New York, NY, USA: ACM, 2009, pp. 79-84. ...
  • L. Wesolowski _ application programming interface for general purpose graphics ...
  • نمایش کامل مراجع