A Partial Method for Calculating CNN Networks Based On Loop Tiling
Publish place: International Journal of Information and Communication Technology Research (IJICT، Vol: 15، Issue: 2
Publish Year: 1401
نوع سند: مقاله ژورنالی
زبان: English
View: 105
This Paper With 7 Page And PDF Format Ready To Download
- Certificate
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
JR_ITRC-15-2_002
تاریخ نمایه سازی: 22 مرداد 1402
Abstract:
Convolutional Neural Networks (CNNs) have been widely deployed in the fields of artificial intelligence and computer vision. In these applications, the CNN part is the most computationally intensive. When these applications are run in an embedded device, the embedded processor can hardly handle the processing. This paper implements loop tiling to explain how one can construct a lightweight, low-power, and efficient CNN hardware accelerator for embedded computing devices. This method breaks a large CNN engine into small CNN engines and calculates them by low hardware resources. Finally, the results of small CNN engines are added and concatenated to construct the large CNN output. Using this method, a small accelerator can be configured to run a wide range of large CNNs. A small accelerator with one layer is designed to evaluate our methodology. Our initial investigations show that based on our methodology, the constructed accelerator can run a modified version of MobileNetV۱, ۷۰ times per second.
Keywords:
Authors
Ali Ali A.D. Farahani
School of Computer Engineering Iran University of Science and Technology Tehran, Iran
Hakem Beitollahi
School of Computer Engineering Iran University of Science and Technology Tehran, Iran
Mahmood Fathy
School of Computer Engineering Iran University of Science and Technology Tehran, Iran
Reza Barangi
School of Computer Engineering Iran University of Science and Technology Tehran, Iran