عنوان انگلیسی مقاله:
A Survey and Taxonomy of FPGA-based Deep Learning Accelerators
ترجمه فارسی عنوان مقاله:
مرور و طبقه بندی شتاب دهنده های یادگیری عمیق مبتنی بر FPGA
Sciencedirect - Elsevier - Journal of Systems Architecture, 98 (2019) 331-345: doi:10:1016/j:sysarc:2019:01:007
Ahmed Ghazi Blaiech a , b , ∗ , Khaled Ben Khalifa a , b , Carlos Valderrama c , Marcelo A.C. Fernandes d , Mohamed Hedi Bedoui b
Deep learning, the fastest growing segment of Artificial Neural Network (ANN), has led to the emergence of many machine learning applications and their implementation across multiple platforms such as CPUs, GPUs and recon- figurable hardware ( Field-Programmable Gate Arrays or FPGAs). However, inspired by the structure and function of ANNs, large-scale deep learning topologies require a considerable amount of parallel processing, memory re- sources, high throughput and significant processing power. Consequently, in the context of real time hardware systems, it is crucial to find the right trade-offbetween performance, energy efficiency, fast development, and cost. Although limited in size and resources, several approaches have showed that FPGAs provide a good starting point for the development of future deep learning implementation architectures. Through this paper, we briefly review recent work related to the implementation of deep learning algorithms in FPGAs. We will analyze and compare the design requirements and features of existing topologies to finally propose development strategies and implementation architectures for better use of FPGA-based deep learning topologies. In this context, we will examine the frameworks used in these studies, which will allow testing a lot of topologies to finally arrive at the best implementation alternatives in terms of performance and energy efficiency.
Keywords: Deep learning | Framework | Optimized implementation | FPGA