基于FPGA的卷积神经网络加速模块设计
梅志伟,王维东

Design of Convolutional Neural Network acceleration module based on FPGA
Zhiwei Mei,Weidong Wang
表2 AlexNet硬件加速性能
Table 2 The hardware acceleration performance of AlexNet
AlexNet层数卷积核/步长时间(ms)乘加阵列效率
conv111×11/411.39.10%
conv25×5/14.498.42%
conv33×3/11.781.84%
conv43×3/12.683.66%
conv53×3/11.881.66%
fc16×6/112.13.07%
fc21×1/15.53.00%
fc31×1/11.42.74%