基于FPGA的卷积神经网络加速模块设计
梅志伟,王维东

Design of Convolutional Neural Network acceleration module based on FPGA
Zhiwei Mei,Weidong Wang
表4 VGG?16卷积层加速性能
Table 4 The acceleration performance of convolutional layers of VGG?16
VGG?16网络层乘加数量时间(ms)性能(GOPS)乘加阵列效率
conv10.0867 G9.0619.279.35%
conv21.8496 G18.16203.7199.51%
conv30.9248 G9.05204.3999.80%
conv41.8497 G18.18203.4999.40%
conv50.9248 G9.06204.1699.73%
conv61.8497 G18.38201.2798.32%
conv71.8496 G18.18203.4999.40%
conv80.9248 G9.29199.1197.28%
conv91.8496 G18.58199.1197.27%
conv101.8496 G18.58199.1197.27%
conv110.4624 G5.32173.8485.03%
conv120.4624 G5.32173.8485.03%
conv130.4624 G5.32173.8485.03%
卷积层15.347 G162.48189.03