A Real-time Image Processing Hardware Acceleration Method based on FPGA - Details

Author：

Yuan, Haiying (Yuan, Haiying.) | Ding, Dong (Ding, Dong.) | Fan, Zhongwei (Fan, Zhongwei.) | Sun, Zengyang (Sun, Zengyang.)

Indexed by：

EI Scopus

Abstract：

Real-time　image　sensed　by　the　visual　sensor　usually　contains　a　lot　of　noise　information.　Model　reasoning,　and　pattern　recognition-oriented　CNNs　face　such　thorny　issues　as　excessive　computation,　poor　accuracy　and　high　resource　occupancy.　Hence,　CNN　architecture　was　heterogeneously　deployed　on　the　Zynq　platform　to　realize　hardware　acceleration　for　the　image　processing　algorithm.　MNIST　dataset　was　adopted　to　train　CNN　for　extracting　network　parameters　on　PC　terminal　under　the　Caffe　framework;　the　convolutional　layer　responsible　for　heavy　computational　load　was　deployed　onto　FPGA　for　parallel　computing　to　increase　system　speed;　input　layer　and　output　layer　responsible　for　a　small　amount　computation　were　placed　on　ARM　terminal　to　reduce　resource　consumption;　real-time　image　acquired　by　the　camera　was　binarized　to　highlight　image　features　and　improve　the　recognition　accuracy;　the　hardware　acceleration　performance　of　the　heterogeneously　deployed　CNN　was　verified　with　numerous　experiments　on　image　recognition　of　handwritten　numerals.　Experimental　results　indicated　that:　CNN　hardware　accelerator　kept　an　image　recognition　accuracy　up　to　99.02%　which　is　largely　equivalent　to　that　of　client　PC;　When　recognizing　a　single　piece　of　handwritten　numerical　sample,　under　the　use　of　optimized　instructions　and　100MHz　clock　frequency,　the　recognition　time　of　a　single　image　is　0.53s,　which　is　16　times　faster　than　pure　ARM　operation;　the　maximum　power　consumption　of　the　system　is　2.606W,　which　is　far　Lower　than　general-purpose　processors.　©　2021　IEEE.

Keyword：

Acceleration Image recognition Energy efficiency Field programmable gate arrays (FPGA) Convolution Deep learning Character recognition Convolutional neural networks General purpose computers Image enhancement

Author Community：

[ 1 ] [Yuan, Haiying]Beijing University of Technology, Faculty of Information Technology, Beijing; 100124, China
[ 2 ] [Ding, Dong]Beijing University of Technology, Faculty of Information Technology, Beijing; 100124, China
[ 3 ] [Fan, Zhongwei]Beijing University of Technology, Faculty of Information Technology, Beijing; 100124, China
[ 4 ] [Sun, Zengyang]Beijing University of Technology, Faculty of Information Technology, Beijing; 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Universal Accelerator Software and Hardware Collaborative Design for YOLO Algorithm
2022，2022 International Conference on Electronic Information Technology, EIT 2022
LSFQ: A Low-Bit Full Integer Quantization for High-Performance FPGA-Based CNN Acceleration
2022，IEEE Micro
Animal Image Recognition Method Based on Two-pass Feature Fusion
2022，10th IEEE Joint International Information Technology and Artificial Intelligence Conference, ITAIC 2022
An algorithm based on AVS encoding on FPGA multi-core pipeline
2013，2013 5th International Conference on Computational and Information Sciences, ICCIS 2013

Source ：

Year： 2021

Page： 200-205

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 3

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 16

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to