Configurable Hardware Architecture of Multidimensional Convolution Coprocessor

dc.contributor.authorGeranun Boonyuu
dc.contributor.authorSumek Wisayataksin
dc.date.accessioned2026-05-08T19:19:44Z
dc.date.issued2021-1-20
dc.description.abstractWe propose a configurable coprocessor for the convolutional neural network (CNN) that suit various models of CNN. It can operate 2D standard convolution, 2D depthwise separable convolution, 3D convolution, and a fully connected layer. The proposed processing cluster consists of 72 processing units (PUs) of half-precision floating-point to assist the main processor in embedded systems. The experimental results on Artix-7 FPGA revealed that our design has 12.16 GOPs per cluster. Moreover, this architecture was designed to be scalable for the systems with higher performance.
dc.identifier.doi10.1109/ica-symp50206.2021.9358447
dc.identifier.urihttps://dspace.kmitl.ac.th/handle/123456789/17175
dc.subjectAdvanced Neural Network Applications
dc.subjectAdvanced Vision and Imaging
dc.subjectAdvanced Image and Video Retrieval Techniques
dc.titleConfigurable Hardware Architecture of Multidimensional Convolution Coprocessor
dc.typeArticle

Files

Collections