Configurable Hardware Architecture of Multidimensional Convolution Coprocessor
| dc.contributor.author | Geranun Boonyuu | |
| dc.contributor.author | Sumek Wisayataksin | |
| dc.date.accessioned | 2026-05-08T19:19:44Z | |
| dc.date.issued | 2021-1-20 | |
| dc.description.abstract | We propose a configurable coprocessor for the convolutional neural network (CNN) that suit various models of CNN. It can operate 2D standard convolution, 2D depthwise separable convolution, 3D convolution, and a fully connected layer. The proposed processing cluster consists of 72 processing units (PUs) of half-precision floating-point to assist the main processor in embedded systems. The experimental results on Artix-7 FPGA revealed that our design has 12.16 GOPs per cluster. Moreover, this architecture was designed to be scalable for the systems with higher performance. | |
| dc.identifier.doi | 10.1109/ica-symp50206.2021.9358447 | |
| dc.identifier.uri | https://dspace.kmitl.ac.th/handle/123456789/17175 | |
| dc.subject | Advanced Neural Network Applications | |
| dc.subject | Advanced Vision and Imaging | |
| dc.subject | Advanced Image and Video Retrieval Techniques | |
| dc.title | Configurable Hardware Architecture of Multidimensional Convolution Coprocessor | |
| dc.type | Article |