Abstract

Convolutional neural networks (CNNs) have proven highly successful in the field of artificial intelligence (AI). Deploying CNNs on embedded devices at large scale would significantly advance the practical adoption of AI across industries. However, the memory and operation requirements of CNNs pose challenges for computing performance, memory bandwidth, and the flexibility of the executing hardware. This paper introduces a framework that addresses these issues through model quantization and hardware acceleration on a scalable vertical vector processor architecture. First, the framework includes a layer-fusion method designed to optimize hardware utilization. Second, data storage is optimized to improve memory efficiency. Third, CNNs are mapped onto the vertical vector processing concept of the hardware accelerator. The effectiveness of the proposed framework is evaluated by analyzing accelerator efficiency on a field-programmable gate array (FPGA). The results demonstrate that the framework offers flexibility, configurability, and efficient mapping for typical CNN implementations, achieving up to 84% of the vector processor's peak performance on the VGG network.