JZUS - Journal of Zhejiang University SCIENCE

Journal of Zhejiang University SCIENCE C 1998 Vol.-1 No.-1 P.

SAPER-AI accelerator: systolic array based power-efficient reconfigurable AI accelerator

Author(s): Fahad bin MUSLIM^†^‡¹, Kashif INAYAT^†², Muhammad zain SIDDIQI^†¹, Safiullah KHAN³, Tayyeb MAHMOOD⁴, Ihtesham ul ISLAM⁵
Affiliation(s): ¹Faculty of Computer Science and Engineering, GIK Institute 23460, Pakistan
Corresponding email(s): fahad.muslim@giki.edu.pk, kashif.inayat@bsc.es, zain.siddiqi@giki.edu.pk
Key Words: AI accelerators, ASIC design, Systolic arrays, Low power designs


Fahad bin MUSLIM^†^‡¹, Kashif INAYAT^†², Muhammad zain SIDDIQI^†¹, Safiullah KHAN³, Tayyeb MAHMOOD⁴, Ihtesham ul ISLAM⁵. SAPER-AI accelerator: systolic array based power-efficient reconfigurable AI accelerator[J]. Frontiers of Information Technology & Electronic Engineering, 1998, -1(-1): .

@article{title="SAPER-AI accelerator: systolic array based power-efficient reconfigurable AI accelerator",
author="Fahad bin MUSLIM^†^‡¹ , Kashif INAYAT^†², Muhammad zain SIDDIQI^†¹ , Safiullah KHAN³,Tayyeb MAHMOOD⁴ , Ihtesham ul ISLAM⁵",
journal="Frontiers of Information Technology & Electronic Engineering",
volume="-1",
number="-1",
pages="",
year="1998",
publisher="Zhejiang University Press & Springer",
doi="10.1631/FITEE.2400867"
}

%0 Journal Article
%T SAPER-AI accelerator: systolic array based power-efficient reconfigurable AI accelerator
%A Fahad bin MUSLIM^†^‡¹
%A Kashif INAYAT^†²
%A Muhammad zain SIDDIQI^†¹
%A Safiullah KHAN³
%A Tayyeb MAHMOOD⁴
%A Ihtesham ul ISLAM⁵
%J Journal of Zhejiang University SCIENCE C
%V -1
%N -1
%P
%@ 2095-9184
%D 1998
%I Zhejiang University Press & Springer
%DOI 10.1631/FITEE.2400867

Abstract: Deep learning (DL) accelerators are critical for handling the growing computational demands of modern neural networks. Systolic array (SA) based accelerators consist of a 2D mesh of processing elements (PEs) that work cooperatively to accelerate matrix multiplication, a fundamental operation in DL. The power efficiency of such accelerators is of primary importance, especially for edge AI. This work presents the SAPER-AI accelerator, an SA accelerator whose power intent is specified in a simplified manner via a Unified Power Format (UPF) representation, with negligible micro-architectural optimization effort. The proposed accelerator switches off rows and columns of PEs in a coarse-grained manner, so that the SA micro-architecture adapts to the varying computational requirements of modern DL workloads. Our analysis demonstrates power-efficiency gains of 11% and 25% for the best-case 32×32 and 64×64 SA designs, respectively. Additionally, the power delay product (PDP) improved progressively, by around 6%, for larger SA sizes. Moreover, a performance comparison between the MobileNet and ResNet50 models indicated generally better SA performance for the ResNet50 workload. This is because ResNet50 exhibits more regular convolutions, which SAs favor, and the performance gap widens as the SA size increases.
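The coarse-grained row and column gating described in the abstract can be illustrated with a small Python sketch. It is not the authors' implementation: the names `GatingPlan` and `plan_gating`, the `group` granularity parameter, and the rule of rounding a workload tile up to whole groups of rows and columns are all assumptions made for illustration. The sketch only counts which PEs would stay powered for a given tile and computes a power delay product as power multiplied by delay.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GatingPlan:
    """Powered region of an N x N systolic array (illustrative model only)."""
    array_size: int
    active_rows: int
    active_cols: int

    @property
    def active_pes(self) -> int:
        # PEs that remain powered after gating unused rows and columns.
        return self.active_rows * self.active_cols

    @property
    def gated_fraction(self) -> float:
        # Share of PEs that are switched off.
        return 1.0 - self.active_pes / (self.array_size ** 2)


def plan_gating(array_size: int, tile_rows: int, tile_cols: int,
                group: int = 4) -> GatingPlan:
    """Keep just enough row/column groups powered to fit the tile.

    `group` is a hypothetical gating granularity (rows or columns per
    power domain); the source does not state the granularity used.
    """
    if array_size <= 0 or group <= 0 or array_size % group != 0:
        raise ValueError("array_size must be a positive multiple of group")
    if not (0 < tile_rows <= array_size and 0 < tile_cols <= array_size):
        raise ValueError("tile must fit inside the array")

    def round_up(n: int) -> int:
        # Ceiling to the next multiple of `group`.
        return -(-n // group) * group

    return GatingPlan(array_size, round_up(tile_rows), round_up(tile_cols))


def power_delay_product(power_watts: float, delay_seconds: float) -> float:
    """PDP in joules: average power multiplied by execution delay."""
    return power_watts * delay_seconds
```

For example, under these assumptions `plan_gating(32, 10, 17)` keeps 12 rows and 20 columns powered, so most of a 32×32 array is gated for that tile. Actual power savings depend on the technology, leakage, and gating overheads, none of which this sketch models.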
