Frontiers of Information Technology & Electronic Engineering
ISSN 2095-9184 (print), ISSN 2095-9230 (online)
2023 Vol.24 No.4 P.509-520
Programming bare-metal accelerators with heterogeneous threading models: a case study of Matrix-3000
Abstract: As the hardware industry moves toward using specialized heterogeneous many-core processors to avoid the effects of the power wall, software developers are finding it hard to deal with the complexity of these systems. In this paper, we share our experience of developing a programming model and its supporting compiler and libraries for Matrix-3000, which is designed for next-generation exascale supercomputers but has a complex memory hierarchy and processor organization. To assist its software development, we have developed a software stack from scratch that includes a low-level programming interface and a high-level OpenCL compiler. Our low-level programming model offers native programming support for using the bare-metal accelerators of Matrix-3000, while the high-level model allows programmers to use the OpenCL programming standard. We detail our design choices and highlight the lessons learned from developing system software to enable the programming of bare-metal accelerators. Our programming models have been deployed in the production environment of an exascale prototype system.
Key words: Heterogeneous computing; Parallel programming models; Programmability; Compilers; Runtime systems
1College of Computer, National University of Defense Technology, Changsha 410073, China
2School of Computing, University of Leeds, Leeds LS2 9JT, UK
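Abstract (Chinese version, translated): As processor design shifts toward specialized heterogeneous many-core architectures to avoid the effects of the power wall, software developers find it difficult to handle the complexity of these processor systems. New processors typified by Matrix-3000 have a complex memory hierarchy and processor organization, and are high-performance processors designed for next-generation exascale supercomputers. This paper shares our experience of developing a parallel programming model for Matrix-3000, together with its supporting compiler and libraries. To assist software development, we built a software stack for Matrix-3000 from scratch, including a low-level programming interface and a high-level OpenCL compiler. The low-level programming model provides native programming support for using the bare-metal accelerators of Matrix-3000, while the high-level model allows programmers to use the OpenCL parallel programming standard. We detail the design choices of this software stack and highlight the lessons learned from developing system software, so as to enable efficient programming of bare-metal accelerators and unlock their performance. Our programming models have been deployed in the production environment of an exascale prototype system.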
DOI: 10.1631/FITEE.2200359
CLC number: TP315
On-line Access: 2024-08-27
Received: 2023-10-17
Revision Accepted: 2024-05-08
Crosschecked: 2022-10-19