Frontiers of Information Technology & Electronic Engineering

ISSN 2095-9184 (print), ISSN 2095-9230 (online)

ONFS: a hierarchical hybrid file system based on memory, SSD, and HDD for high performance computers

Abstract: As supercomputers develop towards exascale, the number of compute cores increases dramatically, making more complex and larger-scale applications possible. The input/output (I/O) requirements of large-scale applications, workflow applications, and their checkpointing include substantial bandwidth and extremely low latency, posing a serious challenge to high performance computing (HPC) storage systems. Current underlying storage systems based on hard disk drives (HDDs) are increasingly unable to meet the requirements of next-generation exascale supercomputers. To rise to the challenge, we propose a hierarchical hybrid storage system, the on-line and near-line file system (ONFS). It leverages dynamic random access memory (DRAM) and solid state drives (SSDs) in compute nodes, and HDDs in storage servers, to build a three-level storage system in a unified namespace. It supports portable operating system interface (POSIX) semantics, and provides high bandwidth, low latency, and huge storage capacity. In this paper, we present the technical details of distributed metadata management, the strategy of memory borrow and return, data consistency, parallel access control, and the mechanisms guiding downward and upward migration in ONFS. We implement an ONFS prototype on the TH-1A supercomputer, and conduct experiments to test its I/O performance and scalability. The results show that the bandwidths of single-thread and multi-thread 'read'/'write' are 6-fold and 5-fold better than those of HDD-based Lustre, respectively. The I/O bandwidth of data-intensive applications in ONFS can be 6.35 times that in Lustre.

Key words: High performance computing, Hierarchical hybrid storage system, Distributed metadata management, Data migration
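
The three-level hierarchy and the downward/upward migration named in the abstract can be illustrated with a minimal sketch. This is not the ONFS implementation: the class names (`Tier`, `TieredStore`), the file-count capacities, and the least-recently-used eviction policy are all assumptions made for illustration only. The abstract does not state which migration policy ONFS uses.

```python
from collections import OrderedDict


class Tier:
    """One storage level; capacity is counted in files for simplicity (assumption)."""

    def __init__(self, name, capacity):
        self.name = name
        self.capacity = capacity      # None means unbounded
        self.files = OrderedDict()    # path -> data, least recently used first


class TieredStore:
    """Toy three-level store behind one namespace (hypothetical, not ONFS code)."""

    def __init__(self, capacities):
        # Fastest tier first; the last tier is assumed to be unbounded.
        self.tiers = [Tier(name, cap) for name, cap in capacities]

    def _put(self, level, path, data):
        tier = self.tiers[level]
        tier.files[path] = data
        tier.files.move_to_end(path)
        # Downward migration: push least recently used files to the next tier.
        while tier.capacity is not None and len(tier.files) > tier.capacity:
            victim, victim_data = tier.files.popitem(last=False)
            self._put(level + 1, victim, victim_data)

    def write(self, path, data):
        # Drop any stale copy, then place the new data in the fastest tier.
        for tier in self.tiers:
            tier.files.pop(path, None)
        self._put(0, path, data)

    def read(self, path):
        for tier in self.tiers:
            if path in tier.files:
                data = tier.files.pop(path)
                # Upward migration: promote the file to the fastest tier.
                self._put(0, path, data)
                return data
        raise FileNotFoundError(path)

    def locate(self, path):
        """Return the name of the tier currently holding path, or None."""
        for tier in self.tiers:
            if path in tier.files:
                return tier.name
        return None


store = TieredStore([("DRAM", 2), ("SSD", 2), ("HDD", None)])
for name in ["a", "b", "c", "d", "e"]:
    store.write(name, name.upper())
```

After the five writes, the oldest file has migrated down to the HDD tier; reading it back promotes it to DRAM and pushes colder files down, while the caller uses the same path throughout.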

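Chinese Summary: ONFS: a hierarchical hybrid file system based on memory, solid state drives, and hard disks for high performance computing

Summary: As supercomputers rapidly approach Eflops scale and the number of compute cores grows sharply, larger and more complex applications become possible. Large-scale scientific computing, new workflow applications, and checkpoint operations all require very high bandwidth and low latency from the storage system, which poses a severe technical challenge for high performance storage systems. Current disk-based underlying storage systems struggle to meet the requirements of the new generation of Eflops supercomputers and applications. To address this, this paper proposes ONFS (on-line and near-line file system), a hierarchical hybrid storage system based on compute-node memory, solid state drives, and hard disks. It has three storage levels and a unified namespace, supports the portable operating system interface (POSIX) protocol, and provides high bandwidth, low latency, and very large storage capacity. The paper analyzes in detail distributed metadata management, the memory borrow and return strategy, data consistency, parallel access control, and the mechanisms for downward migration and proactive upward pre-migration. An ONFS prototype was implemented on the Tianhe-1 supercomputer, and its I/O (input/output) performance and scalability were tested. The results show that single-thread and multi-thread read/write performance is 6 times and 5 times higher, respectively, than that of disk-based Lustre. Compared with Lustre, typical data-intensive applications running on ONFS achieve a 6.35-fold I/O speedup.

Key words: High performance computing; Hierarchical hybrid storage system; Distributed metadata management; Data migration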


DOI: 10.1631/FITEE.1700626
CLC number: TP303

On-line Access: 2018-02-06
Received: 2017-09-25
Revision Accepted: 2017-12-25
Crosschecked: 2017-12-27

Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952276; Fax: +86-571-87952331; E-mail: jzus@zju.edu.cn
Copyright © 2000~ Journal of Zhejiang University-SCIENCE