Full Text:   <3>

CLC number: TP333.93

On-line Access: 2026-03-02

Received: 2025-11-22

Revision Accepted: 2026-01-29

Crosschecked: 2026-03-02

Cited: 0

Clicked: 3

Citations:  Bibtex RefMan EndNote GB/T7714

 ORCID:

Weiguo WU

https://orcid.org/0009-0000-8298-0572

-   Go to

Article info.
Open peer comments

Frontiers of Information Technology & Electronic Engineering  2026 Vol.27 No.2 P.1-16

http://doi.org/10.1631/ENG.ITEE.2025.0152


GC bypass: decoupling GC from the flash translation layer to eliminate GC-induced long-tail latency inside SSD


Author(s):  Shiqiang NIE, Jie NIU, Yingzhao SHAO, Xiaobo LI, Mingming ZHANG, Weiguo WU

Affiliation(s):  1. School of Computer Science and Technology, Xian Jiaotong University, Xian 710049, China more

Corresponding email(s):   wgwu@xjtu.edu.cn

Key Words:  Solid-state drive (SSD), NAND flash, Garbage collection (GC), Interconnected network, Flash channel


Share this article to: More

Shiqiang NIE, Jie NIU, Yingzhao SHAO, Xiaobo LI, Mingming ZHANG, Weiguo WU. GC bypass: decoupling GC from the flash translation layer to eliminate GC-induced long-tail latency inside SSD[J]. Journal of Zhejiang University Science C, 2026, 27(2): 1-16.

@article{title="GC bypass: decoupling GC from the flash translation layer to eliminate GC-induced long-tail latency inside SSD",
author="Shiqiang NIE, Jie NIU, Yingzhao SHAO, Xiaobo LI, Mingming ZHANG, Weiguo WU",
journal="Journal of Zhejiang University Science C",
volume="27",
number="2",
pages="1-16",
year="2026",
publisher="Zhejiang University Press & Springer",
doi="10.1631/ENG.ITEE.2025.0152"
}

%0 Journal Article
%T GC bypass: decoupling GC from the flash translation layer to eliminate GC-induced long-tail latency inside SSD
%A Shiqiang NIE
%A Jie NIU
%A Yingzhao SHAO
%A Xiaobo LI
%A Mingming ZHANG
%A Weiguo WU
%J Frontiers of Information Technology & Electronic Engineering
%V 27
%N 2
%P 1-16
%@ 1869-1951
%D 2026
%I Zhejiang University Press & Springer
%DOI 10.1631/ENG.ITEE.2025.0152

TY - JOUR
T1 - GC bypass: decoupling GC from the flash translation layer to eliminate GC-induced long-tail latency inside SSD
A1 - Shiqiang NIE
A1 - Jie NIU
A1 - Yingzhao SHAO
A1 - Xiaobo LI
A1 - Mingming ZHANG
A1 - Weiguo WU
J0 - Frontiers of Information Technology & Electronic Engineering
VL - 27
IS - 2
SP - 1
EP - 16
%@ 1869-1951
Y1 - 2026
PB - Zhejiang University Press & Springer
ER -
DOI - 10.1631/ENG.ITEE.2025.0152


Abstract: 
NAND flash-based solid-state drives (SSDs) have been adopted by many data centers due to their high performance and low power consumption. However, the physical characteristics of the underlying flash memory necessitate garbage collection (GC) operations. Valid page migration during GC contributes significantly to latency overhead while competing for flash channel bandwidth and controller resources with user I/O requests through shared physical paths, leading to path conflicts and elevated long-tail latency. The existing Venice scheme introduces a low-cost interconnected network with path reservation mechanisms to provide substantial path diversity for SSDs. Nevertheless, its fair scheduling policy lacks priority differentiation between I/O and GC requests. In this paper, we propose GC bypass, which leverages Venice’s path diversity while enforcing GC request transmission through dedicated controllers. GC bypass decomposes GC requests into sub-requests and assigns low priority to valid page writes, enabling high-priority operations including user I/O, valid page reads, and block erases, to preempt paths reserved by low-priority requests. Valid pages failing to secure reserved paths are temporarily buffered for retry. Experimental results demonstrate that GC bypass reduces the 99.99th percentile long-tail latency by up to 25% compared to Venice. GC bypass effectively mitigates interference between critical I/O operations and background maintenance tasks while maintaining the architectural benefits of path diversity.

Darkslateblue:Affiliate; Royal Blue:Author; Turquoise:Article

Reference

[1]Balasubramonian R, Kahng AB, Muralimanohar N, et al., 2017. CACTI 7: new tools for interconnect exploration in innovative off-chip memories. ACM Trans Archit Code Optim, 14(2):14.

[2]Cui JH, Chen FY, Li L, et al., 2024. SmartNetSSD: exploiting path resources for read performance improvement in network-based SSDs. IEEE 42nd Int Conf on Computer Design, p.356-359.

[3]Gao CM, Shi L, Di YJ, et al., 2017. Exploiting chip idleness for minimizing garbage collection-induced chip access conflict on SSDs. ACM Trans Des Autom Electron Syst, 23(2):15.

[4]Gao CM, Shi L, Li Q, et al., 2020a. Aging capacitor supported cache management scheme for solid-state drives. IEEE Trans Comput Aided Des Integr Circ Syst, 39(10):2230-2239.

[5]Gao CM, Shi L, Liu K, et al., 2020b. Boosting the performance of SSDs via fully exploiting the plane level parallelism. IEEE Trans Parall Distrib Syst, 31(9):2185-2200.

[6]JEDEC Solid State Technology Association (JSST Association), 2024. JESD230G: NAND Flash Interface Interoperability. Arlington, VA, USA.

[7]Kang W, Shin D, Yoo S, 2017. Reinforcement learning-assisted garbage collection to mitigate long-tail latency in SSD. ACM Trans Embed Comput Syst, 16(5s):134.

[8]Kim J, Kang S, Park Y, et al., 2022. Networked SSD: flash memory interconnection network for high-bandwidth SSD. 55th IEEE/ACM Int Symp on Microarchitecture, p.388-403.

[9]Lee J, Kim Y, Shipman GM, et al., 2013. Preemptible I/O scheduling of garbage collection for solid state drives. IEEE Trans Comput Aided Des Integr Circ Syst, 32(2):247-260.

[10]Li JH, Wang QP, Lee PPC, et al., 2020. An in-depth analysis of cloud block storage workloads in large-scale production. IEEE Int Symp on Workload Characterization, p.37-47.

[11]Mao B, Wu SZ, Duan LD, 2018. Improving the SSD performance by exploiting request characteristics and internal parallelism. IEEE Trans Comput Aided Des Integr Circ Syst, 37(2):472-484.

[12]Nadig R, Sadrosadati M, Mao HY, et al., 2023. Venice: improving solid-state drive parallelism at low cost via conflict-free accesses. Proc 50th Annual Int Symp on Computer Architecture, p.1-16.

[13]Narayanan D, Thereska E, Donnelly A, et al., 2009. Migrating server storage to SSDs: analysis of tradeoffs. Proc 4th ACM European Conf on Computer Systems, p.145-158.

[14]Paik JY, Cho ES, Jin RZ, et al., 2018. Selective-delay garbage collection mechanism for read operations in multichannel flash-based storage devices. IEEE Trans Consum Electron, 64(1):118-126.

[15]Qiu YH, Yin WB, Wang LL, 2021. A high-performance open-channel open-way NAND flash controller architecture. 31st Int Conf on Field-Programmable Logic and Applications, p.91-98.

[16]Ren TY, Du YJ, Cui JH, et al., 2025. Device-level optimization techniques for solid-state drives: a survey.

[17]Sha ZB, Li J, Song LH, et al., 2021. Low I/O intensity-aware partial GC scheduling to reduce long-tail latency in SSDs. ACM Trans Archit Code Optim, 18(4):46.

[18]Tavakkol A, Arjomand M, Sarbazi-Azad H, 2013. Network-on-SSD: a scalable and high-performance communication design paradigm for SSDs. IEEE Comput Arch Lett, 12(1):5-8.

[19]Tavakkol A, Gómez-Luna J, Sadrosadati M, et al., 2018. MQSim: a framework for enabling realistic studies of modern multi-queue SSD devices. 16th USENIX Conf on File and Storage Technologies, p.49-65.

[20]Wang Y, Sun ZB, Zhou Y, et al., 2024. Balloon-ZNS: constructing high-capacity and low-cost ZNS SSDs with built-in compression. Proc 61st ACM/IEEE Design Automation Conf, p.125.

[21]Yan SQ, Li HC, Hao MZ, et al., 2017. Tiny-tail flash: near-perfect elimination of garbage collection tail latencies in NAND SSDs. ACM Trans Stor, 13(3):22.

Open peer comments: Debate/Discuss/Question/Opinion

<1>

Please provide your name, email address and a comment





Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn
Copyright © 2000 - 2026 Journal of Zhejiang University-SCIENCE