Cognitive erasure-coded data update and repairfor mitigating I/O overhead

Bing Wei
Ming Zhong
Qian Chen
Yi Wu
Yubin Li

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In erasure-coded storage systems, updating data necessitates parity updates to maintain data consistency, which leads to I/O amplification due to "write-after-read" operations. Additionally, the scattered storage of parity updates imposes significant disk seek overhead during data repair. To address these challenges, this paper proposes a Cognitive Update and Repair Method (CURM), which uses machine learning to classify files into write-only, read-only, and read-write categories, enabling customized update and repair strategies. For write-only and read-write files, CURM utilizes data difference and fine-grained I/O scheduling to reduce I/O overhead. Furthermore, CURM reserves disk space adjacent to parity chunks for read-write files, enabling efficient parallel reads and minimizing seek cost during repair. We implement CURM in a prototype storage system and evaluate its performance using real-world NFS and MSR workloads on a 25-node cluster. Experimental results show that CURM improves data update throughput by up to 82.52% and reduces data recovery time by up to 47.47%, while achieving lower storage overhead compared to state-of-the-art approaches including FL, PL, PLR, and PARIX. These results demonstrate CURM’s effectiveness in enhancing both update and recovery performance for large-scale erasure-coded storage systems.

Version published to 10.21203/rs.3.rs-5794537/v1 on Research Square
Apr 9, 2025

Adaptive NVM Word Compression Based on Cache Line Dynamics on Micro-Architecture <i></i>

This article has 4 authors:
1. Jialin Wang
2. Zhen Yang
3. Zhenghao Yin
4. Yajuan Du
This article has no evaluationsLatest version Apr 15, 2025
SOT-MRAM-enabled noise-tolerant and resource-saving probabilistic binary neural network

This article has 16 authors:
1. Xufeng Kou
2. Puyang Huang
3. Yu Gu
4. Chenyi Fu
5. Tianhao Chen
6. Shan Yao
7. Zhenghang Zhi
8. Jiaqi Lu
9. Yongqi Hu
10. Hongchao Zhang
11. Shiyang Lu
12. Yumeng Yang
13. Tianxiao Nie
14. Shouzhong Peng
15. Weisheng Zhao
16. Kang Wang
This article has no evaluationsLatest version Mar 27, 2025
Scalable high-performance single cell data analysis with BPCells

This article has 2 authors:
1. Benjamin Parks
2. William Greenleaf
This article has no evaluationsLatest version Apr 1, 2025

Listed in

Abstract

Article activity feed

Related articles

Adaptive NVM Word Compression Based on Cache Line Dynamics on Micro-Architecture <i></i>

SOT-MRAM-enabled noise-tolerant and resource-saving probabilistic binary neural network

Scalable high-performance single cell data analysis with BPCells