Adaptive Fault Resilience Techniques for Flash Memory Used in DNN Accelerators
Abstract
Deep neural networks (DNNs) are increasingly utilized in various applications, including smart appliances, facial recognition, and autonomous driving. The weight data generated during training are typically stored in flash memory, which is prone to reliability and endurance challenges. Given the inherent error tolerance of DNN applications, adaptive fault resilience techniques have been proposed to safeguard the weight data stored in flash memory. Initially, an analysis of bit significance is conducted to ascertain the priority of weight bits that require protection. Subsequently, a novel weight transposer and an address remapper are introduced to reallocate significant weight bits to more reliable or fault-free flash memory cells. A bipartite graph model is developed to facilitate the modeling of address remapping and the assessment of error scores. Additionally, corresponding hardware architectures for address remapping are proposed. The deep learning framework PyTorch is employed to evaluate inference accuracy across various DNN models. Experimental results indicate that, with an injected bit error rate (BER) of 0.01% in the weight data, the accuracy losses for commonly used DNN models remain below 1%, accompanied by negligible hardware overhead.
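Two of the ideas summarized above can be made concrete with a short sketch (not taken from the paper; the function names, significance scores, and per-page bit error rates are hypothetical): first, why bit significance matters — flipping a high exponent bit of an IEEE 754 float32 weight changes it by orders of magnitude, while flipping the lowest mantissa bit is almost invisible; second, a simple greedy form of address remapping that steers the most significant weight lines to the most reliable flash pages.

```python
import struct

def flip_bit(weight: float, bit: int) -> float:
    """Flip one bit of a float32 weight.
    Bit 31 is the sign, bits 30-23 the exponent, bits 22-0 the mantissa."""
    (as_int,) = struct.unpack("<I", struct.pack("<f", weight))
    (flipped,) = struct.unpack("<f", struct.pack("<I", as_int ^ (1 << bit)))
    return flipped

def remap_addresses(line_significance, page_ber):
    """Greedy remap (a simplification of the paper's bipartite-graph model):
    assign the most significant weight lines to the most reliable flash pages,
    minimizing the error score sum_i significance[i] * ber[page(i)]
    (optimal for this product-sum by the rearrangement inequality)."""
    lines = sorted(range(len(line_significance)), key=lambda i: -line_significance[i])
    pages = sorted(range(len(page_ber)), key=lambda p: page_ber[p])
    return dict(zip(lines, pages))

if __name__ == "__main__":
    # An exponent-bit flip corrupts the weight by orders of magnitude;
    # a low mantissa-bit flip barely changes it.
    print(flip_bit(0.5, 30))  # ~1.7e38
    print(flip_bit(0.5, 0))   # ~0.50000006
    # Hypothetical significance scores and per-page bit error rates.
    print(remap_addresses([0.1, 0.9, 0.5], [0.01, 0.001, 0.02]))
```

The asymmetry shown by `flip_bit` is what motivates protecting only the high-priority bits: with error-tolerant DNN weights, reliable cells are a scarce resource best spent on the bits whose corruption actually moves the inference result.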