ZOE: Zero Overhead ECC Techniques for Flash Memory Used in AI Accelerators
Abstract
One important role of flash memory is to store the trained weights of state-of-the-art deep neural networks (DNNs). However, flash memory suffers from many reliability and endurance issues. Consequently, erroneous weights can degrade classification accuracy, which is unacceptable for mission-critical applications. To address these challenges, a highly efficient zero-overhead Error Correction Code (ECC) technique named ZOE is proposed in this paper. By exploiting redundancy in weight representation, weights are partitioned into reducible weights (RWs) and irreducible weights (IRWs). Reducible weights can be represented with a shorter weight length, and the saved bits can be used to store the check bits of the adopted ECC. For most DNN models, the weight values are mostly close to zero; therefore, the proportion of RWs is usually very high, allowing many bits to be saved for ECC. Moreover, since a codeword of flash memory typically consists of many weights, the proportion of RWs may vary across codewords. This variation can compromise the reduction efficiency of ZOE. Therefore, we also propose a weight leveling technique that evenly distributes RWs across all codewords, along with an algorithm for deriving the control words that orchestrate the leveling. The corresponding hardware architectures of ZOE are then developed. Experimental results demonstrate that the reliability and accuracy of typical DNN models can be significantly improved with negligible hardware overhead.
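The sketch below is a minimal illustration of the RW/IRW partitioning idea described in the abstract, not the paper's actual scheme. It assumes 8-bit signed weights and a hypothetical 5-bit reduced representation, so each reducible weight frees 3 bits that could hold ECC check bits for its codeword; the concrete weight formats, thresholds, ECC, and weight leveling used by ZOE are defined in the paper itself.

```python
# Minimal sketch of RW/IRW partitioning (assumed parameters, not ZOE's).
WEIGHT_BITS = 8
REDUCED_BITS = 5                      # hypothetical shorter representation
SAVED_BITS = WEIGHT_BITS - REDUCED_BITS

def is_reducible(w: int) -> bool:
    """True if the signed weight fits in REDUCED_BITS two's-complement bits."""
    lo, hi = -(1 << (REDUCED_BITS - 1)), (1 << (REDUCED_BITS - 1)) - 1
    return lo <= w <= hi

def partition(weights):
    """Split a codeword's weights into reducible (RW) and irreducible (IRW) sets."""
    rws = [w for w in weights if is_reducible(w)]
    irws = [w for w in weights if not is_reducible(w)]
    return rws, irws

def saved_check_bits(weights) -> int:
    """Bits freed by the RWs in this codeword, available for ECC check bits."""
    rws, _ = partition(weights)
    return len(rws) * SAVED_BITS

# Example: near-zero weights dominate, so most weights are reducible.
codeword = [0, -1, 3, 2, 120, -5, 0, -90]
rws, irws = partition(codeword)
print(len(rws), len(irws), saved_check_bits(codeword))  # -> 6 2 18
```

In this toy example, 6 of the 8 weights are reducible, freeing 18 bits in the codeword for check bits. Because real codewords may differ widely in their RW counts, the paper's weight leveling step rebalances RWs across codewords so that each codeword has enough saved bits for its ECC.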