Zero-Knowledge Cross-User De-Duplication for Big Data Storage on Cloud

R N Karthika
C Valliyammai
P Sundaravadivel
R. Augustian Isaac

Abstract

The rise of cloud computing and the sheer volume of data held in cloud storage have driven and accelerated the emergence of big data. Cloud computing has become the platform for centralized pools of resources such as applications, networks and storage services. The enormous increase in data leads to duplicate, redundant copies of information on cloud servers, and data de-duplication has become the mainstream technology for eliminating them in cloud storage. With the aim of removing redundant data, secure cross-user source-based de-duplication and integrity auditing delegation techniques have been examined. In this paper, a combined technique is proposed to perform both zero-knowledge de-duplication and public integrity auditing of data in big data storage. De-duplication, together with data integrity auditing, frees up space on the cloud server and ensures the security of the stored data. The originality of the proposed methodology is threefold. Firstly, an enhanced cuckoo hashing algorithm is used to identify duplicate chunks, which improves overall performance because lookups take significantly less time than with traditional hashing algorithms. Secondly, after hashing, the hash index of the de-duplicated data is transmitted to the Cloud Service Provider (CSP) and checked there for duplicates; only the data chunks whose hash values are missing at the CSP are then sent. Any data lost during this transmission can be recovered through the Reed-Solomon (RS) mechanism. Thirdly, the proposed key exchange algorithm inherits the security enhancement of the Advanced Encryption Standard (AES). Several experiments compare the proposed technique with state-of-the-art methods under various use cases and user scenarios. The proposed framework uses less computation power and frees up to 86.77% of storage space, which is higher than other state-of-the-art de-duplication algorithms. The upload and download response times for file encoding remain stable even as the file size grows, so the proposed zero-knowledge cross-user data de-duplication mechanism is stable despite variation in file size.
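As a rough illustration of the first two steps described in the abstract, the sketch below uses a plain two-table cuckoo hash set to hold chunk fingerprints and returns only the chunks whose fingerprints are not yet in the index. It is not the authors' enhanced cuckoo hashing algorithm. The names `CuckooSet` and `chunks_to_upload`, the fixed chunk size, and the use of SHA-256 are all assumptions made for the example. The index is held locally to stand in for the CSP, and the zero-knowledge, Reed-Solomon and AES parts of the scheme are not modeled.

```python
import hashlib


class CuckooSet:
    """Plain two-table cuckoo hash set of byte-string fingerprints (illustrative)."""

    def __init__(self, capacity=64, max_kicks=32):
        self.capacity = capacity
        self.max_kicks = max_kicks
        self.tables = [[None] * capacity, [None] * capacity]

    def _slot(self, key, which):
        # Two hash functions, derived by salting SHA-256 with the table number.
        digest = hashlib.sha256(bytes([which]) + key).digest()
        return int.from_bytes(digest[:8], "big") % self.capacity

    def __contains__(self, key):
        # A lookup probes at most two slots, one per table.
        return any(self.tables[t][self._slot(key, t)] == key for t in (0, 1))

    def add(self, key):
        if key in self:
            return
        which = 0
        for _ in range(self.max_kicks):
            slot = self._slot(key, which)
            # Place the key and pick up whatever occupied the slot before.
            key, self.tables[which][slot] = self.tables[which][slot], key
            if key is None:
                return
            which = 1 - which  # the evicted key tries its slot in the other table
        self._grow(key)

    def _grow(self, pending):
        # Too many evictions: double both tables and reinsert every key.
        old = [k for table in self.tables for k in table if k is not None]
        self.capacity *= 2
        self.tables = [[None] * self.capacity, [None] * self.capacity]
        for k in old + [pending]:
            self.add(k)


def chunks_to_upload(data, index, chunk_size=4):
    """Split data into fixed-size chunks; return all fingerprints and the missing chunks."""
    fingerprints, missing = [], {}
    for i in range(0, len(data), chunk_size):
        chunk = data[i:i + chunk_size]
        fp = hashlib.sha256(chunk).digest()
        fingerprints.append(fp)
        if fp not in index and fp not in missing:
            missing[fp] = chunk  # only these chunks would need to be sent
    return fingerprints, missing


# The index stands in for the fingerprint index held at the CSP (an assumption).
server_index = CuckooSet()
fps, missing = chunks_to_upload(b"AAAABBBBAAAACCCC", server_index)
for fp in missing:
    server_index.add(fp)

# A second upload that shares one chunk with the first.
fps2, missing2 = chunks_to_upload(b"AAAADDDD", server_index)
```

In this toy run the first upload has four chunks, one of them repeated, and the second upload shares a chunk with the first, so only its new chunk would be transmitted. A real deployment would exchange the fingerprints with the CSP over the network instead of reading a local set.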

Version published to 10.21203/rs.3.rs-5622401/v1 on Research Square
Dec 13, 2024

