FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHF


Abstract

In the era of increasing privacy concerns and demand for personalized experiences, traditional Reinforcement Learning with Human Feedback (RLHF) frameworks face significant challenges due to their reliance on centralized data. We introduce Federated Reinforcement Learning with Human Feedback (FedRLHF), a novel framework that decentralizes the RLHF process. FedRLHF enables collaborative policy learning across multiple clients, such as in Large Language Model (LLM) fine-tuning, without sharing raw data or human feedback, thereby ensuring robust privacy preservation. Leveraging federated reinforcement learning, each client integrates human feedback locally into its reward function and updates its policy through a personalized RLHF process. We establish rigorous theoretical foundations for FedRLHF, providing convergence guarantees and deriving sample complexity bounds that scale efficiently with the number of clients. Empirical evaluations on the MovieLens and IMDb datasets demonstrate that FedRLHF preserves user privacy, achieves performance on par with centralized RLHF, and enhances personalization across diverse client environments.
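To make the decentralized workflow described in the abstract concrete, the sketch below illustrates one possible shape of a FedRLHF-style communication round: each client fits a reward model on its private preference pairs, performs a local policy update against that reward, and only policy parameters are sent back for server-side averaging. This is a minimal illustration, not the authors' algorithm; the linear reward and policy models, the Bradley-Terry preference loss, the FedAvg-style averaging, and all function names (fedrlhf_round, fit_local_reward, local_policy_update) are assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)


def fit_local_reward(pref_pairs, dim, lr=0.1, steps=200):
    """Fit a Bradley-Terry linear reward model on a client's private preference pairs.

    Each pair is (features_preferred, features_rejected). The raw feedback and the
    gradients stay on the client; only the resulting policy update is shared later.
    """
    theta_r = np.zeros(dim)
    for _ in range(steps):
        grad = np.zeros(dim)
        for f_pos, f_neg in pref_pairs:
            p = 1.0 / (1.0 + np.exp(-(f_pos - f_neg) @ theta_r))
            grad += (1.0 - p) * (f_pos - f_neg)
        theta_r += lr * grad / len(pref_pairs)
    return theta_r


def local_policy_update(theta_pi, theta_r, states, lr=0.05, n_actions=4, dim=8):
    """REINFORCE-style updates of a softmax policy using the client's local reward."""
    for s in states:
        # Toy featurisation: state features concatenated with a one-hot action.
        feats = np.stack([np.concatenate([s, np.eye(n_actions)[a]])[:dim]
                          for a in range(n_actions)])
        logits = feats @ theta_pi
        probs = np.exp(logits - logits.max())
        probs /= probs.sum()
        a = rng.choice(n_actions, p=probs)
        reward = feats[a] @ theta_r
        # Policy gradient: reward * grad log pi(a|s) for a linear softmax policy.
        theta_pi = theta_pi + lr * reward * (feats[a] - probs @ feats)
    return theta_pi


def fedrlhf_round(global_theta_pi, clients, dim=8):
    """One communication round: clients personalize locally, server averages policies."""
    updates = []
    for pref_pairs, states in clients:
        theta_r = fit_local_reward(pref_pairs, dim)                        # stays on-device
        theta_pi = local_policy_update(global_theta_pi.copy(), theta_r, states)
        updates.append(theta_pi)                                           # only parameters shared
    return np.mean(updates, axis=0)


# Toy run: 3 clients, each with synthetic preference pairs and states.
dim, n_clients = 8, 3
clients = []
for _ in range(n_clients):
    pref_pairs = [(rng.normal(size=dim), rng.normal(size=dim)) for _ in range(20)]
    states = [rng.normal(size=4) for _ in range(10)]
    clients.append((pref_pairs, states))

theta_pi = np.zeros(dim)
for _ in range(5):
    theta_pi = fedrlhf_round(theta_pi, clients, dim)
print("global policy parameters after 5 rounds:", np.round(theta_pi, 3))
```

The key privacy property the sketch tries to mirror is that preference data and reward-model gradients never leave the client; only the locally updated policy parameters are aggregated, while each client can also retain its personalized policy between rounds.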
