Solving Counterfactual Regret Minimization for Bayesian Games with Continuous Type Spaces

Zuyuan Zhang
Mahdi Imani
Tian Lan

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Bayesian games model interactive decision-making where players have incomplete information -- e.g., regarding payoffs of the game or private data on players' strategies/preferences. Thus, players must actively reason and update their beliefs about the game and about other players using the notion of type. Existing work on counterfactual regret (CFR) minimization have shown great success for solving games with complete or imperfect information. However, Bayesian games with continuous type spaces cannot be converted into an equivalent non-Bayesian extensive-form game and remain difficult to solve. To this end, we propose a new Bayesian-CFR minimization algorithm achieving a guaranteed regret bound with respect to Nash Equilibria in Bayesian games with continuous type spaces. In particular, we show that the cumulative regret is continuous with respect to type distribution and thus can be estimated using finite samples in the type space. We then propose a kernel-density estimate that is shown to converge to the true type distribution. These results allow us to efficiently solve Bayesian-CFR minimization with respect to the Bayesian Nash equilibrium. We finally extend this new approach to Bayesian-CFR+ and Deep Bayesian CFR. Experimental results show that our solution significantly outperforms existing methods with minimumexploitability in Bayesian games with continuous type spaces such as modified Texas Hold'em.

Version published to 10.21203/rs.3.rs-9004720/v1 on Research Square
Mar 13, 2026

Regret Is Weighted Forgetting

This article has 1 author:
1. Michael Timothy Bennett
This article has no evaluationsLatest version Mar 19, 2026
Regret Equals Covariance: A Closed-Form Characterization for Stochastic Optimization

This article has 1 author:
1. Irene Aldridge
This article has no evaluationsLatest version Feb 26, 2026
Disentangling the Effects of Counterfactual Feedback on Maximization and Risk Preference across Gains and Losses

This article has 6 authors:
1. Ali Shiravand
2. Maëlle Gueguen
3. Sophie Bavard
4. Dirk Wulff
5. Julien Bastin
6. Stefano Palminteri
This article has no evaluationsLatest version Apr 10, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Regret Is Weighted Forgetting

Regret Equals Covariance: A Closed-Form Characterization for Stochastic Optimization

Disentangling the Effects of Counterfactual Feedback on Maximization and Risk Preference across Gains and Losses