Chemotherapy dose scheduling via Q-learning in a Markov tumor model

Abstract

We describe a Q-learning approach to optimizing chemotherapy dose scheduling in a stochastic, finite-cell Markov process that models the natural-selection dynamics of tumor cells. The three competing subpopulations comprising our virtual tumor are a chemo-sensitive population (S) and two chemo-resistant populations, R1 and R2, each resistant to one of two drugs, C1 and C2. Each drug can be toggled on or off; these toggles constitute the actions (selection pressures) imposed on our state variables (S, R1, R2), measured as proportions in our finite state space of N cancer cells (S + R1 + R2 = N). After converged chemo-dosing policies are obtained for a given reward structure, we focus on three important aspects of chemotherapy dose scheduling. First, we identify the most likely evolutionary paths of the tumor cell populations in response to the optimized (converged) policies. Second, we quantify the robustness of our ability to reach the target of balanced coexistence under incomplete information about both the initial cell populations and the state variables at each step. Third, we evaluate the efficacy of simplified policies that exploit the symmetries uncovered by examining the full policy. Our reward structure is designed to delay the onset of chemo-resistance in the tumor by rewarding a well-balanced mix of coexisting states while punishing unbalanced subpopulations to avoid extinction.
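As a concrete illustration of the setup the abstract describes, the following minimal Python sketch runs tabular epsilon-greedy Q-learning on a toy version of the three-population model. It is not the paper's implementation: the transition dynamics, kill probabilities, reward weights, initial mix, and all hyperparameters (alpha, gamma, eps, horizon) are illustrative assumptions.

    import numpy as np

    rng = np.random.default_rng(0)

    N = 50                                      # finite tumor size: S + R1 + R2 = N
    ACTIONS = [(0, 0), (1, 0), (0, 1), (1, 1)]  # (C1 on?, C2 on?)

    def transition(state, action):
        # Hypothetical stochastic update: each active drug kills the
        # subpopulations not resistant to it; survivors regrow to N.
        s, r1, r2 = state
        c1, c2 = action
        kill = 0.4  # assumed per-cell kill probability per active drug
        surv = [
            rng.binomial(s, (1 - kill * c1) * (1 - kill * c2)),  # S: sensitive to both
            rng.binomial(r1, 1 - kill * c2),                     # R1: resists C1 only
            rng.binomial(r2, 1 - kill * c1),                     # R2: resists C2 only
        ]
        total = sum(surv)
        if total == 0:
            return (0, 0, 0)
        grown = rng.multinomial(N, np.array(surv) / total)       # repopulate to N cells
        return tuple(int(x) for x in grown)

    def reward(state):
        # Reward balanced coexistence; punish extinction of any subpopulation,
        # mirroring the reward structure described in the abstract.
        if min(state) == 0:
            return -10.0
        props = np.array(state) / N
        return 1.0 - 3.0 * np.abs(props - 1.0 / 3.0).sum()       # peak at S = R1 = R2

    Q = {}  # tabular Q-values keyed by (state, action index)
    alpha, gamma, eps, horizon = 0.1, 0.95, 0.1, 30

    for episode in range(5000):
        state = (N - 10, 5, 5)  # assumed initial mix: mostly chemo-sensitive
        for _ in range(horizon):
            if rng.random() < eps:  # epsilon-greedy exploration
                a = int(rng.integers(len(ACTIONS)))
            else:
                a = max(range(len(ACTIONS)), key=lambda i: Q.get((state, i), 0.0))
            nxt = transition(state, ACTIONS[a])
            r = reward(nxt)
            best_next = max(Q.get((nxt, i), 0.0) for i in range(len(ACTIONS)))
            q = Q.get((state, a), 0.0)
            Q[(state, a)] = q + alpha * (r + gamma * best_next - q)
            state = nxt
            if min(state) == 0:  # a subpopulation went extinct; episode ends
                break

After training, the greedy action at each visited state approximates a converged dosing policy; the reward in this sketch peaks when the three subpopulations are each near N/3, corresponding to the balanced-coexistence target described above.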
