Model-Based Reinforcement Learning Control for Non-Linear Dynamics
Abstract
Safe and sample-conscious controller synthesis for nonlinear dynamics benefits from reinforcement learning that exploits a model of the plant. A nonlinear mass–spring–damper with hardening effects and hard stops is considered. Two data-driven models are employed to enable off-plant training: a piecewise linear model assembled from operating-region linear descriptions and blended by triangular memberships, and a global nonlinear autoregressive model with exogenous input constructed from past inputs and outputs. Q-learning is performed with the model in the loop using an error-indexed discrete state space, a finite force alphabet, and a reward that balances absolute tracking error with its short-horizon decrease. When the trained agents are deployed on the true plant for reference tracking, the piecewise linear model tends to yield tighter regulation near the setpoint and reduced steady-state bias, while the nonlinear autoregressive route requires less prior structural knowledge and a simpler data-collection campaign, at the cost of larger residual error in the tested scenario. These findings indicate that model-based Q-learning with data-driven models enables off-plant policy learning while containing experimental risk. Observed performance reflects a trade-off between fidelity obtained from localized linearization and generality afforded by global nonlinear regression, as well as design choices in state discretization and reward shaping. Prospective improvements include adaptive membership shaping, richer regressors, and limited on-plant refinement to reduce model–plant mismatch.
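The following sketches are illustrative only; the abstract does not give the paper's actual membership functions, local model orders, discretization, or reward gains, so every constant, function name, and model form below (CENTERS, A_LOC, B_LOC, blended_step, and so on) is a hypothetical placeholder. The first block shows one plausible way to blend operating-region linear descriptions through triangular memberships into a single surrogate model, assuming first-order local models over a displacement-indexed operating range.

```python
import numpy as np

# --- hypothetical operating points and local linear models (illustrative only) ---
CENTERS = np.array([-0.5, 0.0, 0.5])   # membership centers over the operating range
A_LOC = np.array([0.90, 0.95, 0.88])   # assumed local discrete-time pole per region
B_LOC = np.array([0.02, 0.03, 0.02])   # assumed local input gain per region

def triangular_weights(x, centers):
    """Triangular memberships over uniformly spaced centers, normalized to sum to one."""
    w = np.maximum(0.0, 1.0 - np.abs(x - centers) / np.diff(centers).mean())
    return w / w.sum() if w.sum() > 0 else np.ones_like(centers) / len(centers)

def blended_step(y, u):
    """Blend the local first-order predictions into one surrogate-model step."""
    w = triangular_weights(y, CENTERS)
    return float(np.sum(w * (A_LOC * y + B_LOC * u)))
```

The second block is a minimal sketch of the model-in-the-loop Q-learning described above, assuming an error-indexed discrete state space built by binning the tracking error, a small finite force alphabet, and a reward that trades off absolute error against its one-step decrease, reward = -|e| + LAMBDA * (|e_prev| - |e|). The interface `model_step(y, u)` stands in for either surrogate (the blended piecewise linear model or a NARX predictor); all gains and bin edges are assumptions.

```python
ERROR_BINS = np.linspace(-1.0, 1.0, 21)           # assumed error bin edges
FORCES = np.array([-10.0, -2.0, 0.0, 2.0, 10.0])  # assumed finite force alphabet
ALPHA, GAMMA, EPS, LAMBDA = 0.1, 0.95, 0.1, 0.5   # assumed learning/reward gains

Q = np.zeros((len(ERROR_BINS) + 1, len(FORCES)))  # tabular action-value estimates

def state_index(error):
    """Map a continuous tracking error onto a discrete state index."""
    return int(np.digitize(error, ERROR_BINS))

def reward(err, prev_err):
    """Penalize absolute tracking error, reward its short-horizon decrease."""
    return -abs(err) + LAMBDA * (abs(prev_err) - abs(err))

def train_episode(model_step, reference, y0, steps, rng):
    """One Q-learning episode run against a data-driven surrogate, not the plant."""
    y, prev_err = y0, reference - y0
    s = state_index(prev_err)
    for _ in range(steps):
        # epsilon-greedy choice over the force alphabet
        a = rng.integers(len(FORCES)) if rng.random() < EPS else int(np.argmax(Q[s]))
        y = model_step(y, FORCES[a])              # step the surrogate model off-plant
        err = reference - y
        s_next = state_index(err)
        # standard tabular Q-learning update
        Q[s, a] += ALPHA * (reward(err, prev_err) + GAMMA * np.max(Q[s_next]) - Q[s, a])
        s, prev_err = s_next, err
```

Under these assumptions, a call such as `train_episode(blended_step, reference=0.5, y0=0.0, steps=200, rng=np.random.default_rng(0))` would run one off-plant training episode; the resulting Q table would then be evaluated greedily on the true plant for reference tracking.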