Hybrid modeling framework for bioprocesses with minimal prior knowledge and limited data

Carlos Martínez
Facundo Rocha Calvette
Marielle Péré
Mauricio Barrientos
Sebastián Ossandón


Abstract

Hybrid models that couple mechanistic ordinary differential equations (ODEs) with neural networks are increasingly used in bioprocess engineering, yet most published approaches assume either substantial prior knowledge or relatively large datasets. This work proposes a hybrid modeling framework for early-stage bioprocess development, where only a few batch experiments are available and standard artificial intelligence (AI) techniques are difficult to apply. The mechanistic structure is constructed using only qualitative, widely accepted biological constraints (e.g., non-negativity, zero-invariance, and biomass-mediated interactions), while unknown functional dependencies are learned by a feedforward neural network embedded in the ODE right-hand side. To exploit the natural organization of batch data, we introduce a minibatch training strategy in which each minibatch corresponds to one entire batch experiment, combined with regularization to mitigate overfitting. We demonstrate the approach on (i) synthetic Escherichia coli growth with overflow metabolism and (ii) experimental astaxanthin production by Xanthophyllomyces dendrorhous. In both cases, models trained from as few as three batch experiments accurately predict an unseen validation batch, and the learned neural components recover biologically consistent patterns. Thus, the framework contributes to AI by enabling constrained neural differential models that learn interpretable dynamics from limited, structured data, with applications to early-stage bioprocess engineering.
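
The abstract does not give the model equations, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a hypothetical two-state system (biomass x, substrate s) in which a small feedforward network supplies a growth rate mu and a non-negative uptake coefficient q, and both right-hand sides are multiplied by x (biomass-mediated) so that x = 0 and s = 0 stay invariant. The names init_net, rates, simulate, and experiment_loss, the network size, the multiplicative integration step, and the toy data are all assumptions made for illustration. The loss is evaluated once per batch experiment, mirroring the "one minibatch per experiment" strategy with an L2 penalty; the gradient update itself is omitted.

```python
import math
import random

def softplus(z):
    # Smooth, strictly positive activation (numerically stable form).
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))

def init_net(n_hidden=4, seed=0):
    # Hypothetical tiny network: 1 input (s) -> n_hidden tanh units -> 2 outputs.
    rng = random.Random(seed)
    return {
        "w1": [rng.uniform(-1, 1) for _ in range(n_hidden)],
        "b1": [rng.uniform(-1, 1) for _ in range(n_hidden)],
        "w2": [[rng.uniform(-1, 1) for _ in range(n_hidden)] for _ in range(2)],
        "b2": [0.0, 0.0],
    }

def rates(params, s):
    # Learned part: growth rate mu (any sign) and uptake coefficient q >= 0.
    hidden = [math.tanh(w * s + b) for w, b in zip(params["w1"], params["b1"])]
    mu_raw, q_raw = (sum(w * h for w, h in zip(row, hidden)) + b
                     for row, b in zip(params["w2"], params["b2"]))
    return mu_raw, softplus(q_raw)

def simulate(params, x0, s0, t_end, dt=0.01):
    # Assumed structure: dx/dt = mu(s) * x,  ds/dt = -q(s) * s * x.
    # Rates are frozen over each step and applied multiplicatively, so
    # x and s can never become negative and zero states stay at zero.
    x, s = x0, s0
    traj = [(0.0, x, s)]
    for k in range(1, round(t_end / dt) + 1):
        mu, q = rates(params, s)
        x, s = x * math.exp(dt * mu), s * math.exp(-dt * q * x)
        traj.append((k * dt, x, s))
    return traj

def experiment_loss(params, run, lam=1e-3, dt=0.01):
    # One "minibatch" = one whole batch experiment: simulate it from its own
    # initial condition and compare against all of its observations.
    t_end = max(t for t, _, _ in run["obs"])
    traj = simulate(params, run["x0"], run["s0"], t_end, dt)
    sq = 0.0
    for t, x_obs, s_obs in run["obs"]:
        _, x, s = traj[round(t / dt)]
        sq += (x - x_obs) ** 2 + (s - s_obs) ** 2
    # L2 penalty on the weights as a simple regularizer.
    l2 = sum(w * w for w in params["w1"])
    l2 += sum(w * w for row in params["w2"] for w in row)
    return sq / len(run["obs"]) + lam * l2

# Toy data only: three synthetic "experiments" generated from a second
# randomly initialized network, sampled every 25 integration steps.
true_params = init_net(seed=1)
experiments = []
for x0, s0 in [(0.1, 5.0), (0.2, 10.0), (0.05, 2.0)]:
    traj = simulate(true_params, x0, s0, t_end=1.0)
    experiments.append({"x0": x0, "s0": s0, "obs": traj[::25]})

params = init_net(seed=0)
for i, run in enumerate(experiments):
    # In training, a gradient step on params would follow each evaluation.
    print(i, experiment_loss(params, run))
```

In a real training loop, each pass over the data would shuffle the experiments and take one optimizer step per experiment, typically with automatic differentiation through the ODE solver rather than the hand-written integrator assumed here.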

Version published to 10.1101/2025.11.14.688550 on bioRxiv
Nov 16, 2025

Related articles

Application of a TimeXer Model Incorporating ECM Based Features in Battery Remaining Useful Life Prediction

This article has 7 authors:
1. pei tang
2. lihui liu
3. wenbo lei
4. zetao qiu
5. Zhongran Yao
6. xiaoyong gu
7. changcheng sun
This article has no evaluations. Latest version: Jan 20, 2026

An LLM-Agentic Workflow for Data-Driven Modeling: From Image Reconstruction to Thermodynamic Modeling

This article has 2 authors:
1. Guannan Tang
2. Noah Paulson
This article has no evaluations. Latest version: Jan 20, 2026

Application of Explainable Artificial Intelligence (XAI) in Combination with Bootstrap to Improve Processes in Model-Based Aero-Engine Development

This article has 6 authors:
1. Klaus Markgraf
2. Katja Müller
3. Clara Henkel
4. Robert Flassig
5. Christian Janke
6. Peter Flassig
This article has no evaluations. Latest version: Jan 28, 2026
