Efficient Operator Learning with Derivative-Enhanced Parameter Sensitivity Information and Hybrid Optimization
Abstract
We present two complementary strategies that improve the performance of Deep Operator Networks (DeepONets) for parametric partial differential equations (PDEs). The first enriches the loss function with parameter derivatives, inspired by Hermite interpolation, thereby enhancing generalization even when only limited training data are available. We generate both solutions and parameter sensitivities by extending the OpenFOAM finite volume method (FVM) solver with automatic differentiation. The second is a hybrid training scheme that combines gradient descent (GD) with least-squares (LS) optimization: by isolating the final linear layer and solving for its weights exactly via LS at each GD step, we accelerate convergence and reduce training error. A distinctive feature of our formulation is the treatment of parametric inputs: rather than evaluating high-dimensional fields across the entire domain, we project them onto a coarse parameter mesh before passing them to the branch network, substantially lowering the input dimension and network complexity while retaining accuracy. Numerical experiments on convection–diffusion problems with heterogeneous diffusivity and velocity fields confirm that this framework consistently reduces error and improves efficiency, highlighting the promise of derivative-enhanced, hybrid-trained DeepONets for complex parametric PDEs.
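The two strategies in the abstract lend themselves to a compact illustration. The following is a minimal sketch, not the authors' implementation, of (i) a Hermite-inspired, derivative-enhanced loss that penalizes mismatches in the parameter sensitivities du/dmu alongside the solution values, and (ii) a hybrid training step in which the branch network's final linear layer (assumed bias-free here) is solved exactly by least squares before a gradient step updates the remaining weights. The TinyDeepONet architecture, layer sizes, and the use of PyTorch are illustrative assumptions; only the overall structure follows the abstract.

```python
# Minimal sketch (assumed PyTorch implementation, not the authors' code) of the two
# strategies described in the abstract: a derivative-enhanced loss and a hybrid
# GD / least-squares training step. Names, layer sizes, and the bias-free branch
# head are illustrative assumptions.
import torch
import torch.nn as nn


class TinyDeepONet(nn.Module):
    def __init__(self, mu_dim, x_dim, p=32, hidden=64):
        super().__init__()
        # branch net: coarse-mesh parameter samples -> p coefficients
        # (final layer kept bias-free so it can be solved exactly by LS below)
        self.branch = nn.Sequential(nn.Linear(mu_dim, hidden), nn.Tanh(),
                                    nn.Linear(hidden, p, bias=False))
        # trunk net: spatial coordinates -> p basis functions
        self.trunk = nn.Sequential(nn.Linear(x_dim, hidden), nn.Tanh(),
                                   nn.Linear(hidden, p))

    def forward(self, mu, x):
        # u(mu, x) = sum_k b_k(mu) * t_k(x)
        return (self.branch(mu) * self.trunk(x)).sum(dim=-1, keepdim=True)


def derivative_enhanced_loss(model, mu, x, u_ref, du_dmu_ref, lam=1.0):
    """Hermite-style loss: solution misfit plus a penalty on parameter sensitivities."""
    mu = mu.clone().requires_grad_(True)
    u = model(mu, x)
    # sensitivities of the predicted solution w.r.t. the (coarse-mesh) parameters
    du_dmu = torch.autograd.grad(u.sum(), mu, create_graph=True)[0]
    return ((u - u_ref) ** 2).mean() + lam * ((du_dmu - du_dmu_ref) ** 2).mean()


def ls_solve_branch_head(model, mu, x, u_ref):
    """Solve the branch net's final linear layer exactly by least squares.

    With u_i = t(x_i)^T W phi(mu_i), the output is linear in W, so each row of the
    design matrix is the Kronecker product of t(x_i) and phi(mu_i).
    """
    head = model.branch[-1]
    with torch.no_grad():
        phi = model.branch[:-1](mu)          # (N, hidden) branch features
        t = model.trunk(x)                   # (N, p) trunk basis
        A = (t.unsqueeze(2) * phi.unsqueeze(1)).reshape(mu.shape[0], -1)  # (N, p*hidden)
        w = torch.linalg.lstsq(A, u_ref).solution
        head.weight.copy_(w.reshape(t.shape[1], phi.shape[1]))


def hybrid_training_step(model, optimizer, mu, x, u_ref, du_dmu_ref, lam=1.0):
    # 1) exact LS solve for the isolated final linear layer
    ls_solve_branch_head(model, mu, x, u_ref)
    # 2) one GD step on the remaining (nonlinear) parameters; the optimizer is
    #    assumed to have been built without the branch head's weights
    optimizer.zero_grad()
    loss = derivative_enhanced_loss(model, mu, x, u_ref, du_dmu_ref, lam)
    loss.backward()
    optimizer.step()
    return loss.item()
```

In a training loop, hybrid_training_step would be called once per iteration with the optimizer constructed over all parameters except the branch head's weights; the reference sensitivities du_dmu_ref correspond to the parameter derivatives exported from the automatically differentiated OpenFOAM FVM solver described in the abstract.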