Assessment of the Dyad Agentic Modeling Language in Process Systems Engineering

Fernando Arrais Romero Dias Lima
Anas Abdelrehim
Ashutosh Bharambe
Marius Micluța-Câmpeanu
Dhairya Gandhi
Venkateshprasad Bhat
Anshul Singhvi
Morten Piibeleht
Argimiro Secchi
Mauricio Bezerra de Souza
Mumin Enis Leblebici
Christopher Rackauckas
Idelfonso Nogueira

Abstract

In this work, Dyad, a modeling and simulation language with specialized multi-agent large language model (LLM) workflows, is introduced for Process Systems Engineering (PSE) applications. Dyad is designed to overcome key limitations of general-purpose LLMs, such as context collapse, retrieval failures, and physically inconsistent hallucinations, by decomposing engineering tasks across coordinated agents equipped with domain-specific tools, optimization routines, and access to mechanistic models. Using crystallization as a testbed, we evaluate Dyad against the main general-purpose agentic AI systems (ChatGPT 5.1 and Gemini 3) on three canonical PSE challenges: (i) soft sensing and system monitoring via an ATR-FTIR calibration problem for paracetamol in ethanol–water mixtures; (ii) dynamic modeling of potassium sulfate crystallization and dissolution through population balance models (PBMs) that Dyad iteratively refines, calibrates, and updates when necessary; and (iii) nonlinear model predictive control (NMPC) for batch crystallization, where Dyad proposes control objectives, constraints, and tuning parameters that achieve near set-point tracking with computation times compatible with real-time operation. Across these case studies, Dyad not only proposes state-of-the-art modeling strategies, but also automatically selects model structures, estimates parameters, and suggests refinements that improve extrapolation, particularly near equilibrium conditions, where it recovers physically consistent steady states while alternative PBMs exhibit nonphysical drift. When compared with general-purpose LLMs (ChatGPT and Google Gemini) prompted on the same tasks, Dyad delivers more accurate models, with coefficients of determination exceeding 0.98 for calibration tasks and reductions in mean absolute percentage error from over 100% to below 50% for crystallization dynamics, as well as implementable NMPC formulations with computation times below 7 s per control move.
In contrast, other LLMs remain confined to high-level suggestions or approximate parameter guesses and do not produce calibrated models or controllers suitable for real-time operation. These results position specialized multi-agent LLM workflows as practical, trustworthy assistants for accelerating monitoring, modeling, model update, and control in chemical and process industries.

Version published to 10.21203/rs.3.rs-8475139/v1 on Research Square
Jan 13, 2026

Related articles

Reasoning in Large Language Models: From Chain-of-Thought to Massively Decomposed Agentic Processes

This article has 8 authors:
1. Yiming Lei
2. Jiawei Xu
3. Chia Xin Liang
4. Ziqian Bi
5. Xiaoming Li
6. Danyang Zhang
7. Junhao Song
8. Zhenyu Yu
This article has no evaluations
Latest version Dec 24, 2025

Towards a Science of Scaling Agent Systems

This article has 20 authors:
1. Yubin Kim
2. Ken Gu
3. Chanwoo Park
4. Chunjong Park
5. Samuel Schmidgall
6. A. Ali Heydari
7. Yao Yan
8. Zhihan Zhang
9. Yuchen Zhuang
10. Yun Liu
11. Mark Malhotra
12. Paul Liang
13. Hae Won Park
14. Yuzhe Yang
15. Xuhai Xu
16. Yilun Du
17. Shwetak Patel
18. Tim Althoff
19. Daniel McDuff
20. Xin Liu
This article has no evaluations
Latest version Jan 23, 2026

Tool and Agent Selection for Large Language Model Agents in Production: A Survey

This article has 9 authors:
1. Elias Lumer
2. Anmol Gulati
3. Faheem Nizar
4. Dzmitry Hedroits
5. Atharva Mehta
6. Henry Hwangbo
7. Vamse Kumar Subbiah
8. Pradeep Honaganahalli Basavaraju
9. James A. Burke
This article has no evaluations
Latest version Dec 12, 2025
