arXiv

Hybrid Neural Ordinary Differential Equations for Data-Efficient Polymerization Modeling with Incomplete Kinetics

Title: Leveraging Hybrid Neural Ordinary Differential Equations for Efficient Polymerization Modeling Amidst Kinetic Uncertainty

Abstract:

Precise forecasting of polymerization behavior is a cornerstone for effective process design, control strategies, and optimization. However, traditional approaches face significant hurdles: mechanistic models necessitate time-consuming parameterization of kinetics that are only partially understood, whereas purely data-driven methods rely on extensive, high-quality datasets that are often prohibitively expensive to acquire, especially during the initial phases of design. To address these challenges, we introduce a hybrid Neural Ordinary Differential Equation (NODE) framework designed for data-efficient modeling of free-radical polymerization processes.

Utilizing the batch polymerization of methyl methacrylate (MMA) as a primary case study, our approach explicitly retains mechanistic mass balances. Within this structure, a neural network surrogate learns only the partially characterized effective radical concentration that drives monomer consumption, while well-established reactions—including initiator decomposition, propagation, and termination—are modeled physically.

We benchmarked the hybrid NODE against a purely data-driven NODE and a discrete-time feedforward neural network under conditions of sparse data, training all models on datasets containing as few as ten measurements, regardless of whether the sampling was regular or irregular. The hybrid NODE consistently outperformed the purely data-driven baselines, yielding lower prediction errors and generating extrapolations that are more physically consistent.

In a generalization test involving noisy data and previously unseen operating conditions, the hybrid NODE achieved a Root Mean Square Error (RMSE) of 0.013. This stands in stark contrast to the RMSE values of 0.31 for the data-driven NODE and 0.68 for the discrete-time model. These results demonstrate that isolating the learning process to a single closure term, rather than attempting to capture full dynamics, is sufficient to ensure reliable predictions even when data availability is severely limited.


Source: arXiv Generated at: 2026-06-02 00:00:00 UTC

Related Articles

Law’s Billable Hour Is Being Shredded by AI
Bloomberg

Law’s Billable Hour Is Being Shredded by AI

AI is dismantling the billable hour by automating routine legal tasks. This technological shift threatens the traditiona...

Iran War: Trump Tries to Stop Israel’s Lebanon Push | The Opening Trade 6/2/2026
Bloomberg

Iran War: Trump Tries to Stop Israel’s Lebanon Push | The Opening Trade 6/2/2026

SoftBank in Early Talks to Back $800 Million Agile Robots Round
Bloomberg

SoftBank in Early Talks to Back $800 Million Agile Robots Round

SoftBank is in early talks to back Agile Robots’ $800 million funding round. The Japanese tech giant is currently in pre...

Amundi Is Diversifying Risk Via Commodity Currencies, Gold
Bloomberg

Amundi Is Diversifying Risk Via Commodity Currencies, Gold

Amundi diversifies risk by investing in commodity-linked currencies and gold. This strategy hedges against market volati...

Reuters

Marvell Technology surges after Nvidia's Huang calls it 'next trillion-dollar company'

Marvell Technology shares surged after Nvidia CEO Jensen Huang labeled the firm the “next trillion-dollar company.”

Russia Says It Found Foreign Spyware on Top Officials’ Phones
Bloomberg

Russia Says It Found Foreign Spyware on Top Officials’ Phones

Russia’s FSB claims to have discovered foreign spyware on senior officials’ phones. Moscow attributes the intrusion to h...