If you see this, something is wrong

Collapse and expand sections

To get acquainted with the document, the best thing to do is to select the "Collapse all sections" item from the "View" menu. This will leave visible only the titles of the top-level sections.

Clicking on a section title toggles the visibility of the section content. If you have collapsed all of the sections, this will let you discover the document progressively, from the top-level sections to the lower-level ones.

Cross-references and related material

Generally speaking, anything that is blue is clickable.

Clicking on a reference link (like an equation number, for instance) will display the reference as close as possible, without breaking the layout. Clicking on the displayed content or on the reference link hides the content. This is recursive: if the content includes a reference, clicking on it will have the same effect. These "links" are not necessarily numbers, as it is possible in LaTeX2Web to use full text for a reference.

Clicking on a bibliographical reference (i.e., a number within brackets) will display the reference.

Speech bubbles indicate a footnote. Click on the bubble to reveal the footnote (there is no page in a web document, so footnotes are placed inside the text flow). Acronyms work the same way as footnotes, except that you have the acronym instead of the speech bubble.

Discussions

By default, discussions are open in a document. Click on the discussion button below to reveal the discussion thread. However, you must be registered to participate in the discussion.

If a thread has been initialized, you can reply to it. Any modification to any comment, or a reply to it, in the discussion is signified by email to the owner of the document and to the author of the comment.

First published on Sunday, Jun 7, 2026 and last modified on Wednesday, Jun 10, 2026 by François Chaplais.

PINNs Failure Modes are Overfitting

Nigel T. Andersen Graduate School of Information Science and Technology Email

Takashi Matsubara Graduate School of Information Science and Technology and RIKEN Center for Advanced Intelligence Project (AIP) Email

Abstract

Table 1. Comparison of double PINN at FP64 with the current top performing methods
Model	³Loss	rMAE	rRMSE	²\( N\)
Convection
¹PINNsFormer [21]	9.0e-4 \( \pm\) 1.0e-4	3.3e-2 \( \pm\) 6.8e-3	4.4e-2 \( \pm\) 7.3e-3	10403
¹PINNMamba [22]	1.0e-4 \( \pm\) 2.0e-5	1.8e-2 \( \pm\) 3.7e-3	2.0e-2 \( \pm\) 3.8e-3	10403
PINN_FP64 [20]	5.0e-6 \( \pm\) 1.0e-6	5.9e-3 \( \pm\) 1.3e-3	7.2e-3 \( \pm\) 1.7e-3	10403
double PINN (Ours)	4.0e-11 \( \pm\) 4.1e-11	4.9e-6 \( \pm\) 4.4e-6	5.4e-6 \( \pm\) 4.6e-6	800
Reaction
¹PINNsFormer [21]	3.0e-6 \( \pm\) 1.0e-6	1.5e-2 \( \pm\) 1.3e-3	3.0e-2 \( \pm\) 2.7e-3	10403
¹PINNMamba [22]	1.0e-6 \( \pm\) 1.0e-6	9.2e-3 \( \pm\) 1.7e-3	2.1e-2 \( \pm\) 3.6e-3	10403
PINN_FP64 [20]	1.0e-5 \( \pm\) 5.0e-6	2.7e-2 \( \pm\) 6.3e-3	5.0e-2 \( \pm\) 1.1e-2	10403
double PINN (Ours)	3.5e-12 \( \pm\) 3.5e-13	1.1e-5 \( \pm\) 2.3e-6	2.9e-5 \( \pm\) 8.0e-6	456
Wave
¹PINNsFormer [21]	2.3e-2 \( \pm\) 1.7e-3	3.5e-1 \( \pm\) 8.7e-2	3.6e-1 \( \pm\) 8.7e-2	10504
¹PINNMamba [22]	2.0e-4 \( \pm\) 3.0e-5	1.9e-2 \( \pm\) 3.3e-3	2.0e-2 \( \pm\) 3.3e-3	10504
PINN_FP64 [20]	4.2e-5 \( \pm\) 1.6e-5	8.0e-3 \( \pm\) 3.2e-3	8.1e-3 \( \pm\) 3.1e-3	10504
double PINN (Ours)	3.5e-7 \( \pm\) 8.9e-8	3.8e-4 \( \pm\) 1.0e-4	3.8e-4 \( \pm\) 1.0e-4	444
Allen–Cahn
¹PINNsFormer [21]	4.6e-1 \( \pm\) 2.9e-1	9.9e-1 \( \pm\) 4.0e-2	9.9e-1 \( \pm\) 4.2e-2	10504
¹PINNMamba [22]	2.7e-3 \( \pm\) 2.0e-4	1.4e-1 \( \pm\) 1.2e-2	2.7e-1 \( \pm\) 2.0e-2	10504
PINN_FP64 [20]	1.3e-5 \( \pm\) 4.0e-6	1.6e-2 \( \pm\) 3.6e-3	5.5e-2 \( \pm\) 1.1e-2	10504
double PINN (Ours)	4.0e-5 \( \pm\) 6.1e-10	9.6e-5 \( \pm\) 2.0e-5	3.6e-4 \( \pm\) 9.8e-5	4396
¹ PINNsFormer and PINNMamba data, and high-resolution numerical solution for Allen-Cahn taken from Ref. [20]
² \( N\) is the total number of collocation points (domain + initial + boundary).
³ Loss is not directly comparable across the methods due to the regularization penalty, but is included here for reference.

Appendix

References

[1] Dimitris C Psichogios and Lyle H Ungar A hybrid neural network-first principles approach to process modeling AIChE J. 1992 38 10 1499–1511

[2] Jiequn Han and Arnulf Jentzen and Weinan E Solving high-dimensional partial differential equations using deep learning Proc. Natl. Acad. Sci. U.S.A. 2018 115 34 8505–8510

[3] Isaac E Lagaris and Aristidis Likas and Dimitrios I Fotiadis Artificial neural networks for solving ordinary and partial differential equations IEEE Trans. Neural Netw. Learn. Syst. 1998 9 5 987–1000

[4] Weinan E and Bing Yu The deep Ritz method: a deep learning-based numerical algorithm for solving variational problems Commun. Math. Stat. 2018 6 1 1–12

[5] Zongyi Li and Nikola Kovachki and Kamyar Azizzadenesheli and Burigede Liu and Kaushik Bhattacharya and Andrew Stuart and Anima Anandkumar Neural operator: Graph kernel network for partial differential equations arXiv preprint arXiv:2003.03485 2020

[6] Zongyi Li and Nikola Kovachki and Kamyar Azizzadenesheli and Burigede Liu and Kaushik Bhattacharya and Andrew Stuart and Anima Anandkumar Fourier neural operator for parametric partial differential equations International Conference on Learning Representations 2020

[7] Tianping Chen and Hong Chen Universal approximation to nonlinear operators by neural networks with arbitrary activation functions and its application to dynamical systems IEEE Trans. Neural Netw. Learn. Syst. 1995 6 4 911–917

[8] Zongyi Li and Hongkai Zheng and Nikola Kovachki and David Jin and Haoxuan Chen and Burigede Liu and Kamyar Azizzadenesheli and Anima Anandkumar Physics-informed neural operator for learning partial differential equations ACM/IMS J. Data Sci. 2024 1 3 1–27

[9] Fabrice Rossi and Brieuc Conan-Guez Functional multi-layer perceptron: a non-linear tool for functional data analysis Neural Netw. 2005 18 1 45–60

[10] Lu Lu and Pengzhan Jin and Guofei Pang and Zhongqiang Zhang and George Em Karniadakis Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators Nat. Mach. Intell. 2021 3 3 218–229

[11] Maziar Raissi and Paris Perdikaris and George E Karniadakis Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations J. Comput. Phys. 2019 378 686–707

[12] Aditi Krishnapriyan and Amir Gholami and Shandian Zhe and Robert Kirby and Michael W Mahoney Characterizing possible failure modes in physics-informed neural networks Advances in Neural Information Processing Systems 2021 34 26548–26560

[13] Sifan Wang and Xinling Yu and Paris Perdikaris When and why PINNs fail to train: A neural tangent kernel perspective J. Comput. Phys. 2022 449 110768

[14] Songming Liu and Hao Zhongkai and Chengyang Ying and Hang Su and Jun Zhu and Ze Cheng A unified hard-constraint framework for solving geometrically complex pdes Advances in Neural Information Processing Systems 2022 35 20287–20299

[15] Haixu Wu and Yuezhou Ma and Hang Zhou and Huikun Weng and Jianmin Wang and Mingsheng Long ProPINN: Demystifying propagation failures in physics-informed neural networks arXiv preprint arXiv:2502.00803 2025

[16] Arka Daw and Jie Bu and Sifan Wang and Paris Perdikaris and Anuj Karpatne Mitigating Propagation Failures in Physics-informed Neural Networks using Retain-Resample-Release (R3) Sampling International Conference on Machine Learning 2023 7264–7302 PMLR

[17] Sifan Wang and Shyam Sankaran and Paris Perdikaris Respecting causality for training physics-informed neural networks Comput. Methods Appl. Mech. Eng. 2024 421 116813

[18] Pratik Rathore and Weimu Lei and Zachary Frangella and Lu Lu and Madeleine Udell Challenges in Training PINNs: A Loss Landscape Perspective International Conference on Machine Learning 2024 42159–42191 PMLR

[19] Songming Liu and Chang Su and Jiachen Yao and Zhongkai Hao and Hang Su and Youjia Wu and Jun Zhu Preconditioning for physics-informed neural networks arXiv preprint arXiv:2402.00531 2024

[20] Chenhui Xu and Dancheng Liu and Amir Nassereldine and Jinjun Xiong Fp64 is all you need: Rethinking failure modes in physics-informed neural networks Advances in Neural Information Processing Systems 2025 38 142949–142970

[21] Zhiyuan Zhao and Xueying Ding and B. Aditya Prakash PINN The Twelfth International Conference on Learning Representations 2024

[22] Chenhui Xu and Dancheng Liu and Yuting Hu and Jiajie Li and Ruiyang Qin and Qingxiao Zheng and Jinjun Xiong Sub-Sequential Physics-Informed Learning with State Space Model International Conference on Machine Learning 2025 69507–69525 PMLR

[23] Harris Drucker and Yann Le Cun Improving generalization performance using double backpropagation IEEE Trans. Neural Netw. 1992 3 6 991–997

[24] Justin Sirignano and Konstantinos Spiliopoulos DGM: A deep learning algorithm for solving partial differential equations J. Comput. Phys. 2018 375 1339–1364

[25] Jiequn Han and Arnulf Jentzen and others Deep learning-based numerical methods for high-dimensional parabolic partial differential equations and backward stochastic differential equations Commun. Math. Stat. 2017 5 4 349–380

[26] M W M Gamini Dissanayake and Nhan Phan-Thien Neural-network-based approximations for solving partial differential equations Commun. Numer. Methods Eng. 1994 10 3 195–201

[27] Sifan Wang and Hanwen Wang and Paris Perdikaris On the eigenvector bias of Fourier feature networks: From regression to solving multi-scale PDEs with physics-informed neural networks Comput. Methods Appl. Mech. Eng. 2021 384 113938

[28] Levi D McClenny and Ulisses M Braga-Neto Self-adaptive physics-informed neural networks J. Comput. Phys. 2023 474 111722

[29] Gregory Kang Ruey Lau and Apivich Hemachandra and See-Kiong Ng and Bryan Kian Hsiang Low PINNACLE The Twelfth International Conference on Learning Representations 2024

[30] Viggo Moro and Luiz Chamon Solving differential equations with constrained learning International Conference on Learning Representations 2025 2025 44906–44948

[31] Jerry Liu and Yasa Baig and Denise Hui Jean Lee and Rajat Vadiraj Dwaraknath and Atri Rudra and Chris Ré BWLer: Barycentric Weight Layer Elucidates a Precision-Conditioning Tradeoff for PINNs arXiv preprint arXiv:2506.23024 2025

[32] Youngsik Hwang and Dong-Young Lim Dual cone gradient descent for training physics-informed neural networks Advances in Neural Information Processing Systems 2024 37 98563–98595

[33] Ian Goodfellow and Yoshua Bengio and Aaron Courville and Yoshua Bengio Deep learning MIT press Cambridge 2016 1 2

[34] Kevin P Murphy Probabilistic machine learning: an introduction MIT press 2022

[35] Anders Krogh and John Hertz A simple weight decay can improve generalization Advances in Neural Information Processing Systems 1991 4

[36] Robert Tibshirani Regression shrinkage and selection via the lasso J. R. Stat. Soc. Ser. B Stat. Methodol. 1996 58 1 267–288

[37]

Collapse and expand sections

Cross-references and related material

Discussions

Table of contents

Abstract

1 Introduction

1.1 Contributions.

2 Related Works

2.1 Physics Informed Neural Networks and Their Failure Modes

2.2 Regularization

3 Experimental Setup

3.1 Physics Informed Neural Networks

4 Failure Modes are Caused by Overfitting

4.1 Optimization Difficulty or Failure Mode

4.1.1 Convection Equation (FP32 Failure)

4.1.2 Wave Equation (64 bit failure)

4.2 Regularization Combined with Precision is Optimal

4.3 Optimizing the Performance of Double PINN

5 Regularization of PINNs

5.1 Regularization Strategies

6 Conclusions

A Experimental Setup

A.1 Network Setup

B Partial Differential Equations

B.1 1D Convection or Linear Advection

B.2 Wave Equation

B.3 Reaction Equation

B.4 Allen-Cahn Equation

C Limitations

D Broader Impacts

References

Discussion: create topic login to participate.

Dynamic display of documents.

Collapse and expand sections

Cross-references and related material

Discussions

Table of contents

Abstract

1 Introduction

1.1 Contributions.

2 Related Works

2.1 Physics Informed Neural Networks and Their Failure Modes

2.2 Regularization

3 Experimental Setup

3.1 Physics Informed Neural Networks

4 Failure Modes are Caused by Overfitting

4.1 Optimization Difficulty or Failure Mode

4.1.1 Convection Equation (FP32 Failure)

4.1.2 Wave Equation (64 bit failure)

4.2 Regularization Combined with Precision is Optimal

4.3 Optimizing the Performance of Double PINN

5 Regularization of PINNs

5.1 Regularization Strategies

6 Conclusions

A Experimental Setup

A.1 Network Setup

B Partial Differential Equations

B.1 1D Convection or Linear Advection

B.2 Wave Equation

B.3 Reaction Equation

B.4 Allen-Cahn Equation

C Limitations

D Broader Impacts

References

Discussion: create topic login to participate.