Physics-aware multi-task learning for atmospheric turbulence parameterization: auxiliary tasks versus architectural conditioning

[thumbnail of pasc_2026_sambit_camera_ready_v1.pdf]
Text
- Accepted Version
· Restricted to Repository staff only
· The Copyright of this document has not been checked yet. This may affect its availability.

Please see our End User Agreement.

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Panda, S. K., Jones, T. R. ORCID: https://orcid.org/0000-0002-7669-1499, Shahzad, M. ORCID: https://orcid.org/0009-0002-9394-343X, Lawrence, B. N. ORCID: https://orcid.org/0000-0001-9262-7860 and Ellis, A.-L. (2026) Physics-aware multi-task learning for atmospheric turbulence parameterization: auxiliary tasks versus architectural conditioning. In: PASC26 Conference, 29 Jun - 1 Jul 2026, Bern, Switzerland. doi: 10.1145/3815572.3815750 (In Press)

Abstract/Summary

Dynamic subgrid-scale (SGS) turbulence parameterizations in Large Eddy Simulation (LES) achieve superior physical fidelity but impose 2–4× computational overhead compared to static schemes, creating a critical bottleneck for high-resolution atmospheric modeling on HPC systems. Neural network based emulation offers a pathway to comparable accuracy at reduced computational cost, but realizing this potential requires architectures that generalize reliably across diverse atmospheric conditions and variable grid configurations. We systematically compare two physics-aware multi-task learning strategies for emulating Smagorinsky-based SGS closure in the UK Met Office NERC Cloud Model (MONC): a baseline approach using Richardson number prediction as auxiliary gradient regularization, and an Ri-conditioned approach that explicitly feeds predicted stability into coefficient (viscosity and diffusion) prediction heads. Evaluating 54 model configurations across three neural architectures (multi-layer perceptron (MLP), MLP with residual blocks (ResMLP) and Tabular Transformer (TabTransformer)) trained on mixed-resolution, multi-regime atmospheric data (66% coarse tropical convection, 34% fine shallow cumulus), we find that uncertainty-based task weighting consistently outperforms manual tuning and dynamic weighting alternatives. The simple MLPs with Richardson conditioning provide the best robustness-accuracy trade-off under distribution shift during inference, and the architectural complexity amplifies cross-regime failures despite improving in-distribution metrics. Notably, models maintain physical constraint compliance even when predictive accuracy degrades substantially, suggesting that the data coverage limitations, rather than any fundamental physics incompatibility, drive the cross-regime transfer failures. All results represent offline validation on static simulation data. Ongoing work focuses on online MONC integration to assess numerical stability, energy conservation, and computational performance under coupled feedback dynamics.

Altmetric Badge

Dimensions Badge

Item Type Conference or Workshop Item (Paper)
URI https://centaur.reading.ac.uk/id/eprint/129861
Identification Number/DOI 10.1145/3815572.3815750
Refereed Yes
Divisions Science > School of Mathematical, Physical and Computational Sciences > Department of Computer Science
Science > School of Mathematical, Physical and Computational Sciences > NCAS
Science > School of Mathematical, Physical and Computational Sciences > Department of Meteorology
Uncontrolled Keywords Large Eddy Simulation, Machine Learning, Turbulence Parameterization, Multi-Task Learning, Atmospheric Physics
Download/View statistics View download statistics for this item

University Staff: Request a correction | Centaur Editors: Update this record