A Python interface to the Fortran-based Parallel Data Assimilation Framework: pyPDAF v1.0.2

Chen, Y. ORCID: https://orcid.org/0000-0002-2319-6937, Nerger, L. and Lawless, A. S. ORCID: https://orcid.org/0000-0002-3016-6568 (2025) A Python interface to the Fortran-based Parallel Data Assimilation Framework: pyPDAF v1.0.2. Geoscientific Model Development. ISSN 1991-9603 (In Press)

Abstract/Summary

Data assimilation (DA) is an essential component of numerical weather and climate prediction. Efficient implementation of DA algorithms benefits both research and operational prediction. A variety of DA software is currently available. One notable DA library is the Parallel Data Assimilation Framework (PDAF), which is designed for ensemble data assimilation. PDAF is widely used with complex high-dimensional climate models and is applied in research on atmosphere, ocean, sea ice and marine ecosystem modelling, as well as in operational ocean forecasting. Meanwhile, demand is growing for flexible and efficient DA implementations in Python, driven by the increasing number of intermediate-complexity models and machine-learning-based models coded in Python. To accommodate this demand, we introduce pyPDAF, a Python interface to PDAF. pyPDAF allows flexible DA system development while retaining the efficient implementation of the core DA algorithms in the Fortran-based PDAF. The ideal use case for pyPDAF is either a DA system in which the model integration is independent of the DA program, which reads the model forecast ensemble, produces an analysis, and updates the restart files of the model, or a DA system in which the model can be used in Python. With implementations of both PDAF and pyPDAF, this study demonstrates their use in a coupled data assimilation (CDA) setup with a coupled atmosphere-ocean model, the Modular Arbitrary-Order Ocean-Atmosphere Model (MAOOAM). The study demonstrates that pyPDAF provides access to PDAF functionality from Python, where users can use Python functions to handle case-specific information from the observations and the numerical model. It also shows that pyPDAF can be used with high-dimensional systems with little slow-down: at most 13% per analysis step for the local ensemble transform Kalman filter (LETKF) in the example used in this study. Finally, relative to PDAF, the overhead of pyPDAF becomes comparatively smaller when computationally intensive components dominate the DA system, which can be the case for systems with high-dimensional state vectors.
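The analysis step in the offline workflow described above (read a forecast ensemble, produce an analysis, write it back) can be illustrated with a minimal NumPy sketch. This is not the pyPDAF or PDAF interface: the function name `etkf_analysis` and its arguments are hypothetical, and the filter shown is a global ensemble transform Kalman filter without localization, a simpler relative of the LETKF mentioned in the abstract. It assumes a linear observation operator and a diagonal observation error covariance.

```python
import numpy as np


def etkf_analysis(ensemble, obs, obs_operator, obs_var):
    """Illustrative global ETKF analysis step (hypothetical, not pyPDAF).

    ensemble:     (n_state, n_ens) forecast ensemble, one member per column
    obs:          (n_obs,) observation vector
    obs_operator: (n_obs, n_state) linear observation operator H
    obs_var:      (n_obs,) observation error variances (diagonal R)
    """
    n_ens = ensemble.shape[1]

    # Split the forecast ensemble into mean and perturbations.
    mean = ensemble.mean(axis=1, keepdims=True)
    pert = ensemble - mean

    # Project the ensemble into observation space.
    obs_ens = obs_operator @ ensemble
    obs_mean = obs_ens.mean(axis=1, keepdims=True)
    obs_pert = obs_ens - obs_mean

    # A = (N - 1) I + Yp^T R^-1 Yp, a small (n_ens, n_ens) symmetric matrix.
    scaled = obs_pert / obs_var[:, None]  # R^-1 Yp
    a_mat = (n_ens - 1) * np.eye(n_ens) + obs_pert.T @ scaled
    evals, evecs = np.linalg.eigh(a_mat)

    # Weights for the mean update: A^-1 Yp^T R^-1 (y - H x_mean).
    a_inv = (evecs / evals) @ evecs.T
    innovation = obs[:, None] - obs_mean
    w_mean = a_inv @ (scaled.T @ innovation)

    # Symmetric square root of (N - 1) A^-1 transforms the perturbations.
    w_pert = (evecs * np.sqrt((n_ens - 1) / evals)) @ evecs.T

    # Analysis ensemble: forecast mean plus weighted forecast perturbations.
    return mean + pert @ (w_mean + w_pert)


# Toy example: 3 state variables, 8 members, first two variables observed.
rng = np.random.default_rng(0)
forecast = rng.normal(loc=1.0, scale=2.0, size=(3, 8))
H = np.eye(3)[:2]
y = np.array([0.5, -0.2])
analysis = etkf_analysis(forecast, y, H, obs_var=np.array([0.1, 0.1]))
```

In an offline setup, `forecast` would be assembled from the model's output files and `analysis` written back to its restart files; that file handling is model-specific and is omitted here.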

Item Type: Article
Refereed: Yes
Divisions: Science > School of Mathematical, Physical and Computational Sciences > National Centre for Earth Observation (NCEO)
Science > School of Mathematical, Physical and Computational Sciences > Department of Mathematics and Statistics
Science > School of Mathematical, Physical and Computational Sciences > Department of Meteorology
ID Code: 125042
Publisher: European Geosciences Union