Accessibility navigation


Hydra-LSTM: a semi-shared Machine Learning architecture for prediction across Watersheds

Ruparell, K., Marks, R., Wood, A., Hunt, K. ORCID: https://orcid.org/0000-0003-1480-3755, Cloke, H. ORCID: https://orcid.org/0000-0002-1472-868X, Prudhomme, C., Pappenberger, F. and Chantry, M. (2025) Hydra-LSTM: a semi-shared Machine Learning architecture for prediction across Watersheds. Artificial Intelligence for the Earth Systems. ISSN 2769-7525 (In Press)

[thumbnail of Hydra_LSTM_Final_Version.pdf] Text - Accepted Version
· Restricted to Repository staff only
· The Copyright of this document has not been checked yet. This may affect its availability.

2MB

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

Abstract/Summary

Long Short Term Memory networks (LSTMs) are used to build single models that predict river discharge across many catchments. These models offer greater accuracy than models trained on each catchment independently, if the same variables are used as inputs for each catchment. However, the same data is rarely available for all catchments. This prevents the use of variables available only in some catchments, such as historic river discharge or upstream discharge. The only existing method that allows for optional variables requires all variables to be in the initial training of the model, limiting its transferability to new catchments. To address this limitation, we develop the Hydra-LSTM. The Hydra-LSTM is able to use some variables across all catchments to make predictions, and use further variables in other catchments where they are helpful and available. This allows general training and the use of catchment-specific data. The bulk of the model can be shared across catchments, maintaining the benefits of multi-catchment models to generalize while also benefiting from the using bespoke data. We apply this methodology to 2 day-ahead river discharge prediction in the Western US, a small enough time step to expect our models to be skilful and difficult enough to expect differences between models. We obtain more accurate quantile predictions than Multi-Catchment and Single-Catchment LSTMs while allowing forecasters to introduce and remove variables from their prediction set. We test the ability of the Hydra-LSTM to incorporate catchment-specific data, introducing historical river discharge as a catchment-specific input, outperforming other commonly used models.

Item Type:Article
Refereed:Yes
Divisions:Science > School of Archaeology, Geography and Environmental Science > Department of Geography and Environmental Science
Science > School of Mathematical, Physical and Computational Sciences > Department of Meteorology
ID Code:123274
Publisher:American Meteorological Society

University Staff: Request a correction | Centaur Editors: Update this record

Page navigation