Hydra-LSTM: a semi-shared machine learning architecture for prediction across watersheds

[thumbnail of Open Access]
Preview
Text (Open Access)
- Published Version
· Available under License Creative Commons Attribution.
[thumbnail of Hydra_LSTM_Final_Version.pdf]
Text
- Accepted Version
· Restricted to Repository staff only
· The Copyright of this document has not been checked yet. This may affect its availability.

Please see our End User Agreement.

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Ruparell, K., Marks, R. J., Wood, A., Hunt, K. M. R. ORCID: https://orcid.org/0000-0003-1480-3755, Cloke, H. L. ORCID: https://orcid.org/0000-0002-1472-868X, Prudhomme, C., Pappenberger, F. and Chantry, M. (2025) Hydra-LSTM: a semi-shared machine learning architecture for prediction across watersheds. Artificial Intelligence for the Earth Systems, 4 (3). ISSN 2769-7525 doi: 10.1175/AIES-D-24-0103.1

Abstract/Summary

Long Short Term Memory networks (LSTMs) are used to build single models that predict river discharge across many catchments. These models offer greater accuracy than models trained on each catchment independently, if the same variables are used as inputs for each catchment. However, the same data is rarely available for all catchments. This prevents the use of variables available only in some catchments, such as historic river discharge or upstream discharge. The only existing method that allows for optional variables requires all variables to be in the initial training of the model, limiting its transferability to new catchments. To address this limitation, we develop the Hydra-LSTM. The Hydra-LSTM is able to use some variables across all catchments to make predictions, and use further variables in other catchments where they are helpful and available. This allows general training and the use of catchment-specific data. The bulk of the model can be shared across catchments, maintaining the benefits of multi-catchment models to generalize while also benefiting from the using bespoke data. We apply this methodology to 2 day-ahead river discharge prediction in the Western US, a small enough time step to expect our models to be skilful and difficult enough to expect differences between models. We obtain more accurate quantile predictions than Multi-Catchment and Single-Catchment LSTMs while allowing forecasters to introduce and remove variables from their prediction set. We test the ability of the Hydra-LSTM to incorporate catchment-specific data, introducing historical river discharge as a catchment-specific input, outperforming other commonly used models.

Altmetric Badge

Item Type Article
URI https://centaur.reading.ac.uk/id/eprint/123274
Identification Number/DOI 10.1175/AIES-D-24-0103.1
Refereed Yes
Divisions Science > School of Archaeology, Geography and Environmental Science > Department of Geography and Environmental Science
Science > School of Mathematical, Physical and Computational Sciences > Department of Meteorology
Publisher American Meteorological Society
Download/View statistics View download statistics for this item

Downloads

Downloads per month over past year

University Staff: Request a correction | Centaur Editors: Update this record