Hydra-LSTM: a semi-shared machine learning architecture for prediction across watersheds

Ruparell, Karan; Marks, Robert J.; Wood, Andy; Hunt, Kieran M. R.; Cloke, Hannah L.; Prudhomme, Christel; Pappenberger, Florian; Chantry, Matthew

Download

Preview

Text (Open Access)
- Published Version
· Available under License Creative Commons Attribution.

[thumbnail of Hydra_LSTM_Final_Version.pdf]

Text
- Accepted Version
· Restricted to Repository staff only
· The Copyright of this document has not been checked yet. This may affect its availability.

Advice

Please see our End User Agreement.

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

Tools

Lists

Ruparell, K., Marks, R. J., Wood, A., Hunt, K. M. R. ORCID: https://orcid.org/0000-0003-1480-3755, Cloke, H. L. ORCID: https://orcid.org/0000-0002-1472-868X, Prudhomme, C., Pappenberger, F. and Chantry, M. (2025) Hydra-LSTM: a semi-shared machine learning architecture for prediction across watersheds. Artificial Intelligence for the Earth Systems, 4 (3). ISSN 2769-7525 doi: 10.1175/AIES-D-24-0103.1

Abstract/Summary

Long Short Term Memory networks (LSTMs) are used to build single models that predict river discharge across many catchments. These models offer greater accuracy than models trained on each catchment independently, if the same variables are used as inputs for each catchment. However, the same data is rarely available for all catchments. This prevents the use of variables available only in some catchments, such as historic river discharge or upstream discharge. The only existing method that allows for optional variables requires all variables to be in the initial training of the model, limiting its transferability to new catchments. To address this limitation, we develop the Hydra-LSTM. The Hydra-LSTM is able to use some variables across all catchments to make predictions, and use further variables in other catchments where they are helpful and available. This allows general training and the use of catchment-specific data. The bulk of the model can be shared across catchments, maintaining the benefits of multi-catchment models to generalize while also benefiting from the using bespoke data. We apply this methodology to 2 day-ahead river discharge prediction in the Western US, a small enough time step to expect our models to be skilful and difficult enough to expect differences between models. We obtain more accurate quantile predictions than Multi-Catchment and Single-Catchment LSTMs while allowing forecasters to introduce and remove variables from their prediction set. We test the ability of the Hydra-LSTM to incorporate catchment-specific data, introducing historical river discharge as a catchment-specific input, outperforming other commonly used models.

Altmetric Badge

Dimensions Badge

Item Type	Article
URI	https://centaur.reading.ac.uk/id/eprint/123274
Identification Number/DOI	10.1175/AIES-D-24-0103.1
Refereed	Yes
Divisions	Science > School of Archaeology, Geography and Environmental Science > Department of Geography and Environmental Science Science > School of Mathematical, Physical and Computational Sciences > Department of Meteorology
Publisher	American Meteorological Society
Download/View statistics	View download statistics for this item

Download Statistics

Downloads

Downloads per month over past year

Deposit Details

CORE (COnnecting REpositories)

University Staff: Request a correction | Centaur Editors: Update this record

Date Deposited:	18 Jun 2025 09:31	Date item deposited into CentAUR
Last Modified:	08 Mar 2026 08:05	Date item last modified