How do I know if my forecasts are better? Using benchmarks in hydrological ensemble prediction

Pappenberger, F.; Ramos, M. H.; Cloke, H. L.; Wetterhall, F.; Alfieri, L.; Bogner, K.; Mueller, A.; Salamon, P.

Text (Open Access) - Published Version · Available under License Creative Commons Attribution.

Pappenberger, F., Ramos, M. H., Cloke, H. L. ORCID: https://orcid.org/0000-0002-1472-868X, Wetterhall, F., Alfieri, L., Bogner, K., Mueller, A. and Salamon, P. (2015) How do I know if my forecasts are better? Using benchmarks in hydrological ensemble prediction. Journal of Hydrology, 522. pp. 697-713. ISSN 0022-1694 doi: 10.1016/j.jhydrol.2015.01.024

Abstract/Summary

The skill of a forecast can be assessed by comparing the relative proximity of both the forecast and a benchmark to the observations. Example benchmarks include climatology or a naïve forecast. Hydrological ensemble prediction systems (HEPS) are currently transforming the hydrological forecasting environment, but in this new field there is little information to guide researchers and operational forecasters on how benchmarks can best be used to evaluate their probabilistic forecasts. This study identifies that the calculated forecast skill can vary depending on the benchmark selected, and that the selection of a benchmark for determining forecasting system skill is sensitive to a number of hydrological and system factors. A benchmark intercomparison experiment is then undertaken using the continuous ranked probability score (CRPS), a reference forecasting system and a suite of 23 different methods to derive benchmarks. The benchmarks are assessed within the operational set-up of the European Flood Awareness System (EFAS) to determine those that are ‘toughest to beat’ and so give the most robust discrimination of forecast skill, particularly for the spatial average fields that EFAS relies upon. When evaluated against an observed discharge proxy, the benchmark that has the most utility for EFAS and avoids the most naïve skill across different hydrological situations is found to be meteorological persistency. This benchmark uses the latest meteorological observations of precipitation and temperature to drive the hydrological model. Hydrological long-term average benchmarks, which are currently used in EFAS, are very easily beaten by the forecasting system, and their use produces much naïve skill. When the results are decomposed into seasons, the advanced meteorological benchmarks, which make use of meteorological observations from the past 20 years at the same calendar date, have the most skill discrimination. They are also good at discriminating skill in low flows and for all catchment sizes. Simpler meteorological benchmarks are particularly useful for high flows. Recommendations for EFAS are to move to routine use of meteorological persistency, an advanced meteorological benchmark and a simple meteorological benchmark in order to provide a robust evaluation of forecast skill. This work provides the first comprehensive evidence on how benchmarks can be used in the evaluation of skill in probabilistic hydrological forecasts, and on which benchmarks are most useful for skill discrimination and avoidance of naïve skill in a large-scale HEPS. It is recommended that all HEPS use the evidence and methodology provided here to evaluate which benchmarks to employ, so that forecasters can trust their skill evaluation and be confident that their forecasts are indeed better.
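
As a rough illustration of the skill calculation the abstract describes, the sketch below scores an ensemble forecast and a benchmark with the CRPS and then forms a skill score from their ratio. It is a minimal sketch, not the paper's implementation: the abstract does not say which CRPS estimator was used, so the common empirical ensemble estimator is assumed here, and the function names (`crps_ensemble`, `mean_crps`, `crps_skill_score`) and all discharge values are hypothetical.

```python
from typing import Sequence


def crps_ensemble(members: Sequence[float], obs: float) -> float:
    """Empirical CRPS of one ensemble against one observation.

    Assumed estimator: mean|x_i - y| - 0.5 * mean|x_i - x_j|.
    Lower is better; for a single member it reduces to absolute error.
    """
    n = len(members)
    if n == 0:
        raise ValueError("ensemble must contain at least one member")
    accuracy = sum(abs(m - obs) for m in members) / n
    spread = sum(abs(a - b) for a in members for b in members) / (2 * n * n)
    return accuracy - spread


def mean_crps(ensembles: Sequence[Sequence[float]], observations: Sequence[float]) -> float:
    """Average CRPS over paired forecasts and observations."""
    if len(ensembles) != len(observations) or not observations:
        raise ValueError("need equal, non-zero numbers of forecasts and observations")
    scores = [crps_ensemble(e, o) for e, o in zip(ensembles, observations)]
    return sum(scores) / len(scores)


def crps_skill_score(forecast_crps: float, benchmark_crps: float) -> float:
    """Skill relative to a benchmark: 1 is perfect, 0 matches the benchmark,
    negative means the benchmark was closer to the observations."""
    if benchmark_crps == 0:
        raise ValueError("benchmark CRPS is zero; skill score is undefined")
    return 1.0 - forecast_crps / benchmark_crps


# Illustrative numbers only (not taken from the study).
observed = [2.0, 4.0]                              # observed discharge proxy
forecast = [[1.0, 2.0, 3.0], [3.0, 4.0, 5.0]]      # ensemble forecast
weak_benchmark = [[5.0], [7.0]]                    # e.g. a long-term average
tough_benchmark = [[1.5, 2.5], [3.5, 4.5]]         # e.g. a persistence-style benchmark

fc = mean_crps(forecast, observed)
skill_vs_weak = crps_skill_score(fc, mean_crps(weak_benchmark, observed))
skill_vs_tough = crps_skill_score(fc, mean_crps(tough_benchmark, observed))
```

With these made-up inputs the same forecast shows high skill against the weak benchmark and no skill against the tougher one, which is the sense in which an easily beaten benchmark produces naïve skill.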

Item Type	Article
URI	https://centaur.reading.ac.uk/id/eprint/39072
Identification Number/DOI	10.1016/j.jhydrol.2015.01.024
Refereed	Yes
Divisions	Interdisciplinary Research Centres (IDRCs) > The Pearl; Science > School of Archaeology, Geography and Environmental Science > Department of Geography and Environmental Science; Interdisciplinary centres and themes > Soil Research Centre; Science > School of Mathematical, Physical and Computational Sciences > Department of Meteorology
Uncontrolled Keywords	Hydrological ensemble prediction; Forecast performance; Evaluation; Verification; Benchmark; Probabilistic forecasts
Publisher	Elsevier

Funders:	Natural Environment Research Council
Projects:	Susceptibility of catchments to Intense rainfall and flooding (SINATRA). Funded by: Natural Environment Research Council (NE/K00896X/1 - £767,096). Local Lead (PI): Hannah Cloke. Project Lead: Hannah Louise Cloke. 19 April 2013 - 18 October 2016.

Date Deposited:	29 Jan 2015 11:55
Last Modified:	15 Jun 2025 16:19