Hostname: page-component-cb9f654ff-mnl9s Total loading time: 0 Render date: 2025-08-29T15:54:00.890Z Has data issue: false hasContentIssue false

Global analysis of temporal clusters of storm surges

Published online by Cambridge University Press:  01 August 2025

Ariadna Martín*
Affiliation:
Department of Civil, Environmental and Construction Engineering, University of Central Florida, Orlando, FL, USA National Center for Integrated Coastal Research, University of Central Florida, Orlando, FL, USA
Robert Jane
Affiliation:
Department of Civil, Environmental and Construction Engineering, University of Central Florida, Orlando, FL, USA National Center for Integrated Coastal Research, University of Central Florida, Orlando, FL, USA
Alejandra R. Enriquez
Affiliation:
School of Geosciences, College of Arts & Sciences, University of South Florida, St Petersburg, FL, USA
Thomas Wahl
Affiliation:
Department of Civil, Environmental and Construction Engineering, University of Central Florida, Orlando, FL, USA National Center for Integrated Coastal Research, University of Central Florida, Orlando, FL, USA
*
Corresponding author: Ariadna Martín; Email: ariadna.martinoliva@ucf.edu
Rights & Permissions [Opens in a new window]

Abstract

Temporal storm surge clustering refers to a series of events affecting the same region within a short period of time, which can strongly influence coastal flooding impacts and erosion. Here, we analyze global storm surge clustering from tide gauges and a state-of-the-art global model hindcast to identify geographical hotspots of extreme storm surge clusters and assess event frequencies. We study the spatial distribution as well as the contribution of different event intensities to clustering. On average, globally, 92% of coastal locations show significant temporal clustering for 1-year return period events, and 25% for 5-year return level events, although notable spatial differences exist. Our results reveal two distinct clustering regimes: (i) short timescale clustering, where events occur in rapid succession (intra-annual), and (ii) long timescales (inter-annual), providing varying recovery times between events. We also test the validity of assuming a Poisson distribution, commonly used in storm surge frequency analyses. Our results show that >80% of the stations analyzed do not follow a Poisson distribution, at least when including events that are not the most extreme but exceeded, for example, the 1-year return level. These findings offer insights into temporal clustering dynamics of storm surges and their implications for coastal hazard assessments.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press

Impact Statement

Understanding when and where coastal storm surges tend to cluster over time is essential for improving disaster preparedness and risk management. This study identifies global hotspots where multiple extreme storm surge events can occur within short timeframes, increasing the risk of compounded impacts on communities, infrastructure and ecosystems. The findings challenge the traditional assumption that such events happen randomly and independently, showing instead that certain regions face higher risks due to clustering. These insights can help stakeholders by incorporating temporal clustering into coastal risk assessments, which can lead to more effective and resilient coastal management strategies.

Introduction

Coastal flooding is among the most devastating natural hazards, causing substantial economic losses and human casualties worldwide. While such floods often result from the combination of multiple processes (e.g., tides, rainfall, wind-wave effects and storm surge), high storm surges are a key contributing factor. Here, we refer to storm surges as events where coastal sea levels are substantially higher than predicted as a response to changes in the mean sea-level pressure and wind-driven setup. Reliable estimates of the occurrences of these extreme events are essential for risk assessment, insurance companies and coastal communities. An emerging area of concern is the temporal clustering of apparently independent extreme storm surge events (i.e., when multiple events occur in quick succession rather than being randomly spaced over time). Temporal clustering may have important consequences for coastal infrastructure and ecosystems as recovery times between events are minimized, and cumulative effects compound the risk, especially when communities are still recovering from previous shocks (Zscheischler et al., Reference Zscheischler, Martius, Westra, Bevacqua, Raymond, Horton, van den Hurk, AghaKouchak, Jézéquel, Mahecha, Maraun, Ramos, Ridder, Thiery and Vignotto2020).

In September 2017, Hurricanes Irma and Maria impacted Puerto Rico within 2 weeks, causing severe winds, rainfall and storm surges. Similarly, the United Kingdom experienced a series of storms during the winter of 2013/2014, with an average recurrence of 2.5 days (Jenkins et al., Reference Jenkins, Haigh, Camus, Pender, Lamb, Trust and Kassem2022). More recently, in 2024, Hurricanes Helene and Milton impacted the Florida Gulf Coast in quick succession, mirroring previous cases such as Ian and Nicole (2023) and Charley and Jeanne (2004), where subsequent storms exacerbated damage in already affected areas. Figure 1 shows how the season of 2005 affected the US Gulf Coast with hurricanes and tropical cyclones and their associated storm surges. While not all clustered events are extreme, moderate storms can still cause significant damage, particularly when flood defenses and coastal protections are already weakened. Karunarathna et al. (Reference Karunarathna, Pender, Ranasinghe, Short and Reeve2014) found that beach erosion from clusters of moderate storms can resemble the erosion caused by a single extreme event. Similarly, Macamo et al. (Reference Macamo, Massuanganhe, Nicolau, Bandeira and Adams2016) observed that consecutive events can slow down the recovery process of mangrove forests, exacerbating the natural process of erosion and sedimentation in those areas.

Figure 1. Tropical storms and hurricanes that produced storm surges at the Apalachicola River tide gauge (red star), Florida, in 2005. The tracks of the tropical storms and hurricanes are shown in dashed lines with different colors. The inset plot indicates the inter-arrival times between these different storm surge events.

In some cases, consecutive events can have less detrimental long-term effects. For example, when tropical cyclones occur within a season and are not scattered throughout the year. This creates an extended calm period, giving coral reefs and other natural ecosystems more time to recover than they would under a more random distribution of events with the same rate of occurrence (Mumby et al., Reference Mumby, Vitolo and Stephenson2011; Jagger and Elsner, Reference Jagger and Elsner2012). This underscores the need to accurately predict both short-term clustering and longer recovery periods between storm clusters.

Despite extensive research on individual storm surge events, there is limited understanding of the temporal clustering patterns and their consequences. This recurrence of extreme events illustrates long-term problems with the miscommunication of risk to the public and decision-makers (Towe et al., Reference Towe, Tawn, Eastoe and Lamb2020). Traditionally, event frequency has been modeled using the Poisson distribution (Jagger and Elsner, Reference Jagger and Elsner2012), assuming independent and identically distributed events where event inter-arrival times are exponentially distributed. For example, the National Oceanic and Atmospheric Administration (NOAA) Experimental Long Lead Seasonal Hybrid Hurricane Forecast System, which uses a Poisson regression model based on Villarini et al. (Reference Villarini, Vecchi and Smith2010). However, historically observed clustering, including those already discussed, suggests a higher probability of shorter inter-arrival times than expected under an exponential distribution. This could lead to a systematic underprediction of both the number of events per season and the frequency of inactive seasons (Jagger and Elsner, Reference Jagger and Elsner2012). Storm surge events impact a wide range of natural ecosystems, including beaches, dunes, as well as public infrastructure and coastal communities. The recovery time of these systems varies significantly, ranging from months to decades, and is highly site-specific (Dollar and Tribble, Reference Dollar and Tribble1993; Morton et al., Reference Morton, Paine and Gibeaut1994; Hamideh et al., Reference Hamideh, Peacock and Van Zandt2018; de Ruiter et al., Reference de Ruiter, Couasnon, van den Homberg, Daniell, Gill and Ward2020). Back-to-back events can slow down these recovery times by exacerbating the impacts on these areas; therefore, the definition of a cluster is highly dependent on the system as well as the area of study and can substantially influence the results of the analysis.

Therefore, in this article, we present a comprehensive global analysis of storm surge clustering using a wide range of clustering definitions. We test the validity of the Poisson assumption under those definitions and assess how event intensity influences clustering patterns. Additionally, we analyze inter-arrival times, distinguishing between short-term clustering and calm periods (recovery time). The results provide critical insights into the temporal structure of storm surge events, contributing to improved risk assessment and coastal adaptation strategies.

The article is organized as follows: the second section describes the datasets and methodology. In the third section, we present the results, including model validation against tide gauge observations (“The role of clustering definition and validation of model hindcast data” section), the application of the methodology at the global scale (“Global analysis” section) and an analysis of inter-arrival times (“Inter-arrival time analysis” section). The findings are discussed in the fourth section, and conclusions are provided in the fifth section.

Data and methods

Data

To achieve a consistent spatial–temporal resolution of coastal sea level at the global scale, we use model-based sea-level data from the Coastal Dataset for the Evaluation of Climate Impact 2020 (CoDEC) (Muis et al., Reference Muis, Apecechea, Dullaart, de Lima Rego, Madsen, Su, Yan and Verlaan2020, available at the Copernicus Climate Change Service [2023]). Time series of sea level and tides are computed using the Deltares Global Tide and Surge Model (GTSMv3.0), a hydrodynamic model that dynamically simulates water levels at 10-min intervals using forcing fields from the ERA5 global reanalysis (Hersbach et al., Reference Hersbach, Bell, Berrisford, Hirahara, Horányi, Muñoz-Sabater, Nicolas, Peubey, Radu, Schepers, Simmons, Soci, Abdalla, Abellan, Balsamo, Bechtold, Biavati, Bidlot, Bonavita, De Chiara, Dahlgren, Dee, Diamantakis, Dragani, Flemming, Forbes, Fuentes, Geer, Haimberger, Healy, Hogan, Hólm, Janisková, Keeley, Laloyaux, Lopez, Lupu, Radnoti, de Rosnay, Rozum, Vamborg, Villaume and Thépaut2020) from 1979 to 2018. The dataset includes 18,719 grid points along a smoothed global coastline, with a spatial resolution of 10–50 km. The nontidal residual time series (henceforth referred to as storm surge) are derived by subtracting the tides from the sea-level data.

We assess the model’s ability to represent temporal clustering using in situ observations from the Global Extreme Sea Level Analysis database–version 3 (GESLA-3) (Woodworth et al., Reference Woodworth, Hunter, Marcos, Caldwell, Menéndez and Haigh2016; Haigh et al., Reference Haigh, Marcos, Talke, Woodworth, Hunter, Hague, Arns, Bradshaw and Thompson2022). A total of 527 coastal tide gauges from GESLA are used, those with at least 20 years of records between 1979 and 2018. To ensure consistency, gaps in the tide gauge storm surge data are mirrored in the CoDEC time series, and the mean sea-level variation is removed using a 30-day moving window, which leaves the seasonality due to atmospheric conditions (also captured by the hydrodynamic model used to produce the CoDEC data). The astronomical tides are calculated using the MATLAB U-Tide package (Codiga, Reference Codiga2025) on a year-by-year basis, for years with more than 70% of data availability. U-tide is applied using the least-squares method, white noise floor assumption for confidence intervals and an automated choice of constituents, resulting in 67 tidal constituents per year on average. Seasonal tidal components (i.e., solar annual (SA) and solar semi-annual (SSA)) were excluded from the harmonic analysis to preserve seasonality, which is not already removed by the 30-day moving average. The tidal analysis was performed on 369-day periods (12.5 lunar months) to consider a period close to multiples of the synodic periods of the short-term tidal constituents.

The independent storm surge events are identified following a declustering process set out in Martín et al. (Reference Martín, Wahl, Enriquez and Jane2024). This method identifies a site-specific standard event duration, which is used to isolate independent extreme events. The method accounts for the temporal and spatial variability of the storms. To decluster the time series, a threshold is required. By varying this threshold, we evaluate the sensitivity of our results to different levels of extremeness. Specifically, we used five return levels (1–5 years), determined by fitting independent events to a generalized Pareto distribution.

Methods

Usually, independent events are considered part of the same cluster if their inter-arrival time is shorter than a predefined time window. We refer to this time window as the “clustering window” hereinafter. To assess the sensitivity of results to different clustering window definitions, we use 14 different windows ranging from 15 days to 2 years, allowing us to explore the temporal structure of clustering within this period (exact values are: 15, 30, 45, 60, 75, 90, 120, 150, 180, 240, 300, 365, 545 and 730 days).

Another important factor is the severity of those consecutive events. Return levels are commonly used to quantify risk over a given period. For example, a 1-year return level represents an event expected to occur (or be exceeded) annually on average. However, clustering remains possible, leading to years with multiple exceedances above the 1-year return level threshold and others with none. To align with traditional risk threshold definitions, we use return levels to represent different intensity thresholds. Specifically, we apply five thresholds, corresponding to 1- to 5-year return levels, to determine the number of events included in the analysis (see Supplementary Figure S1).

Stations with significant clustering violate the Poisson assumption, indicating storm surge clustering that cannot be explained by a Poisson process. In contrast, stations with no significant clustering may still exhibit clustering patterns, but consistent with a Poisson distribution. We test whether storm surges follow a Poisson-distributed event frequency by using Ripley’s K metric (Ripley, Reference Ripley1981), which indicates the tendency for clustering within a time series. It is defined as the average number of events within a time window (i.e., our clustering window) around another event in the same time series. The higher the value, the more clustering occurs in that time series. We calculate this metric for all tide gauge stations and CoDEC grid points and assess statistical significance using bootstrap simulations, similar to the approach by Brunner and Stahl (Reference Brunner and Stahl2023). Specifically, we generate 1,000 homogeneous Poisson-distributed binary time series with the same rate of occurrence as the observed time series. Values of the Ripley’s K index, for the observed time series, above the 98th percentile of the index calculated for the 1,000 Poisson-distributed series are deemed to indicate statistically significant clustering.

Results

The role of clustering: Definition and validation of model hindcast data

We start by testing the sensitivity of clustering against different clustering definitions and levels of extremeness, as well as comparing results from model hindcasts to those derived from observations. Our results indicate that extreme storm surges cluster in time at most sites, for both tide gauge observations and modeled data. The level of clustering, however, varies with the threshold. Figure 2 compares the percentage of time series exhibiting temporal clustering (Figure 2a,b) with the percentage of time series where the clustering is deemed significant, that is, the clustering behavior deviates from the Poisson distribution (Figure 2c,d). The analysis is performed using observations from GESLA (Figure 2a,c) and CoDEC model points (Figure 2b,d). Validation includes 527 tide gauges and their corresponding CoDEC points, using 14 clustering window definitions and 5 different thresholds.

Figure 2. Identified clusters of storm surges at 527 coastal sites using varying thresholds and clustering windows. (a, b) Percentage of locations where clustering occurred based on GESLA (a) and CoDEC (b) data. (c, d) Percentage of locations where clustering behavior differs significantly from the assumption of a Poisson distribution for GESLA (c) and CoDEC (d) data.

Figure 2a,b show the percentage of stations (out of the 527 selected) that exhibit clustering at least once in the time series. More stations lack clusters at higher thresholds (going from 1 to 5-year return levels), which is to be expected due to fewer events exceeding these thresholds. As clustering windows lengthen, more events cluster, which is also to be expected, but the relationship is nonlinear. Particularly, for higher thresholds, less clustering occurs when using 200 days as the clustering window. When focusing on stations with significant clustering (Figure 2c,d), lower thresholds yield more significant clustering due to increased event counts. The number of stations with significant clustering decreases with increasing thresholds, but never falls below 20%, that is, at least 20% of stations exhibit significant clustering at any threshold. Peaks occur around 90- to 120-day windows, declining until 1-year windows, which indicates that there are certain windows when significant clustering is more likely.

The differences between general occurrences of clustering (i.e., where some clustering was observed) and occurrences of significant clustering (i.e., clustering is statistically significantly different compared to a Poisson distribution) (Figure 2ad, respectively) indicate the number of stations where clustering happens but in a way that aligns with a Poisson process. For example, at the 1-year return level, nearly all stations show clustering, and 70% (across all window definitions) exhibit statistically significant clustering. Meaning only ~30% of those time series exhibit clustering behavior that follows a Poisson distribution. For instance, at a specific station in Brest (France), we derive a K value of 1.7 from the observations (for the 1-year return level and 365-day clustering window), exceeding the 98th percentile of the Poisson-based bootstrap distribution (equivalent to a K value of 1.4), indicating statistically significant overdispersion (P ≈ 0.01). Similar patterns are observed at 95% of stations, where the likelihood of such clustering occurring under a Poisson process is generally below 1%.

Moreover, our results indicate that the CoDEC data (Figure 2b,d) is overall able to reproduce the temporal clustering of storm surges observed in GESLA (Figure 2a,c). These differences are spatially explored below. The same results as shown in Figure 2b,d, but obtained using all the model points included in the CoDEC dataset, can be found in Supplementary Figure S4, showing overall similar behavior.

Next, we examine how temporal clusters vary from one location to another. Figure 3 shows the percentage of clustering, defined as the number of clusters divided by the total number of events, for the 1 and 3-year return levels, again for GESLA (Figure 3a,c) and CoDEC (Figure 3b,d). We use a 90-day clustering window definition based on the peaks of significant clustering in Figure 2c,d.

Figure 3. Percentage of clustering (i.e., number of clusters divided by the number of events) when using a 90-day clustering window and a 1- (a, b) and 3-year (c, d) thresholds to identify storm surge events. Results are shown for GESLA (a, c) and CoDEC (b, d) data.

Higher clustering values, up to 70%, exist at the 1-year return level in regions like northern Europe and northern Australia. In other words, when a storm surge exceeds the given threshold in those regions, 70% of the time another event occurs within <90 days. A latitudinal difference is observed in Australia and Japan, with equatorial regions showing higher clustering values. For the 1-year return level (Figure 3a,b), stations without significant clustering (no black contour) are found in northern Japan and parts of the US East Coast. For the 3-year return level (Figure 3c,d), stations without significant clusters are distributed more evenly across coastlines. Results may vary depending on the threshold or clustering definition (see Supplementary Figure S5). Considering longer windows leads to a higher percentage of clustering, particularly at the lowest threshold, whereas at the highest threshold (5-year return level), results are more consistent across different clustering window definitions.

Results from CoDEC (Figure 3b,d) resemble those obtained from GESLA (Figure 3a,c), with mean (median) differences (i.e., absolute difference in the percentage of clusters) of 10% (7.5%) for the 1-year and 12% (10%) for the 3-year return level, and standard deviations between stations of 8 and 11%, respectively. More than half of the stations, 62% for the 1 year and 52% for the 3 years, show differences (in the percentage of clusters) below 10%. Some differences persist in the Mediterranean Sea and some parts of the Baltic Sea. However, the largest discrepancies appear to be randomly distributed, suggesting they are likely due to site-specific characteristics rather than a systematic bias. Such characteristics can include influences of freshwater discharge, which would be captured by tide gauges but not in the model data, or differences in the predicted tides (which are calculated using harmonic analysis for the GESLA data and predicted by a hydrodynamic model in the case of the CoDEC data). However, on a global scale, these differences are minor, and the spatial clustering patterns are consistent across both datasets.

Finally, the same analysis was repeated using skew surge data (calculated for all 527 stations, both for tide gauges and CoDEC time series), where skew surge represents the difference between an observed high water and the closest predicted high tide. This analysis is to ensure that our findings are robust against the storm surge definition and not substantially influenced by tide–surge interaction. Supplementary Figures S2 and S3 show that the results are consistent regardless of the definition.

Global analysis

After validating CoDEC against observations, we identify the temporal clusters of storm surges at the global scale, using all 18,719 CoDEC coastal model points. To be consistent, we show the results from defining clusters as those events occurring within a 90-day time window; results for other cluster definitions (time windows and thresholds) are shown in Supplementary Figure S5. On average, 39% of the 1-year return level events globally were part of a cluster (Figure 4a). This value drops to roughly 10% when calculated using the bootstrap method explained in “Methods” section (consisting of 1,000 Poisson time series with the same rate of occurrence as the observations). Storm surges around Oceania, the Baltic Sea, and the Caribbean Sea exhibit a higher tendency to cluster, where, in some cases, up to 70% of the events were part of a cluster. Overall, these clusters (defined by <90 days and a 1-year return level threshold) contain an average of 2.5 events, with some regions, such as the Baltic Sea and parts of Indonesia, reaching up to 4 events on average. However, at most stations (~66%), clusters typically consist of just two events. Similar to what was shown before, fewer temporal clusters occur when higher thresholds are considered. On average, 14% of the 5-year return level events were part of a cluster, while the average value obtained using a Poisson distribution would be 2.5%. As with the 1-year return level events, Oceania, the Baltic Sea, and the Caribbean Sea show a higher tendency for clustering with values up to 60% (Figure 4b). Overall, temporal clustering above the 1-year return level is significant at 92% of the model points, decreasing to 25% for the 5-year return level (see Supplementary Figure S6). Places with no significant clustering are mainly found in the northern part of Japan and the East coast of South America (mostly Argentina and Uruguay), where clustering values are below 20%.

Figure 4. Percentage of clustering, defined as the number of clusters divided by the number of events, for clusters of <90 days. For events above the 1- (a) and 5-year (b) return levels, for all CoDEC coastal grid points.

So far, we have examined how clustering varies across different return levels. Next, we examine the composition of these clusters, specifically the contribution of different return level events to clusters defined as ≤90 days apart. Since a single cluster can consist of events with different intensities (e.g., one exceeding the 3-year return level and another the 5-year return level), analyzing this composition helps to better characterize the nature of clustered events. To quantify the contribution of different event intensities to clustering, we use the cumulative distribution function of the percentage of times different events with different return periods were part of a cluster (Figure 5a) versus how often such events occurred without being part of a cluster (Figure 5b). For example, when two events cluster (inter-arrival time ≤ 90 days), at 50% of the coastal points where that happened, at least 50% of the clustered events fall within the 1- to 2-year return level range (blue line in Figure 5a). The remaining contributions come from events reaching the 2- to 3-year return level (15%), 3- to 4-year return level (7%), 4- to 5-year return level (4%) and ≥5-year return level (21%). This distribution reflects the average contribution of events above certain thresholds to clustering.

Figure 5. Cumulative distribution function of the contribution of different return levels to (a) clusters and (b) the contribution of events that are not part of clusters. The cluster definition used is ≤90 days, and colors indicate the intensity. Panel (c) shows the contribution of events below (purple) and above (green) a 5-year return level to cluster with another 5-year return level (or higher) within 90 days.

As expected, the lower return level range (1–2 years) contributes the most to clustering. As return levels increase, the curves move to the left, indicating decreasing contributions to clustering. The curve representing the 5-year return level is an exception, and this is because all events that reach or exceed that return level are included here, and not just the ones falling into a narrow band of two different return levels, as is the case for the other return levels analyzed here. Hence, this category shows a significant influence, ranking second in contribution after the 1- to 2-year return level range. Figure 5b presents the same analysis for nonclustered events (i.e., the number of times 1- to 2-year events, e.g., were not part of any cluster). The results closely resemble those in Figure 5a, indicating that there is no systematic trend in the relative contribution of events with certain intensities to clustering versus nonclustering.

As shown in Figure 5a, events exceeding a 5-year return level also contribute substantially to clustering. To further examine this, Figure 5c focuses on cases where at least one event within a cluster is at or above the 5-year return level. This analysis quantifies the percentage of times (for each grid point) that a ≥5-year return level event clusters with a smaller event (purple line) versus when it clusters with another event at or above the 5-year return level (green line). Results indicate that for at least 50% of the coastal points, events exceeding the 5-year return level always cluster with lower-threshold events. In the remaining 50% of the points, the contribution of high-intensity events, while not zero, remains low. On average, 87% of these clusters occur with a lower-threshold event, while only 13% involve another event at (or above) the 5-year return level.

Inter-arrival time analysis

Examining the inter-arrival times between events allows us to provide a spatial representation of the average time between events globally. Also, as suggested by numerous studies (Dollar and Tribble, Reference Dollar and Tribble1993; Merz et al., Reference Merz, Nguyen and Vorogushyn2016; Besio et al., Reference Besio, Briganti, Romano, Mentaschi and De Girolamo2017), the time between two consecutive clusters can be as important as the time between consecutive events; it depends on the type of impact one is concerned about. We can distinguish these two stages of clustering as short-term clustering and recovery time. Short-term clustering refers to periods where events occur in rapid succession, leaving little to no time for recovery between them. In contrast, recovery time represents the longer intervals between clusters rather than individual events. This phase provides a critical window for environments to regenerate, communities to rebuild and infrastructure to be restored before the next clustering period begins. We analyze both stages independently by splitting the inter-arrival time analysis into two. The stages are defined using 180 days as boundary (i.e., inter-arrival times ≤ 180 days become part of the short-term clustering, and we define recovery times as the periods between events when they are more than 180 days apart); the 180 days threshold was chosen based on a shift in the clustering behavior in Figure 2 around that value, which generally aligns with the typical duration of stormy seasons (e.g., North Atlantic hurricane season from June to November, or the stormy season in Europe from October to March). Figure 6 focuses on the short-term clustering, using the median (Figure 6a) and standard deviation (Figure 6b) of inter-arrival times shorter than 180 days, for events exceeding the 1-year return level. The shortest values occur around Europe and southern Asia, with localized minima in the Southern Adriatic Sea, Java (Indonesia) and northern Madagascar, where values fall below 1.5 days, commonly used as an independence window in storm surge studies. However, such short intervals represent <6% of the model points. Other notable regions, with short inter-arrival times, include the Caribbean islands and northern Australia, with values around 15 days and standard deviations in the order of 20 days. The average inter-arrival time across all coastal points is 26 days, with a standard deviation of 90 days. By contrast, estimating the average inter-arrival time using an exponential decay model with the same rate of occurrence (i.e., assuming a Poisson distribution) gives a value of 90 days, with minimal variation between stations.

Figure 6. Median (a) and standard deviation (b) of the inter-arrival times shorter than 180 days for all CoDEC coastal grid points and events above the 1-year return level.

Figure 7 examines the time periods without clusters (in years; Figure 7a) and the recovery times (inter-arrival times >180 days; Figure 7b). Figure 7a shows the number of years without events, that is, inter-arrival times longer than >360 days (twice the threshold we use to separate short-term clusters); it is noteworthy that years do not need to be consecutive (i.e., a 360-day period with events can occur between two 360-day periods without events). Overall, values show little variability, averaging 13 years (± 2.5 years) without events, over the 40-year record. Only 17% of points exceed 15 years, and just 0.15% reach 20 years. The lowest values are found in the Black Sea, where values drop below 5 years. Figure 7b shows the median recovery time, which for 91% of the stations is <2 years. The average time between events is 1.5 years, with higher values where clustering occurs more often (Baltic Sea and the Sunda Islands). This indicates the time window that communities and natural ecosystems have, on average, to recover.

Figure 7. Number of years without any events above the 1-year return level (a) and median time between events when they occur >180 days apart from each other (which we define as recovery time).

Discussion

This study presents a global analysis of the temporal clustering of storm surges. We identified the temporal clusters using tide gauge observations, which are spatially sparse and discontinuous but provide accurate in situ sea-level information. We also use a global reanalysis dataset, which allows us to identify the temporal clusters of storm surges everywhere along the global coasts.

The CoDEC model validation shows strong agreement with tide gauges from GESLA. We hypothesize that discrepancies arise primarily in regions where external factors, such as freshwater discharge or tidal differences, influence surge behavior. Additionally, uncertainties may stem from the declustering process or limitations in model-based time series, but these remain within reasonable margins for a global-scale study.

The validation was also extended to different definitions of clustering, which can significantly influence the results. The selection criteria for specific clustering definitions are largely dependent on the local characteristics of the place of study (de Ruiter et al., Reference de Ruiter, Couasnon, van den Homberg, Daniell, Gill and Ward2020); therefore, given the global scope of this analysis, selecting a unique fixed window was not appropriate. Instead, we selected a range of clustering windows supported by literature and assessed how clustering behavior varies across different definitions (following other studies such as Santos et al., Reference Santos, Haigh and Wahl2017; Jenkins et al., Reference Jenkins, Haigh, Camus, Pender, Lamb, Trust and Kassem2022; Brunner and Stahl, Reference Brunner and Stahl2023). Moreover, the results depend on the type of events considered. While extreme events have been extensively studied due to their high impact, moderate events are often overlooked. However, they play a key role in clustering dynamics, as their short recurrence intervals amplify cumulative impacts. The inclusion of these events is essential for a comprehensive understanding of storm surge clustering and its implications (Towe et al., Reference Towe, Tawn, Eastoe and Lamb2020). Accordingly, our analysis focuses on a range of low to moderate return levels (1- to 5-year return levels) while still accounting for the most extreme events, as those exceeding the 5-year return levels are also included. Results indicate (Figure 2) that the cluster frequency decreases as event intensity increases, with clustering behavior strongly influenced by the time window used for defining clusters.

Storm surge event frequency has traditionally been modeled using the Poisson distribution. However, the presence of clustering challenges the suitability of this assumption. To assess the Poisson assumption, we compare the observed clustering (Figure 2a,b) to the clustering expected under a Poisson process with the same occurrence rate (Figure 2c,d). While the Poisson distribution performs better at higher thresholds, it fails to adequately represent clustering in at least 20% of stations, regardless of the chosen time window. This discrepancy increases to over 80% as the threshold decreases and more events are included.

Additionally, we observe a sharp decline in significant clustering between 120 days and 1 year, suggesting a shift in clustering behavior. Beyond 1 year, the clustering stabilizes; this could be linked to the fact that when considering a 1-year window (or longer), we capture two or more storm seasons in certain areas, resulting in a more consistent number of events within the clustering window. Shorter clustering windows exhibit greater variability, with peaks in significant clustering around 90–120 days, indicating that clustering dynamics are highly sensitive to the chosen definition. Overall, the results highlight the need for alternative approaches to model event frequencies, particularly when moderate events are considered. These events can be as impactful as extreme ones when the recovery time between them is insufficient (Karunarathna et al., Reference Karunarathna, Pender, Ranasinghe, Short and Reeve2014).

Importantly, while the 1-year return level defines the minimum event intensity included, clusters can involve higher-threshold events, which can create greater damage. We quantify the contribution of different return levels to these clusters (Figure 5). Lower-threshold events (1-year return level) show the highest contribution to clustering, and contributions decline while increasing the return levels, except for the 5-year return level, which encompasses all higher events and thus increases again. These results indicate that extreme events make a considerable contribution to clustering. While this study primarily focuses on low to moderate return levels, the clustering of more extreme events remains crucial. We perform a detailed assessment of high-threshold clustering, imposing an additional criterion requiring at least one event at or above the 5-year return level. The contribution is further divided into events above or below the 5-year return level that cluster with another event exceeding this threshold. Our results indicate that ~63% of the stations experienced two events (at or above a 5-year return level) within 90 days at least once over the study period (i.e., 63% of the stations experience at least one cluster formed by two events at or above a 5-year return level). On those stations, on average, this is the composition of 22% of the clusters (<90 days). Overall, the lowest return levels contribute the most to clustering, with very little contribution from other high return level events clustering together (within 90 days).

The widespread presence of clustering across nearly all stations suggests that assuming the frequency of storm surge events follows a Poisson distribution may not be appropriate. Instead, alternative models should be considered to account for clustering. For instance, Jagger and Elsner (Reference Jagger and Elsner2012) proposed a statistical model based on the Poisson distribution with an extra parameter to account for clustering. The main idea is that while events can cluster, each cluster remains independent of the others and follows a Poisson distribution. The statistical implications are the existence of two important “windows” in the event frequency distribution: (1) the clustering size or short-term clustering, and (2) the recovery time, which follows an exponential decay.

We present a quantitative approach to this concept, studying the inter-arrival time distributions. Short-term clustering (≤180 days, Figure 6) reveals high event recurrence, with a median inter-arrival time of 26 days and notable regional variability (standard deviation: 90 days). The lowest values are found in Europe, the Philippines, and the Caribbean islands. Clustering implies, among other things, the existence of calm periods in between. This recovery time has been extensively studied, compared to short-term clusters, particularly in natural environments, due to its critical role in the ecosystem growth and recovery from extreme events (Mumby et al., Reference Mumby, Vitolo and Stephenson2011; Karunarathna et al., Reference Karunarathna, Pender, Ranasinghe, Short and Reeve2014; Yao et al., Reference Yao, Fu, Liu, Li, Wang, Zhan, Wang and Gao2022). For these longer inter-arrival times (>180 days, Figure 7), we find that, on average, CoDEC stations experience calm periods of ~1.5 years between clusters. The fact that the average recovery time is longer than a year means there are some years that do not experience any event; on average, those represent 32% of the study period (i.e., 13 out of 40 years exhibiting no events).

While our study focuses on identifying where storm surge clustering occurs, understanding the underlying mechanism of why this happens remains an open question. Most studies aiming to understand the mechanisms behind storm clustering have focused on cyclone genesis and track behavior. On daily timescales, clustering is often linked to the development of cyclone families or secondary cyclogenesis, described by Bjerknes and Solberg, (Reference Bjerknes and Solberg1922). In these cases, storms are not independent; rather, secondary cyclones form along the frontal boundaries of preexisting primary cyclones, resulting in successive storm events. However, not all storm clusters can be explained by the cyclone family mechanism. As noted by Dacre and Pinto (Reference Dacre and Pinto2020), storms may also cluster purely by chance. Even when cyclone occurrences follow a random process (e.g., Poisson), some clusters are statistically expected to occur.

On longer timescales, storm clustering can be modulated by large-scale modes of climate variability. Many studies have used these teleconnection patterns to estimate the likelihood of active or inactive storm seasons, often using regression models (e.g., see Mailier et al., Reference Mailier, Stephenson, Ferro and Hodges2006; Vitolo et al., Reference Vitolo, Stephenson, Cook and Mitchell-Wallace2009; Economou et al., Reference Economou, Stephenson, Pinto, Shaffrey and Zappa2015). However, despite growing interest, much of our current understanding of serial cyclone clustering remains centered on the North Atlantic and European sectors (Dacre and Pinto, Reference Dacre and Pinto2020; Xi and Lin, Reference Xi and Lin2021; Xi et al., Reference Xi, Lin and Gori2023).

More recently, research has begun to explore storm clustering from a storm surge perspective, linking successive storm surge events to large-scale ocean–atmosphere interactions (Jenkins et al. Reference Jenkins, Haigh, Camus, Pender, Lamb, Trust and Kassem2022). In the United Kingdom, several studies have associated consecutive storms with the North Atlantic Oscillation (NAO) and the West European Pressure Anomaly (e.g., see Woodworth et al., Reference Woodworth, Flather, Williams, Wakelin and Jevrejeva2007; Castelle et al., Reference Castelle, Dodet, Masselink and Scott2017; Santos et al., Reference Santos, Haigh and Wahl2017). Other regions also show linkages between storm clustering and climate drivers. For instance, clustering near Florida has been connected to the NAO and the Southern Oscillation Index (e.g., see Jagger and Elsner, Reference Jagger and Elsner2012), while in Madagascar, storm activity has been linked to El Niño Southern Oscillation and the Madden–Julian Oscillation (e.g., see Nash et al., Reference Nash, Pribyl, Klein, Endfield, Kniveton and Adamson2015; Macron et al., Reference Macron, Richard, Garot, Bessafi, Pohl, Ratiarison and Razafindrabe2016). Nonetheless, the interaction between synoptic-scale processes and large-scale climate modes remains an open area of research, requiring further investigation across different regions and storm types.

Finally, despite ensuring the highest possible accuracy in our analysis, certain limitations must be acknowledged. The use of observational data introduces challenges, such as outliers, gaps and inconsistencies, which were addressed as much as possible through extensive preprocessing. Similarly, model-derived data can present validation issues, particularly in regions lacking in situ observations. However, our validation efforts focus not on the full validation of the CoDEC model itself, but specifically on its ability to reproduce clustering behavior. Another source of discrepancies are the return level estimations. The methodology used to estimate return levels is consistent across datasets; however, some differences may result from the declustering procedure. However, the declustering method has been previously validated and shown to be robust and applicable to both tide gauge time series and CoDEC data. Minor discrepancies may also stem from magnitude biases in CoDEC, although a general inspection indicates that the model captures event magnitudes reasonably well (here, the exact magnitude is less relevant as long as peaks exceed our thresholds of interest at the same time the tide gauges recorded a storm surge event). Finally, during the validation, the presence of gaps in the observational records (which were matched in CoDEC for consistency) may result in missing events and slightly lower cluster counts (in both datasets). However, this issue is mitigated in the global analysis, where CoDEC’s complete and continuous time series are used. All the methodological choices made were to provide a reliable framework for assessing storm surge clustering globally, highlighting the need for regional-scale analyses to further understand the mechanisms driving storm surge clustering and its implementation.

Conclusion

Storm clustering remains an underrepresented factor in coastal flood risk assessments despite its significant impact on coastal communities and ecosystems. How communities and natural ecosystems are prepared for back-to-back events will determine their ability to recover. Properly understanding the probability of these consecutive events and better characterizing their frequency is the first step to enhancing the resiliency of coastal communities to repeated shocks.

While storm clustering is inherently site-specific, this study provides a global perspective by analyzing different clustering definitions, event intensities and their spatial distributions. Moreover, we evaluate the applicability of the commonly used Poisson distribution and show that for relatively short return periods, this assumption is not satisfied at the vast majority of locations. Finally, we examine two important clustering definitions, short- and long-term clustering, which provide useful information about the recovery time between storms for each station. On average, the recovery time is limited to <2 years for 90% of the sites. These results underscore the importance of considering moderate events in clustering analyses, challenging the adequacy of Poisson-based assumptions and highlighting the need for alternative frequency models that account for event dependencies. Such models could then be used, among others, to generate boundary conditions for coastal impact models (such as the Dynamic Interactive Vulnerability Assessment model (Hinkel and Klein, Reference Hinkel and Klein2009) and the Coastal Impact and Adaptation Model (Diaz, Reference Diaz2016), which currently lack this information. This also includes integrating storm surge data with tidal information, an aspect not explicitly addressed in our study but essential for advancing toward comprehensive hazard and risk assessments. Finally, the adaptability of this framework provides a useful basis for extending the analysis to other types of hazards, enabling the inclusion of spatiotemporal patterns of consecutive events in broader risk assessments.

Open peer review

To view the open peer review materials for this article, please visit http://doi.org/10.1017/cft.2025.10008.

Supplementary material

The supplementary material for this article can be found at http://doi.org/10.1017/cft.2025.10008.

Data availability statement

All the data are publicly available. CoDEC data are available at the Climate Data Store (CDS) of the Copernicus Climate Change Service (C3S) at the following url: https://cds.climate.copernicus.eu/. GESLA-3 dataset can be downloaded at https://gesla.org/. The data used in the study will be available and can be downloaded from a Zenodo data repository (10.5281/zenodo.15097911). We provide the declustered time series for the 18,719 CoDEC coastal points for the 5 return levels used in the study and the list of the tide gauges used for validation.

Acknowledgments

A.R.E. acknowledges support by the European Union Horizon 2020 EXCELLENT SCIENCE – Marie Skłodowska-Curie Actions (Grant number: 101019470).

Author contribution

Funding acquisition: T.W. Data curation: A.M. Formal analysis: A.M. Methodology: A.M. and R.J. Project administration: T.W. Supervision: T.W. Writing original draft: A.M. and A.R.E. All authors approved the final submitted draft.

Financial support

This work was supported by the United States National Science Foundation (Grant number: 2141461).

Competing interests

The authors declare none.

References

Besio, G, Briganti, R, Romano, A, Mentaschi, L and De Girolamo, P (2017) Time clustering of wave storms in the Mediterranean Sea. Natural Hazards and Earth System Sciences 17(3), 505514. https://doi.org/10.5194/nhess-17-505-2017.Google Scholar
Bjerknes, J and Solberg, H (1922) Life cycle of cyclones and the polar front theory of atmospheric circulation. Geofyiske Publikasjoner 3, 468473.Google Scholar
Brunner, MI and Stahl, K (2023) Temporal hydrological drought clustering varies with climate and land-surface processes. Environmental Research Letters 18(3). https://doi.org/10.1088/1748-9326/acb8ca.Google Scholar
Castelle, B, Dodet, G, Masselink, G and Scott, T (2017) A new climate index controlling winter wave activity along the Atlantic coast of Europe: The West Europe Pressure Anomaly. Geophysical Research Letters 44(3), 13841392. https://doi.org/10.1002/2016GL072379;SUBPAGE:STRING:FULL.Google Scholar
Codiga, D (2025) UTide Unified Tidal Analysis and Prediction Functions. MATLAB Central File Exchange. https://www.mathworks.com/matlabcentral/fileexchange/46523-utide-unified-tidal-analysis-and-prediction-functions Retrieved August 12, 2025Google Scholar
Dacre, HF, Pinto, JG (2020) Serial clustering of extratropical cyclones: A review of where, when and why it occurs. npj Climate and Atmospheric Science 3(1). https://doi.org/10.1038/s41612-020-00152-9.Google Scholar
de Ruiter, MC, Couasnon, A, van den Homberg, MJC, Daniell, JE, Gill, JC, Ward, PJ (2020) Why we can no longer ignore consecutive disasters. Earth’s Future 8(3). https://doi.org/10.1029/2019EF001425.Google Scholar
Diaz, DB (2016) Estimating global damages from sea level rise with the coastal impact and adaptation model (CIAM). Climatic Change 137(1–2), 143156. https://doi.org/10.1007/S10584-016-1675-4/FIGURES/3.Google Scholar
Dollar, SJ and Tribble, GW (1993) Recurrent storm disturbance and recovery: A long-term study of coral communities in Hawaii*. U.S. Geological Survey 12(415).Google Scholar
Economou, T, Stephenson, DB, Pinto, JG, Shaffrey, LC and Zappa, G (2015) Serial clustering of extratropical cyclones in a multi-model ensemble of historical and future simulations. Quarterly Journal of the Royal Meteorological Society 141(693), 30763087. https://doi.org/10.1002/qj.2591.Google Scholar
Haigh, ID, Marcos, M, Talke, SA, Woodworth, PL, Hunter, JR, Hague, BS, Arns, A, Bradshaw, E and Thompson, P (2022) GESLA version 3: A major update to the global higher-frequency sea-level dataset. Geoscience Data Journal 10, 122. https://rmets.onlinelibrary.wiley.com/doi/10.1002/gdj3.174.Google Scholar
Hamideh, S, Peacock, WG and Van Zandt, S (2018) Housing recovery after disasters: Primary versus seasonal and vacation housing markets in coastal communities. Natural Hazards Review 19(2). https://doi.org/10.1061/(asce)nh.1527-6996.0000287.Google Scholar
Hersbach, H, Bell, B, Berrisford, P, Hirahara, S, Horányi, A, Muñoz-Sabater, J, Nicolas, J, Peubey, C, Radu, R, Schepers, D, Simmons, A, Soci, C, Abdalla, S, Abellan, X, Balsamo, G, Bechtold, P, Biavati, G, Bidlot, J, Bonavita, M, De Chiara, G, Dahlgren, P, Dee, D, Diamantakis, M, Dragani, R, Flemming, J, Forbes, R, Fuentes, M, Geer, A, Haimberger, L, Healy, S, Hogan, R J., Hólm, E, Janisková, M, Keeley, S, Laloyaux, P, Lopez, P, Lupu, C, Radnoti, G, de Rosnay, P, Rozum, I, Vamborg, F, Villaume, S, Thépaut, J (2020) The ERA5 global reanalysis. Quarterly Journal of the Royal Meteorological Society 146(730), 19992049. https://doi.org/10.1002/qj.3803.Google Scholar
Hinkel, J and Klein, RJT (2009) Integrating knowledge to assess coastal vulnerability to sea-level rise: The development of the DIVA tool. Global Environmental Change 19(3), 384395. https://doi.org/10.1016/J.GLOENVCHA.2009.03.002.Google Scholar
Jagger, TH and Elsner, JB (2012) Hurricane clusters in the vicinity of Florida. Journal of Applied Meteorology and Climatology 51(5), 869877. https://doi.org/10.1175/JAMC-D-11-0107.1.Google Scholar
Jenkins, LJ, Haigh, I, Camus, P, Pender, D, Lamb, R, Trust, J, Kassem, H (2022) The Temporal Clustering of Storm Surge, Wave Height, and High Sea Level Exceedances around the UK Coastline. Available at https://doi.org/10.21203/rs.3.rs-1412525/v1.Google Scholar
Karunarathna, H, Pender, D, Ranasinghe, R, Short, AD and Reeve, DE (2014) The effects of storm clustering on beach profile variability. Marine Geology 348, 103112. https://doi.org/10.1016/J.MARGEO.2013.12.007.Google Scholar
Macamo, CCF, Massuanganhe, E, Nicolau, DK, Bandeira, SO and Adams, JB (2016) Mangrove’s response to cyclone Eline (2000): What is happening 14 years later. Aquatic Botany 134, 1017. https://doi.org/10.1016/j.aquabot.2016.05.004.Google Scholar
Macron, C, Richard, Y, Garot, T, Bessafi, M, Pohl, B, Ratiarison, A and Razafindrabe, A (2016) Intraseasonal rainfall variability over Madagascar. Monthly Weather Review 144(5), 18771885. https://doi.org/10.1175/MWR-D-15-0077.1.Google Scholar
Mailier, PJ, Stephenson, DB, Ferro, CAT, Hodges, KI (2006) Serial Clustering of Extratropical Cyclones. Available at http://www.met.fu-berlin.de/wetterpate/Google Scholar
Martín, A, Wahl, T, Enriquez, AR and Jane, R (2024) Storm surge time series de-clustering using correlation analysis. Weather and Climate Extremes 45. https://doi.org/10.1016/j.wace.2024.100701.Google Scholar
Merz, B, Nguyen, VD and Vorogushyn, S (2016) Temporal clustering of floods in Germany: Do flood-rich and flood-poor periods exist? Journal of Hydrology 541, 824838. https://doi.org/10.1016/J.JHYDROL.2016.07.041.Google Scholar
Morton, R. A., Paine, J. G., Gibeaut, J. C. (1994). Stages and durations of post-storm beach recovery, southeastern Texas coast, U.S.A. Journal of Coastal Research 10(4).Google Scholar
Muis, S, Apecechea, MI, Dullaart, J, de Lima Rego, J, Madsen, KS, Su, J, Yan, K and Verlaan, M (2020) A high-resolution global dataset of extreme sea levels, tides, and storm surges, including future projections. Frontiers in Marine Science 7, 263. https://doi.org/10.3389/FMARS.2020.00263/XML/NLM.Google Scholar
Mumby, PJ, Vitolo, R and Stephenson, DB (2011) Temporal clustering of tropical cyclones and its ecosystem impacts. Proceedings of the National Academy of Sciences of the United States of America 108(43), 1762617630. https://doi.org/10.1073/PNAS.1100436108.Google Scholar
Nash, DJ, Pribyl, K, Klein, J, Endfield, GH, Kniveton, DR and Adamson, GCD (2015) Tropical cyclone activity over Madagascar during the late nineteenth century. International Journal of Climatology 35(11), 32493261. https://doi.org/10.1002/JOC.4204;PAGE:STRING:ARTICLE/CHAPTER.Google Scholar
Ripley, BD (1981) Spatial statistics. John Wiley & Sons, New York, 252. https://doi.org/10.1002/0471725218Google Scholar
Santos, VM, Haigh, ID and Wahl, T (2017) Spatial and temporal clustering analysis of extreme wave events around the UK coastline. Journal of Marine Science and Engineering 5(3). https://doi.org/10.3390/jmse5030028.Google Scholar
Towe, R, Tawn, J, Eastoe, E and Lamb, R (2020) Modelling the clustering of extreme events for short-term risk assessment. Journal of Agricultural, Biological, and Environmental Statistics 25(1), 3253. https://doi.org/10.1007/S13253-019-00376-0/FIGURES/10.Google Scholar
Villarini, G, Vecchi, GA and Smith, JA (2010) Modeling the dependence of tropical storm counts in the North Atlantic basin on climate indices. Monthly Weather Review 138(7), 26812705. https://doi.org/10.1175/2010MWR3315.1.Google Scholar
Vitolo, R, Stephenson, DB, Cook, LM and Mitchell-Wallace, K (2009) Serial clustering of intense European storms. Meteorologische Zeitschrift 18(4), 411424. https://doi.org/10.1127/0941-2948/2009/0393.Google Scholar
Woodworth, PL, Flather, RA, Williams, JA, Wakelin, SL and Jevrejeva, S (2007) The dependence of UK extreme sea levels and storm surges on the North Atlantic oscillation. Continental Shelf Research 27(7), 935946. https://doi.org/10.1016/J.CSR.2006.12.007.Google Scholar
Woodworth, PL, Hunter, JR, Marcos, M, Caldwell, P, Menéndez, M and Haigh, I (2016) Towards a global higher-frequency sea level dataset. Geoscience Data Journal 3(2), 5059. https://doi.org/10.1002/GDJ3.42.Google Scholar
Xi, D and Lin, N (2021) Sequential landfall of tropical cyclones in the United States: From historical records to climate projections. Geophysical Research Letters 48(21). https://doi.org/10.1029/2021GL094826.Google Scholar
Xi, D, Lin, N, Gori, A (2023) Increasing sequential tropical cyclone hazards along the US East and Gulf coasts. Nature Climate Change 13(3), 258265. https://doi.org/10.1038/s41558-023-01595-7.Google Scholar
Yao, Y, Fu, B, Liu, Y, Li, Y, Wang, S, Zhan, T, Wang, Y and Gao, D (2022) Evaluation of ecosystem resilience to drought based on drought intensity and recovery time. Agricultural and Forest Meteorology 314. https://doi.org/10.1016/j.agrformet.2022.108809.Google Scholar
Zscheischler, J, Martius, O, Westra, S, Bevacqua, E, Raymond, C, Horton, RM, van den Hurk, B, AghaKouchak, A, Jézéquel, A, Mahecha, MD, Maraun, D, Ramos, AM, Ridder, NN, Thiery, W, Vignotto, E (2020) A typology of compound weather and climate events. Nature Reviews Earth & Environment 1(7), 333347. https://doi.org/10.1038/s43017-020-0060-z.Google Scholar
Figure 0

Figure 1. Tropical storms and hurricanes that produced storm surges at the Apalachicola River tide gauge (red star), Florida, in 2005. The tracks of the tropical storms and hurricanes are shown in dashed lines with different colors. The inset plot indicates the inter-arrival times between these different storm surge events.

Figure 1

Figure 2. Identified clusters of storm surges at 527 coastal sites using varying thresholds and clustering windows. (a, b) Percentage of locations where clustering occurred based on GESLA (a) and CoDEC (b) data. (c, d) Percentage of locations where clustering behavior differs significantly from the assumption of a Poisson distribution for GESLA (c) and CoDEC (d) data.

Figure 2

Figure 3. Percentage of clustering (i.e., number of clusters divided by the number of events) when using a 90-day clustering window and a 1- (a, b) and 3-year (c, d) thresholds to identify storm surge events. Results are shown for GESLA (a, c) and CoDEC (b, d) data.

Figure 3

Figure 4. Percentage of clustering, defined as the number of clusters divided by the number of events, for clusters of <90 days. For events above the 1- (a) and 5-year (b) return levels, for all CoDEC coastal grid points.

Figure 4

Figure 5. Cumulative distribution function of the contribution of different return levels to (a) clusters and (b) the contribution of events that are not part of clusters. The cluster definition used is ≤90 days, and colors indicate the intensity. Panel (c) shows the contribution of events below (purple) and above (green) a 5-year return level to cluster with another 5-year return level (or higher) within 90 days.

Figure 5

Figure 6. Median (a) and standard deviation (b) of the inter-arrival times shorter than 180 days for all CoDEC coastal grid points and events above the 1-year return level.

Figure 6

Figure 7. Number of years without any events above the 1-year return level (a) and median time between events when they occur >180 days apart from each other (which we define as recovery time).

Supplementary material: File

Martín et al. supplementary material

Martín et al. supplementary material
Download Martín et al. supplementary material(File)
File 2.6 MB

Author comment: Global analysis of temporal clusters of storm surges — R0/PR1

Comments

Dear Editor,

I am sharing with you a manuscript entitled “Global Analysis of Temporal Clusters of Storm Surges” to be considered for publication in Cambridge Prisms: Coastal Futures.

This paper presents a global analysis of the temporal clustering of storm surges, identifying hotspots of clustering for different definitions of clustering. The paper also tests the validity of the commonly used Poisson distribution to represent the observed clustering. The analysis is performed on model hindcast data from the Coastal Dataset for the Evaluation of Climate Impact 2020 (CoDEC), after validating it using in-situ observations from the GESLA 3.0 dataset. The results also contain the analysis of the inter-arrival times divided into two regimes: short-term clustering and long-term clustering (indicative of recovery times); as well as the analysis of the contribution of events with different magnitudes to clustering. The results from the analysis are made available through a Zenodo repository.

I certify that the submitted paper has not been published previously, that its publication is approved by all authors, and that, if accepted, it will not be published elsewhere. We look forward to hearing back from you and hope that our paper is of interest to Cambridge Prisms: Coastal Futures.

Best regards,

Ariadna Martín.

Review: Global analysis of temporal clusters of storm surges — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

The paper, ‘Global Analysis of Temporal Clusters of Storm Surges’ by Martin et al. assesses the likelihood that extreme storm surge occur in quick succession using both global tide gauge and model datasets. Overall, I think the idea of the paper is novel and an important research topic, however several revisions are required to strengthen and justify the findings.

Main comments

Focus/scope of paper: The abstract makes it appear as if testing whether it is Poisson distributed is the main objective, whereas the title and most of the manuscript makes it sound more general: to study clustering of storm surge events and the implications of this for hazard modelling. I think the paper as it currently stands would be stronger (and require fewer revisions) if it is the latter, so I am going to base my subsequent feedback on this.

Storm surge terminology: e.g., “Among the spectrum of coastal hazards, storm surges…” Ultimately the hazard is flooding or erosion (refer next comment about erosion). At the very least you need to replace references to “storm surges” to be “extreme storm surges” or “extreme positive storm surges” as the way this paper (and most others) define storm surge they have mean zero and can be positive or negative. By only considering storm surges with return periods of between 1 and 5 years this paper only considers large positive storm surges. I think it is beyond the scope of the paper to consider in detail, but it could be worth mentioning how the ability for storm surges to cause flooding are modulated by tides and this is a limitation of the study’s relevance for flood risk assessments (e.g., Enriquez et al. 2022: https://doi.org/10.1029/2021JC018157, Hague et al. 2023: https://doi.org/10.1029/2023EF003784, Palmer et al. 2024: https://doi.org/10.1029/2023JC020737 ).

Erosion and scope of paper: e.g., “moderate storms can resemble the erosion caused by a single extreme event” and “ Storm surge events impact a wide range of natural ecosystems, including beaches, dunes, as well as public infrastructure and coastal communities”. The data used in this paper cannot be used to infer erosion hazards as GESLA and CODEC do not include the impacts of wave setup or runup on coastal water levels. To resolve this, discussion of erosion and open coast (e.g., beaches) impacts need to be removed, or alternative data sources that include waves need to be considered. Ideally alternative data sources would be the solution as I suspect cumulative impacts of erosion are in important part of the impacts of clustering, but I understand this is a lot of additional work so it is sufficient to limit the scope to still water level or storm-tide.

Tide-surge independence: Ultimately the delineation of what part of the water level is considered “surge” and what is considered “tide” is subjective (e.g., see Hague & Taylor 2021: https://doi.org/10.1007/s11069-021-04600-4). This follows from tides being defined via harmonic analysis which is fitting sums of sines to observations as a statistical (i.e., non-physical) approximation of (un-observable) physical processes (more on this later). As storm surges are defined as “not tide”, this means that storm surge is also impacted by how tides are defined. There has been a lot of work done on the interactions between storm surges and tides and other water level components (e.g., Arns et al. 2020: https://doi.org/10.1038/s41467-020-15752-5, Horsburgh & Wilson 2007: https://doi.org/10.1029/2006JC004033, Moftakhari et al. 2024: https://doi.org/10.1029/2023AV000996, Idier et al. 2019: https://doi.org/10.1007/s10712-019-09549-5). This has led to the promotion of skew surges as an alternative to the non-tidal residual because it is largely independent of tides – see Williams et al. 2016 https://doi.org/10.1002/2016GL069522. It would be worth repeating at least part of the analysis using a skew surge metric to assess the sensitivity of the results to different metrics for the physical process of extreme storm surges. This could verify that the findings are not simply due to the choice to use the non-tidal residuals as the metric for storm surge – the distribution of which is likely modified by tide-surge interactions. This could be something like what is described in the first paragraph of Section 2.2 – I found this approach to sensitivity testing very good!

Year-by-year analysis: Performing harmonic analysis on a year-by-year basis without SA and SSA means that seasonal variations in tidal heights due to the seasonal cycle of mean sea level are included as part of the “storm surge” component of the water level rather than the tides. There isn’t anything wrong with this per se but to facilitate fair comparisons with CODEC, it would be best to handle the seasonality in a consistent way for both gauge and model estimates of storm surges. Please include some discussion on this.

More details on comparison between GESLA and CODEC: I liked sentence: “ However, on a global scale, these differences are minor, and the spatial clustering patterns are consistent across both datasets.” It would be great to see this statement justified with some data (e.g., RMSE or percentage error) to quantify that the differences are “minor”.

Context for statistics: I struggle to interpret the statistic “14% of the 5-year return level events were part of a cluster” and others in Section 3.2. Am I meant to be impressed by how high, low or expected that number is? It says in methods that Ripley’s K metric is used. Can an assessment using Ripley’s K be provided to give an indication of the notability (and sign e.g., higher or lower than expected) of such results? Similarly, for “the average interarrival time across all coastal points is 26 days” in Section 3.3.

Riverine discharge in GESLA: The discussion includes: “We hypothesize that discrepancies arise primarily in regions where external factors, such as freshwater discharge” This could be tested using the GESLA dataset which classifies sites as “coastal”, “river” or “lake”. For example, does CODEC perform worse when compared “river” gauges than “coastal” gauges?

Implications for coastal hazard assessment: The abstract states that “These findings offer insights into temporal clustering dynamics of storm surges and their implications for coastal hazard assessments.” More details are required on the implications for coastal hazard assessments. For example, what do coastal hazard assessment methods assume that is invalidated by the findings? How different are results of risk assessments if clustering is considered? It could be good to link the coastal adaptation literature which often details how risk assessment information is used (e.g., Haasnoot et al. 2024: https://doi.org/10.1016/j.gloenvcha.2024.102907).

Minor comments:

Last Paragraph Page 11: “randomly distributed” – specify the distribution as lots of possible ways for random variables to be distributed.

Last Paragraph Page 10: I don’t the repetition of the methods is required here, could rephrase as “…challenges the suitability of this assumption. We compare the observed clustered…”

Review: Global analysis of temporal clusters of storm surges — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

Summary

This paper examines the temporal distribution of storm surge events, an important topic and one that deserves greater attention. Overall, this paper presents compelling evidence of spatial and temporal patterns in event timing which indicate that commonly adopted methods (e.g., random distribution) may be unsuitable for coastal planning needs. The clustering and timing of events is critical for coastal hazard assessment because the amount of time between arrival times of individual storm events may be more consequential than the magnitude of any one of those events. This is important both for shorter than average inter-arrival times, where the impact of an event compounds the impacts of a recent event (or events), and for longer than average inter-arrival times, where greater potential of recovery increases the resilience to future events. I congratulate the authors on the in-depth and comprehensive approach to examining clustering and timing and the global application of the analyses. There are some questions remaining that I believe should be addressed to better explain the results and thus make this work more impactful.

The five major points which should be addressed are as follows:

1. While the justification for the work is well articulated, the key themes and conclusions would benefit from clearer structuring in the introduction, and signalling (and ordering) the distinct steps throughout the methods, results, and discussion – e.g., by including a paragraph at the end of the introduction that explains how the paper will proceed, and by structuring the ensuing methods/results/discussion stepwise with sub-headings.

2. Figure 1 is great for illustrating the importance of clustering, it would be helpful to provide an additional figure in the methods section clearly illustrating the different thresholds and windows (perhaps using the same Apalachicola tide gauge). This would help in interpreting the pattern in the identification of significant clusters presented in Figure 2. The relevance of seasonality could also be highlighted in Figure 2 (e.g., by shading the length of time feasibly defining an overlap in two storm seasons ~400 days). The effects of different RLs and cluster windows on the identification (and definition) of clustering is key.

3. The importance of the topic is well stated, but the importance of the contribution of this work needs to be clearer. There are a number of advances in methodology that are not prominent enough, and some missed opportunities to present a useful approach for application. To present clearer evidence of the contribution and for a stronger conclusion, the strengths and limitations of each preceding step need to be clearly stated and quantified well before the final paragraph. Specifically:

a. CODEC model validation

b. Return level estimates

c. Clustering definition

The statement of limitations in the final paragraph of the discussion skipped over these points, missing the level of detail required to be convincing.

4. The definition of clustering should be more precisely stated following the analysis of clustering thresholds and inter-arrival times, and the effects of different choices discussed. What is the threshold and clustering window of greatest use for coastal planning? Should 1-YR RL events be included in the definition of a cluster (isn’t recurrence within 1-year consistent with the assumption of random distribution)? Is a 730-day window justified/supported in literature for identifying clusters? Is it logical to say that including more frequently occurring events results in higher clustering?

5. From the results onwards, the authors discuss the presence or significance of clustering defined as two events within the specified time window. It would be highly relevant to further quantify and discuss the average and variation in the number of events in identified clusters, their composition, and the possible role of non-stationarity in spatial variations – especially as the GESLA observations are compared over different decades. One important point that I felt was lacking was the role of dependency within clusters, in addition to between individual clusters, and the correlation between storm-dense seasons and external factors (which could likely be drawn from the literature rather than by extending the analysis).

In addition, I offer the following minor points for the authors to consider at their discretion:

• P1 line 42: the sentence “Here, we raise the question…” seems to contradict the second sentence – Poisson distributions are used to assess storm surge frequency (not clustering).

• P2 line 55: it isn’t clear to me why tropical cyclones occurring during certain seasons are an example of less detrimental long-term effects.

• P3 line 29: this is a good point, but I didn’t see how the preceding sentences and references justified why the definition of a cluster is dependent on local and regional characteristics.

• P4 line 21: more detail is required to describe the tidal analysis to ensure repeatability (i.e., default constituents? SNR?). Also, check typo in last sentence of this paragraph.

• P4 line 53: this paragraph could be rearranged/revised to improve clarity.

• P5 line 19: the K metric could be better utilised in the results and discussion (including a plot in the supp?) to justify your statements that the presence of clustering violates the Poisson distribution assumptions. A plot of K vs clustering window would be helpful following paragraph on P6 lines 15-23.

• P6 lines 15-17: this sentence didn’t make sense to me, how can the clustering violate the Poisson principal but also occur in a way that aligns with a Poisson process? This is the point in the manuscript where I began to get muddled in the cluster windows and RL thresholds used to define clusters in events.

• P6 line 52: this is interesting, I would also like to know what % of the time a 3rd and 4th event another event occurs (with a figure!).

• P7 line 19: please quantify “these differences are minor” and preferably include a map of the differences or RMSE.

• Section 3.2: I suggest avoiding the tendency to lead with explaining figures, rather than leading with analysis supported by figures.

• P8 line 41: “events exceeding high return levels can significantly contribute to clustering” could be better phrased.

• P9 line 6: “an alternative approach” reads like introducing a new idea, instead link to the introduction (justification).

• P9 line 28-29: the choice of 180 days could benefit from further (and quantified) justification.

• P9 line 54: “note that years do not need to be consecutive” came as a surprise here, would benefit from further justification.

• P10 lines 24-30: these statements seem a little vague, would be stronger supported with statistics.

• P10 lines 33-52: the first sentence seems to be self-explanatory, a more specific intro sentence would help to make the importance of this paragraph obvious. In particular the point regarding the relationship between cluster frequency and event intensity seems understated. Here is where I would have liked to hear what the preferred selection criteria are for defining clustered events.

• P11 lines 23-27: it would be beneficial to further tie your findings back to the literature… how do your results inform these alternative approaches?

• P12 lines 13-15: no need to redefine recovery time here.

• P12 lines 23-27: please clarify these results, for how many sites? (from GESLA observations or from CODEC?)

• P12 lines 29-39: the concluding sentence of the discussion relies on a little too much faith from the reader, your expansive analyses deserve more decisive recommendations.

• P12 lines 54-60: this presentation of the synthesis of results comes too late in my view.

Recommendation: Global analysis of temporal clusters of storm surges — R0/PR4

Comments

We have now received two reviews of your paper, and I am pleased to see that both reviewers were relatively happy with the paper. Their comments are mainly around clarity and structure, with some recommendations in relation to analysis/changes that could easily satisfy their concerns. You have not addressed the important role that tides play in modulating the effect of a surge, and the reviewer provides suggestions on how you can do this. In summary, I believe you should be able to address the reviewers' concerns adequately.

Decision: Global analysis of temporal clusters of storm surges — R0/PR5

Comments

No accompanying comment.

Author comment: Global analysis of temporal clusters of storm surges — R1/PR6

Comments

Dear Editor,

I am sharing with you a manuscript entitled “Global Analysis of Temporal Clusters of Storm Surges” to be considered for publication in Cambridge Prisms: Coastal Futures.

This paper presents a global analysis of the temporal clustering of storm surges, identifying hotspots of clustering for different definitions of clustering. The paper also tests the validity of the commonly used Poisson distribution to represent the observed clustering. The analysis is performed on model hindcast data from the Coastal Dataset for the Evaluation of Climate Impact 2020 (CoDEC), after validating it using in-situ observations from the GESLA 3.0 dataset. The results also contain the analysis of the inter-arrival times divided into two regimes: short-term clustering and long-term clustering (indicative of recovery times); as well as the analysis of the contribution of events with different magnitudes to clustering. The results from the analysis are made available through a Zenodo repository.

I certify that the submitted paper has not been published previously, that its publication is approved by all authors, and that, if accepted, it will not be published elsewhere. We look forward to hearing back from you and hope that our paper is of interest to Cambridge Prisms: Coastal Futures.

Best regards,

Ariadna Martín.

Review: Global analysis of temporal clusters of storm surges — R1/PR7

Conflict of interest statement

Reviewer declares none.

Comments

The author/s has responded comprehensively to reviewer comments and I congratulate them on a really interesting and important research contribution. I have no hesitation in recommending this submission for prompt publication.

Review: Global analysis of temporal clusters of storm surges — R1/PR8

Conflict of interest statement

Nil.

Comments

Addressed changes satisfactorily - thanks! Will be great to see this published.

Recommendation: Global analysis of temporal clusters of storm surges — R1/PR9

Comments

Both reviewers are happy with your responses, and I am looking forward to seeing this work published.

Decision: Global analysis of temporal clusters of storm surges — R1/PR10

Comments

No accompanying comment.