SOME CONSIDERATIONS ON EXTREME STATISTICS OF STORM SURGE HEIGHT

 

MASATAKA YAMAGUCHI 1 and YOSHIO HATADA 2

 

Department of Civil and Environmental Engineering, Ehime University

Bunkyocho 3, Matsuyama 790-8577, Japan

1 tel. +81-89-927-9832, fax. +81-89-927-9844, e-mail myamag@en2.ehime-u.ac.jp

2 tel. +81-89-927-9838, fax. +81-89-927-9844, e-mail hatada@en2.ehime-u.ac.jp

 

 

ABSTRACT

The return values and their standard deviations for typhoon-generated storm surge height measured at Osaka for a period of 94 years from 1902 to 1995 are estimated by using an extended version of Goda's extreme analysis model based on the least square method, in which the candidate distributions are the Gumbel and Weibull distributions. Investigations are conducted as to the effects of factors such as the censoring rate and the number of years for the measurement on return storm surge height and its standard deviation. Main conclusions are that increase of sample size associated with lowering of the censoring point and extension of the measurement period are crucial to improving the accuracy and efficiency of the estimates and that an extreme analysis using the data stratified according to typhoon track does not necessarily produce more efficient estimates in this case.

 

Keywords: typhoon-generated storm surge height data, extreme statistics, least square method, sensitivity analysis, stratified sampling technique, Osaka Bay

 

INTRODUCTION

In the design of coastal structures such as breakwaters and sea dikes, reliable estimation for extremes of sea levels, particularly storm surge heights to be expected during a life time at the site is an indispensable step. Extreme statistics for storm surge height have been extensively studied, in which case the most important condition in extreme analysis is to use samples of large size over long years which have approximately stationary, independent and homogeneous properties. In Japan, it is nearly impossible to gather measurement data of storm surge height over more than 50 years, but we have fortunately been able to obtain the measurement data over 94 years of typhoon-generated peak storm surge heights at Osaka, where typhoon-generated storm surges are dominant.

This paper investigates some problems concerning the extreme statistical analysis from viewpoints of (1) censoring rate and year periods of the data, (2) annual maximum data and data of peak over threshold, (3) stability of the estimates of extremes and (4) homogeneity of the data, through a case study using the long-year data of storm surge height at Osaka.

 

DESCRIPTION OF STORM SURGE HEIGHT DATA

The data set of typhoon-generated peak storm surge height measured at Osaka is made through a wide survey of many reports published by the Japan Meteorological Agency and the others, in which the period is 94 years ranging from 1902 to 1995. The effect of storm surge heights generated by storms excluding typhoons on the estimates of return storm surge heights is negligible, because they are considerably smaller than the values during typhoons.

Figure 1 shows a sketch of Osaka Bay and the bay axis passing through Osaka and Tomogashima. Annual maximum series data with sample size of 72 are selected from the peak storm surge height data of 125 cases greater than 20 cm. The largest surge height is 292 cm generated by T3412 (the 12th typhoon in 1934) and the second, third and fourth largest surge heights are respectively 245 cm (T6118), 237 cm (T5028) and 216 cm (T6523). These are all records of surge height exceeding 200 cm at Osaka during a period of 94 years. As is well-known, typhoon winds have different characteristics in the right semi-circle and in the left semi-circle. In order to make the data homogeneous, annual maximum series data stratified into 2 groups according to typhoon track near Osaka Bay are re-selected from the original data set. In this case, each typhoon is classified into either a typhoon which passed through the western area to the bay axis or a typhoon which passed through the eastern area to the bay axis.

Figure 2 indicates yearly variation of annual maximum surge height. Anomalous surge heights occurred intensively over a period of 21 years from 1945 to 1965 and the maximum of the surge height since 1966 is only 134 cm generated by T7916. A decreasing trend of 0.4 cm per year on average can be observed, but the extreme analysis is proceeded under the assumption of an insignificant trend.

 

Figure 1. Sketch of Osaka Bay and bay axis passing through Osaka and

 

Figure 2. Yearly variation of annual maximum storm surge height.

 

OUTLINE OF THE SYSTEM FOR EXTREME ANALYSIS

The system using the least square method for the parameter estimation in the probability distribution is applied, which was originally constructed by Goda (1988, 1990) and extended by the authors (1997). The system makes use of the Gumbel distribution and the 3-parameter Weibull distribution with a fine resolution of the shape parameters varying from 0.5 to 100 as the candidate distributions, and the criterion of the largest correlation coefficient between the ordered data and its reduced variates for the selection of the optimum distribution. The plotting position is based on Goda's empirical formula proper to each distribution. The standard deviation of return value is evaluated by the application of a jackknife method (Miller, 1974). The system becomes consistently available for not only annual maximum data but also data of peak over threshold irrespective of the presence of data censoring, by introducing two factors of the mean occurrence rate and the censoring rate , where NT is the total number of events, K the year period of data and N the number of data used in the analysis.

In the case of data grouped by typhoon track, the overall probability distribution F(x) and R-year return value xR are evaluated on the basis of theoretical formula for compounding the optimum probability distribution for each stratified data Fj (x) and the estimation of the overall variance relies on Goda's empirical formula (1990) for compounding the variance for each stratified data. These are respectively expressed as:

(1), (2)

where n (=2) is the number of stratum and Nj the sample size of stratified data. Eq. (2) indicates an average of each variance weighted with both the sample size and the exceedance probability of R-year return value xR.

 

ESTIMATION OF RETURN STORM SURGE HEIGHT

 

EFFECTS OF DATA CENSORING RATE AND YEAR PERIOD OF DATA

Table 1 indicates the results of extreme analysis for the storm surge height data of annual maximum and peak over threshold in 94 years from 1902 to 1995, in which the listed values are the year period of measurement K, the number of data used in the analysis N, the shape parameter k of the optimum Weibull distribution, the correlation coefficient , the R-year (R=100, 200, 500) return values and their standard deviations , and the measured maximum storm surge height . The analysis is separately made for case of all gathered data and cases of data over threshold values of =50, 75 and 100 cm. The total number of events NT required in the analysis for data of peak over threshold is roughly estimated as the number of typhoons which passed through the area within a radius of hundreds of kilometers centered at Osaka Bay, that is, the average number per year =3.33 multiplied by the year period of data K=94. This is based on the fact that not all of the typhoon tracks before 1940s are identified and Goda's suggestion (1990) that an accurate estimation for NT is not required with an allowable magnitude factor of 2. The feature to be pointed out firstly is that the difference between the results of analysis using both extreme data is small and practically negligible. From this fact, only the results estimated using the annual maximum data are hereafter discussed. Another indication is that the return values change little with the reduction of the censoring rate, whereas the standard deviations gradually increase, meaning less efficient estimates of return values. According to the results estimated using all of the gathered data, the R-year return value and its standard deviation is 270 ± 32 cm for R=100 and 309 ± 37 cm for R=200 respectively, and the return period for the largest storm surge height of 292 cm associated with T3412 is 148 years.

 

Table 1. Effect of censoring rate in annual maximum storm surge height data and peak storm surge height data on return value and standard deviation.

 

The effect of year period for measurement on the results of extreme analysis is indicated in Table 2, in which the data are sub-grouped by 3 year periods. The maximum of the difference among the 500-year return value is only 28 cm which is less than the standard deviation. But, the standard deviation takes a larger value with reduction of measurement period, which indicates less efficient estimate of the return value. The return value and its standard deviation tends to become larger in cases where anomalous high values are included in the data of a shorter period. This leads to the conclusion that an increase of the time span of measurements is an essential condition for improving the reliability of the estimates of return values.

 

Table 2. Effect of number of years for measurement on return storm surge height and standard deviation.

 

STABILITY OF THE ESTIMATE

In order to investigate the stability of the estimates of return storm surge heights, the extreme analysis is made by using 3 sub-grouped data sets. These are (1) the data set which excludes the largest value of 292 cm, (2) the data set selected every even year, which includes the largest value of 292 cm but does not include the second largest value of 245 cm and (3) the data set selected every odd year, which includes the second largest value of 245 cm but does not include the largest value of 292 cm. Table 3 summarizes the results of extreme analysis. Exclusion of the largest value in the analysis gives rise to slightly larger estimates of the return values and their standard deviations, but the estimates may be said to be relatively stable, because the difference of the return value is not significant, if the standard deviation is taken into account. On the other hand, the return values and their standard deviations estimated using the data of even years are much greater than those estimated using the data of odd years. The analyzed results are accompanied with large variations, depending on the presence of anomalous data, as the occurrences of extraordinary storm surges are rare even in a year period for measurement of almost 100 years.

 

Table 3. Stability of estimated return storm surge height and standard deviation.

 

Similar variations can be observed more clearly in Table 4, in which case the analysis is separately conducted using the data of 47 years sub-grouped by the periods of (1) 1902 to 1948, (2) 1926 to 1972 and (3) 1949 to 1995. The largest and second largest surge heights occurred in 1934 and 1961 respectively. The greater the number of anomalous values in the data set is, the larger the return value is, and the difference among return values estimated using the 3 data sets comes to significant magnitude comparable to the minimum of the standard deviations. The standard deviations estimated using the data of 1902 to 1948 take a fairly large value by reason of smaller size of the sample and smaller shape parameter of the optimum distribution. As was mentioned above, the estimates of return values might vary significantly, depending on the year period of data acquisition. Therefore, increase of the time span of measurements is an indispensable condition for obtaining stable estimates of return values.

 

Table 4. Change of return storm surge height and standard deviation with 3

 

HOMOGENEITY OF EXTREME DATA

Table 5 illustrates the results of extreme analysis estimated using the 2 data sets for a period of 63 years from 1933 to 1995 which are stratified by the typhoon track. The last column gives the return values and their standard deviations aggregated by eqs. (1) and (2), and index 'G' in the third column means the Gumbel distribution. The return values of surge heights associated with the typhoons which passed through the western area to the bay axis are rather greater than those with the typhoons which took the eastern area track, and the resulting aggregated return values are almost identical to those for the typhoons of the eastern area track and/or those estimated using all of the gathered data. These are true for the standard deviations, but the correlation coefficient in the case of stratified data is slightly lower compared to that in the case of unstratified data. Advantages of a stratified sampling technique over the usual method cannot be found in the extreme analysis for the storm surge data at Osaka, probably due to the augment of statistical variability of the data associated with the stratification.

 

Table 5. Return storm surge height and standard deviation estimated from data stratified by typhoon track.

 

CONCLUSIONS

Some problems concerning the extreme data analysis are investigated through the case study that analyzes the typhoon-generated maximum storm surge height data measured at Osaka during a 94 year period of 1902 to1995 from variety of viewpoints. Main results are summarized as follows.

(1) Increase of sample size associated with lowering of censoring point and extension of a year period for measurement are essential to improving the accuracy and efficiency for the estimates of the return values.

(2) Data selection method such as annual maximum method or peak over threshold method produces little difference in the estimates of return values and their standard deviations for the measurement data at Osaka.

(3) A data stratification method does not necessarily yield better estimates of return values compared to the conventional method, probably due to sample variability in the present case.

 

REFERENCES

Goda, Y. (1988): On the methodology of selecting design wave height, Proc. 21st ICCE, Vol.1, pp.899-913.

Goda, Y. (1990): Design of Harbour Structures against Random Sea Waves-Introduction to Wave Engineering (2nd edit.), Kajima Pub., p.333 (in Japanese).

Miller, R.G. (1974): The jackknife-a review, Biometrica, Vol.61, No.1, pp.1-15.

Yamaguchi, M. and Y. Hatada (1997): An extremal analysis system and its application to the estimation of meteorological and oceanographic elements around the coasts of Japan, Ocean Wave Measurement and Analysis (WAVES97), Vol.2, pp.932-946.