Sobri Harun1,
Van Thanh Nguyen2,
Mohd Shafiek Yaacob3,
Hishamuddin Jamaluddin4
1 Lecturer, Faculty of Civil Engineering, Universiti Teknologi Malaysia,
81310 UTM Skudai, Johor, Malaysia. sobri@fka.utm.my
2 Professor, Dept. of Civil Engineering and Applied Mechanics, McGill University,
817 Sherbrooke Street West, Montreal, Canada H3A 2K6. nguyen@civil.lan.mcgill.ca
3 Lecturer; 4 Associate Professor, Faculty of Mechanical Engineering,
Universiti Teknologi Malaysia, 81310 UTM Skudai,
Johor, Malaysia. shafiek@fkm.utm.my; hisham@fkm.utm.my
Abstract: Studies on the time series modeling of monthly reservoir inflows sequences using neural network method are presented. Based on multilayer perceptron (MLP) and radial basis function (RBF)methods, two types of neural network models were proposed to forecast the net inflow series into Pedu-Muda reservoir system in Kedah, Malaysia. The analyses determine the feasiblity and accuracy of neural network model in univariate forecasting of net inflows. Results of neural network forecast were compared with seasonal autoregresive integrated moving average (ARIMA) and adaptive fuzzy model (FM). The illustrative model’s application indicates that MLP, RBF and FM models provide a promising forecast of net inflow series into a reservoir.
Keywords: ARIMA, forecast, fuzzy, inflow, neural network
The autoregressive integrated moving average, ARIMA models [2] have been extensively used in hydrology and water resources for modeling of streamflow sequences. The natural hydrological processes that often contain the seasonal component could be handled by using multiplicative seasonal ARIMA model. Applications of multiplicative seasonal ARIMA in flow forecasting are illustrated by [8,9]. Recently, there are increasing number of works attempting to apply the neural network method for solving various problems in different branches of science and engineering. The characteristics of the neural networks is that they provide a computational or mathematical technique which is powerful for modeling systems where explicit form of the relationship between the variables involved is unknown [3]. It was discovered by various researchers that neural network method is successful in capturing non-linearity in the natural process after exposed to sufficient learning or training process. There have been various applications of neural networks in the field of hydrology and water resources such as for modeling rainfall-runoff processes [5,6]; river flow prediction [7]; and water demand time series forecasting [4].
The neural network method [1,3] is applied in the univariate forecasting using monthly net inflow series in the Pedu-Muda reservoir system in Kedah, Malaysia. Based on multilayer perceptron and radial basis function methods, two types of univariate forecasting model were proposed. The performance of the proposed neural network models is assessed using errors criteria. Results of the univariate net inflow forecast are compared to the multiplicative seasonal ARIMA and adaptive fuzzy models [10].
The relationship between the current and previous net inflow series is believed to be highly non-linear, time varying, and possesses a stochastic behaviour. Hence the application of multilayer perceptron and radial basis function methods in the univariate net inflow forecasting is appropriate. In the univariate net inflow forecasting models, the input and output node consist of the net inflow series itself. The models are designed having several input nodes and one output node. In the proposed model the input nodes are the previous net inflow series and the output node is the current net inflow data.
The first technique of neural network
modeling is the multilayer perceptron (MLP) model. Basically the MLP consists of
an input layer, a hidden layer, and an output layer. The proposed MLP model
consists of an input layer linked to the input net inflow variables
, a hidden layer, and an output layer that connects to the output net inflow
variable. Figure 1 illustrates the architecture of the proposed univariate net
inflow forecasting model.
Input layer is the previous net inflow data and the output layer constitute the current net inflow data. The information of previous net inflow in the input layer transfer to the next consecutive layers in the system of feed forward networks. The binary sigmoid activation function will process the signal send by net inflow input that passes from each node. Associated with each incoming net inflow signal is a weight. Each input node unit (i = 1,...,m) in input layer broadcast the net inflow signal to hidden layer. The input signal resulting from the input net inflow data. Each hidden node (j = 1,....,n) sums its weighted input signals,
(1)
applies its activation function to compute its output signal from net inflow,
(2)
and sends this
signal to all units in the hidden layer, which
is the weight between input layer
and hidden layer,
is the weight for the bias; and
is the input net inflow signal.
The binary sigmoid activation function will process the signal that passes from
each node:
(3)
Then from second layer the signal is
transmitted to third layer. The output unit (k = 1) sums its weighted input
signals, and applies its activation function to compute its output signal, where
is the weight between second
layer and third layer; and
is the weight for the bias. The
output node (k= 1) receives a target pattern corresponding to the input training
pattern, computes its error (between target output and computed output)
information term, calculates its weight correction term (used to update
later), calculates its bias
correction term (used to update
later).
is the neural network output as
net inflow variable. The error information is conveyed from the output layer
back to early layers. This is known as the backpropagation of the output error
to the input nodes to correct the weights. This method uses partial derivatives
of error with respect to weights to update the weights of the connections that
allows a multilayer network to change its weights in response to the output
error. The weights are updated continuously until minimum error is achieved. In
this study the steepest descent method with momentum constant is applied.for a
faster convergence.
The second technique of the neural
network modeling is the radial basis function (RBF). It is recognized that the
RBF neural networks is a curve fitting problem. Therefore the learning process
is to find the best fit to the training data. Figure 2 illustrates the designed
architecture of the RBF model for the univariate net inflow forecasting models.
In the Figure 2, the Euclidean length is represented by rj that
measures the radial distance between the datum vector
; and the radial centre
; and can be written as:
.=
(4)
A suitable transfer function is then applied to rj to give,
(5)
Finally the
output layer (k = 1) receives a weighted linear combination of
,
(6)
The transfer functions of the nodes
are governed by nonlinear functions that is assumed to be an approximation of
the influence that data points have at the center. The decisions that will
affect the performance of the radial basis function network during training
include the number of centers, the transfer function and the parameter of the
nonlinear functions. The transfer functions applied in this work are the radial
spline
; Gaussian
; and multiquadratic
; where b is a fixed constant; c is a constant and r is the Euclidean error distance.
The seasonal autoregressive integrated moving average, ARIMA model is designed as the univariate time series forecasting of reservoir net inflows. The development of seasonal ARIMA models requires three steps: identification, estimation and diagnostic checking. The best model of seasonal ARIMA is chosen based on the minimum Akaike Information Criterion (AIC) and the white noise of residuals. The Box-Ljung test is used to assess the white noise of residuals. A general description of the multiplicative seasonal ARIMA model for monthly net inflows is given as follows,
(7)
where, for
this study seasonal period s = 12;
is the observed net inflow
discrete time series; at is a zero-mean white disturbance sequence
with constant variance s2; B is the unit backshift operator defined
by
;
and
are difference operators which
remove nonstationarities in the observations; f and F are the autoregressive coefficients; q and Q are the moving average coefficients; p
is the order of autoregressive component; q is the order of moving average
components; P is the order of seasonal autoregressive component; Q is the order
of seasonal moving average component; d is the degree of differencing; and D is
the degree of seasonal differencing.
2.3 Adaptive fuzzy model
The univariate forecasting of net
inflow have been carried out by [10] using the adaptive fuzzy model (FM). The
methodology related to training and forecasting had been described details in
their work. Basically, the number of input nodes for FM is similar to the neural
network. [10] employed M = 12 number of rules, one possible rule
for every month. The spread of the membership functions of the input variables
was chosen s = 60, and the learning rate for all
models was a = 0.2.
Record of 27 years of monthly net inflow series of Pedu-Muda dams (Kedah, Malaysia) is selected to evaluate the performance of the neural network, adaptive fuzzy and seasonal ARIMA models. The two dams are connected by a free flow Saiong Tunnel and can be treated as a single reservoir system. For model development, 18 years of data are used, and for model testing, data for the remaining period of 9 years are considered. The models are proposed to forecast the next net inflow yt+1, with known previous net inflow series. The optimal number of input node is important in the neural network and fuzzy model design and it is determined based on the following methods:
(1) Correlation method - analysis of partial autocorrelation function (PACF):
yt , yt-1 , yt-3 , yt-7 , yt-8 , yt-10 , yt-11
(2) Regression method - using stepwise regression:
yt , yt-2 , yt-7 , yt-10
(3) Adopt the whole 12 months data:
yt , yt-1 , yt-2 , yt-3 , yt-4 , yt-5 , yt-6 , yt-7 , xt-8 , xt-9 , xt-10 , xt-11
Basically, the number of input nodes considered for MLP, RBF and FM are, respectively 4, 7 and 12 nodes. For the neural network training process, the best hidden nodes is chosen based on the minimum root mean square errors (RMSE) computed for the training data. During modeling stage, the autocorrellation function of residuals is plotted to check for independence. The prediction of each model is evaluated using the root mean square error (RMSE), mean absolute error (MABE) and mean absolute relative error (MARE) stated in percentage.
The seasonal ARIMA model requires logarithmic transformation and seasonal difference to remove the non-stationarity in the net inflow series. As the net inflow have non-positive values, the lower bound is determined in the natural log transformation, zt = ln (y + a) . Where a is tested for several values until the skewness of zt is zero or almost zero; y is the original net inflow data; and zt is the transformed data. In this work the seasonal ARIMA of order (2,0,0) × (0,1,1)12 is adopted with the parameters of model are f1 = 0.1788 ; f2 = 0.1318; Q = 0.8084; and the residual variance is 0.1857.
Table 1 presents
the errors (RMSE, MABE and MARE) resulting from MLP, RBF, FM and ARIMA models.
The seasonal ARIMA model gives a higher error than the MLP, RBF and FM models.
It more obvious for the MARE. Results show that the neural network and adaptive
fuzzy can provide more accurate net inflow forecast than classical method,
multiplicative seasonal ARIMA. Analyses of errors reveal that although different
input nodes (4, 7 and 12 input nodes) were used in the modeling, the neural and
fuzzy display some flexibility to generalize their parameters and the results
are in good agreement. In general, the neural network and adaptive fuzzy are
adequate and reasonable methods for modeling of net inflow series.
Neural network and adaptive fuzzy are more flexible and more general than multiplicative seasonal ARIMA model. Both methods can learn, generalize and abstract the pattern of monthly net inflow series process easily with different patterns of input nodes. The present study shows that neural networks are capable of handling the hidden structure of net inflow series relationship. Studies have indicated that neural network and adaptive fuzzy models can provide a better performance in reservoir net inflow forecasting than seasonal ARIMA model. Obviously, the neural network and adaptive fuzzy models yield a lower forecasting error compared to seasonal ARIMA model.
References
[1]
Bishop C.S.. (1995). Neural networks for
pattern recognition. Oxford University Press, Great Clarendon Street, Oxford
OX2 6DP.
[2] Box G.E.P. and Jenkins G.M. (1976). Time series analysis forecasting and control. Holden-Day Inc, California.
[3] Fausett L. (1994). Fundamentals of neural networks. Prentice Hall, Englewood Cliffs, New Jersey, USA.
[4] Grino R. (1992). Neural networks for univariate time series forecasting and their application to water demand prediction. Neural Network World. (5): 437-450.
[5] Harun S., Kassim A.H. and Nguyen V.T.V. (1996). Inflow estimation with neural networks. Proc. of 10th Congress of Asian-Pacific Division, International Assoc. for Hydraulic Research, Langkawi, 26-29 August,1996 Vol 1, 150-155.
[6] Hsu K., Gupta H.V., and Sorooshian S. (1995). Artificial neural network modeling of the rainfall-runoff process. Water Resources Research. 31(10): 2517-2530.
[7] Karunithi N, Grenney W.J., Whitley D., and Bovee K. (1994). Neural networks for river flow prediction. J. Computing in Civ. Eng. 8(2): 201-220.
[8] McKerchar A.I. and Delleur J.W.. (1974). Application of seasonal parametric linear stochastic models to monthly flow data. Water Resources Research. 10(2): 246-255.
[9] Mohan S. and Vedula S.. (1995). Multiplicative seasonal ARIMA model for longterm forecasting of inflows. Water Resources Management. Kluwer Acad. Pub. (9): 15-126.
[10] M. S. Yaacob, H. Jamaluddin, S. Harun (2000). Adaptive Fuzzy Model for Inflow Forecasting, Jurnal Teknologi UTM. No 32: (submitted full paper)

Fig. 1 MLP univariate net inflow model

Fig. 2 RBF univariate net inflow model
Table 1 Results of the univariate net inflow forecasting models
|
|
|
Training |
Forecasting |
|
|
|
MODEL |
STRUCTURE |
RMSE (mcm) |
RMSE (mcm) |
MABE (mcm) |
MARE (%) |
|
MLP |
4-9-1* |
30.10 |
32.74 |
24.64 |
39.62 |
|
RBF-spline |
4 input nodes |
32.22 |
31.60 |
24.43 |
39.28 |
|
RBF-gaussian |
4 input nodes |
34.81 |
31.93 |
24.41 |
39.24 |
|
RBF-quadratic |
4 input nodes |
34.83 |
32.23 |
24.65 |
39.63 |
|
MLP |
7-3-1* |
32.08 |
32.21 |
22.89 |
36.81 |
|
RBF-spline |
7 input nodes |
32.73 |
30.86 |
22.32 |
35.89 |
|
RBF-gaussian |
7 input nodes |
33.29 |
31.14 |
23.18 |
37.27 |
|
RBF-quadratic |
7 input nodes |
33.08 |
30.60 |
22.22 |
35.72 |
|
MLP |
12-11-1* |
30.70 |
32.26 |
23.77 |
38.21 |
|
RBF-spline |
12 input nodes |
32.56 |
32.48 |
25.07 |
40.31 |
|
RBF-gaussian |
12 input nodes |
35.84 |
32.62 |
24.80 |
39.87 |
|
RBF-quadratic |
12 input nodes |
33.16 |
31.82 |
24.29 |
39.05 |
|
FM 4-12 |
4 input |
34.74 |
31.15 |
24.48 |
39.36 |
|
FM 7-12 |
7 input |
34.28 |
32.09 |
24.72 |
39.74 |
|
FM12-12 |
12 input |
30.35 |
34.08 |
25.95 |
41.72 |
|
ARIMA |
8 input |
34.05 |
34.76 |
26.87 |
73.23 |
RBF is radial basis function; MLP is
multilayer perceptron, FM is fuzzy model.
ARIMA is the autoregressive integrated
moving average.
* input nodes-hidden nodes-output
nodes; mcm - million cubic meter.