Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/1458
Title: Predicting monthly streamflow using data-driven models coupled with data-preprocessing techniques
Authors: Wu, C. L.
Chau, Kwok-wing
Li, Y. S.
Subjects: Monthly streamflow forecast
Distributed support vector regression
Reconstruction of dynamics
Singular spectrum analysis
False nearest neighbors
Moving average
Artificial neural networks
Hydrology
Hydrological models
Issue Date: 25-Aug-2009
Publisher: American Geophysical Union
Source: Water Resources Research, Aug. 2009, v. 45, W08432.
Abstract: In this paper, the accuracy performance of monthly streamflow forecasts is discussed when using data-driven modeling techniques on the streamflow series. A crisp distributed support vectors regression (CDSVR) model was proposed for monthly streamflow prediction in comparison with four other models: autoregressive moving average (ARMA), K-nearest neighbors (KNN), artificial neural networks (ANNs), and crisp distributed artificial neural networks (CDANN). With respect to distributed models of CDSVR and CDANN, the fuzzy C-means (FCM) clustering technique first split the flow data into three subsets (low, medium, and high levels) according to the magnitudes of the data, and then three single SVRs (or ANNs) were fitted to three subsets. This paper gives a detailed analysis on reconstruction of dynamics that was used to identify the configuration of all models except for ARMA. To improve the model performance, the data-preprocessing techniques of singular spectrum analysis (SSA) and/or moving average (MA) were coupled with all five models. Some discussions were presented (1) on the number of neighbors in KNN; (2) on the configuration of ANN; and (3) on the investigation of effects of MA and SSA. Two streamflow series from different locations in China (Xiangjiaba and Danjiangkou) were applied for the analysis of forecasting. Forecasts were conducted at four different horizons (1-, 3-, 6-, and 12-month-ahead forecasts). The results showed that models fed by preprocessed data performed better than models fed by original data, and CDSVR outperformed other models except for at a 6-month-ahead horizon for Danjiangkou. For the perspective of streamflow series, the SSA exhibited better effects on Danjingkou data because its raw discharge series was more complex than the discharge of Xiangjiaba. The MA considerably improved the performance of ANN, CDANN, and CDSVR by adjusting the correlation relationship between input components and output of models. It was also found that the performance of CDSVR deteriorated with the increase of the forecast horizon.
Rights: Copyright 2009 American Geophysical Union.
Reproduced/modified by permission of American Geophysical Union.
Type: Journal/Magazine Article
URI: http://hdl.handle.net/10397/1458
DOI: 10.1029/2007WR006737
ISSN: 0043-1397
Appears in Collections:CEE Journal/Magazine Articles

Files in This Item:
File Description SizeFormat 
WRR.pdfPre-published version1.21 MBAdobe PDFView/Open


All items in the PolyU Institutional Repository are protected by copyright, with all rights reserved, unless otherwise indicated. No item in the PolyU IR may be reproduced for commercial or resale purposes.