Missing data in wind farm time series: properties and effect on forecasts

Tawn, R., Browell, J. and Dinwoodie, I. (2020) Missing data in wind farm time series: properties and effect on forecasts. Electric Power Systems Research, 189, 106640. (doi: 10.1016/j.epsr.2020.106640)

Missing or corrupt data is common in real-world datasets; this affects the estimation and operation of analytical models where completeness is assumed or required. Statistical wind power forecasts utilise recent turbine data as model inputs, and must therefore be robust to missing data. We find that wind power data is ‘missing not at random’, with missing patterns also related to the forecast output. Approaches for dealing with this missing data in training and operation are proposed and evaluated through a case study, leading to a suggested forecasting methodology in the presence of missing data. In the training set, missing data was found to have a significant negative impact on performance if simply omitted, but this can be almost completely mitigated using multiple imputation. A greater increase in forecast errors is seen when input data are missing operationally, and re-training forecast models using the remaining inputs is found to be preferable to imputation.
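
As a rough illustration of the operational strategy the abstract describes (re-training on the remaining inputs when some are missing), the sketch below refits a simple model using only the inputs that are present at forecast time. It is not the authors' implementation: the linear least-squares model, the synthetic data, and the names `fit_linear` and `predict_remaining_inputs` are all assumptions made for this example.

```python
import numpy as np

def fit_linear(X, y):
    """Ordinary least squares with an intercept; returns [intercept, slopes...]."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef

def predict_remaining_inputs(X_train, y_train, x_new):
    """Refit using only the inputs that are present (non-NaN) in x_new, then predict."""
    available = ~np.isnan(x_new)
    coef = fit_linear(X_train[:, available], y_train)
    return coef[0] + x_new[available] @ coef[1:]

# Synthetic stand-in for turbine-level inputs and a farm-level target (hypothetical data).
rng = np.random.default_rng(0)
X_train = rng.normal(size=(500, 3))
y_train = 0.5 + X_train @ np.array([1.0, -2.0, 0.3]) + rng.normal(scale=0.1, size=500)

# At forecast time the second input is missing, so the model is refit on inputs 1 and 3 only.
x_new = np.array([0.2, np.nan, -1.0])
forecast = predict_remaining_inputs(X_train, y_train, x_new)
```

In practice one would likely pre-fit and cache a model for each missingness pattern that occurs often, rather than refitting on every forecast; that choice is also an assumption of this sketch rather than something stated in the abstract.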

Item Type: Articles
Glasgow Author(s) Enlighten ID: Browell, Dr Jethro
Authors: Tawn, R., Browell, J., and Dinwoodie, I.
College/School: College of Science and Engineering > School of Mathematics and Statistics > Statistics
Journal Name: Electric Power Systems Research
ISSN (Online): 1873-2046
Published Online: 11 August 2020
