Servicios Personalizados
Revista
Articulo
Indicadores
Links relacionados
Citado por Google
Similares en Google
Compartir
Journal of Energy in Southern Africa
versión On-line ISSN 2413-3051versión impresa ISSN 1021-447X
Resumen
CLOHESSY, Chantelle; BRETTENNY, Warren y ABRAHAMS, Waldo. Outlier detection in ground-measured solar resource data using statistical classification models. J. energy South. Afr. [online]. 2025, vol.36, n.1, pp.1-14. ISSN 2413-3051. https://doi.org/10.17159/2413-3051/2025/v36i1a20742.
Ground-based solar resource measurements are known to be preferred to synthetic or simulated data for a given location, but outliers present in this data can significantly impact the accuracy of predictions used in viability assessments. For solar energy installations to be self-sustaining and viable, accurate ground-based solar resource data for the location of these installations are essential for decision-making and planning. Conventional outlier detection techniques used for solar resources, including graphical plots to complex numerical approaches, often have difficulty identifying these outliers to a satisfactory degree. This study proposes the use of simulated outliers added to synthetic data to train and compare the effectiveness of traditional outlier detection methods and several statistical learning methods, including kNN, naïve Bayes, support vector machines and advanced tree-based models for the purpose of outlier detection in this field. The results indicate that the advanced tree-based models provide accurate identification of outliers in the simulation step and are demonstrated to be effective on a ground-based real world data set collected in Gqeberha, South Africa. The use of the proposed approach can aid in reducing the uncertainty in measured solar resource data and, as a result, help to promote the use of solar energy solutions in areas with unreliable solar resource data. HIGHLIGHTS: • Simulation method proposed to leverage the use of supervised classification algorithms for outlier detection. • Tree-based models achieved >99% cross validated accuracy in simulated data. • Case study for South African solar resource showed promising result. • Improved accuracy for outlier detection compared to current methods.
Palabras clave : outlier detection; solar resource assessment; statistical learning.












