Servicios Personalizados
Articulo
Indicadores
Links relacionados
Citado por Google
Similares en Google
Compartir
South African Journal of Science
versión On-line ISSN 1996-7489
versión impresa ISSN 0038-2353
Resumen
BREED, Douw G. y VERSTER, Tanja. The benefits of segmentation: Evidence from a South African bank and other studies. S. Afr. j. sci. [online]. 2017, vol.113, n.9-10, pp.1-7. ISSN 1996-7489. http://dx.doi.org/10.17159/sajs.2017/20160345.
We applied different modelling techniques to six data sets from different disciplines in the industry, on which predictive models can be developed, to demonstrate the benefit of segmentation in linear predictive modelling. We compared the model performance achieved on the data sets to the performance of popular non-linear modelling techniques, by first segmenting the data (using unsupervised, semi-supervised, as well as supervised methods) and then fitting a linear modelling technique. A total of eight modelling techniques was compared. We show that there is no one single modelling technique that always outperforms on the data sets. Specifically considering the direct marketing data set from a local South African bank, it is observed that gradient boosting performed the best. Depending on the characteristics of the data set, one technique may outperform another. We also show that segmenting the data benefits the performance of the linear modelling technique in the predictive modelling context on all data sets considered. Specifically, of the three segmentation methods considered, the semi-supervised segmentation appears the most promising. SIGNIFICANCE: • The use of non-linear modelling techniques may not necessarily increase model performance when data sets are first segmented. • No single modelling technique always performed the best. • Applications of predictive modelling are unlimited; some examples of areas of application include database marketing applications; financial risk management models; fraud detection methods; medical and environmental predictive models.
Palabras clave : predictive models; case studies; logistic regression; linear modelling; semi-supervised segmentation.
