Get 20M+ Full-Text Papers For Less Than $1.50/day. Subscribe now for You or Your Team.

Learn More →

Prediction of credit risk with an ensemble model: a correlation-based classifier selection approach

Prediction of credit risk with an ensemble model: a correlation-based classifier selection approach Ensemble models that combine multiple base classifiers have been widely used to improve prediction performance in credit risk evaluation. However, an arbitrary selection of base classifiers is problematic. The purpose of this paper is to develop a framework for selecting base classifiers to improve the overall classification performance of an ensemble model.Design/methodology/approachIn this study, selecting base classifiers is treated as a feature selection problem, where the output from a base classifier can be considered a feature. The proposed correlation-based classifier selection using the maximum information coefficient (MIC-CCS), a correlation-based classifier selection under the maximum information coefficient method, selects the features (classifiers) using nonlinear optimization programming, which seeks to optimize the relationship between the accuracy and diversity of base classifiers, based on MIC.FindingsThe empirical results show that ensemble models perform better than stand-alone ones, whereas the ensemble model based on MIC-CCS outperforms the ensemble models with unselected base classifiers and other ensemble models based on traditional forward and backward selection methods. Additionally, the classification performance of the ensemble model in which correlation is measured with MIC is better than that measured with the Pearson correlation coefficient.Research limitations/implicationsThe study provides an alternate solution to effectively select base classifiers that are significantly different, so that they can provide complementary information and, as these selected classifiers have good predictive capabilities, the classification performance of the ensemble model is improved.Originality/valueThis paper introduces MIC to the correlation-based selection process to better capture nonlinear and nonfunctional relationships in a complex credit data structure and construct a novel nonlinear programming model for base classifiers selection that has not been used in other studies. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Journal of Modelling in Management Emerald Publishing

Prediction of credit risk with an ensemble model: a correlation-based classifier selection approach

Journal of Modelling in Management , Volume 17 (4): 20 – Nov 29, 2022

Loading next page...
 
/lp/emerald-publishing/prediction-of-credit-risk-with-an-ensemble-model-a-correlation-based-TLuOGQKX8i

References (46)

Publisher
Emerald Publishing
Copyright
© Emerald Publishing Limited
ISSN
1746-5664
eISSN
1746-5664
DOI
10.1108/jm2-09-2020-0235
Publisher site
See Article on Publisher Site

Abstract

Ensemble models that combine multiple base classifiers have been widely used to improve prediction performance in credit risk evaluation. However, an arbitrary selection of base classifiers is problematic. The purpose of this paper is to develop a framework for selecting base classifiers to improve the overall classification performance of an ensemble model.Design/methodology/approachIn this study, selecting base classifiers is treated as a feature selection problem, where the output from a base classifier can be considered a feature. The proposed correlation-based classifier selection using the maximum information coefficient (MIC-CCS), a correlation-based classifier selection under the maximum information coefficient method, selects the features (classifiers) using nonlinear optimization programming, which seeks to optimize the relationship between the accuracy and diversity of base classifiers, based on MIC.FindingsThe empirical results show that ensemble models perform better than stand-alone ones, whereas the ensemble model based on MIC-CCS outperforms the ensemble models with unselected base classifiers and other ensemble models based on traditional forward and backward selection methods. Additionally, the classification performance of the ensemble model in which correlation is measured with MIC is better than that measured with the Pearson correlation coefficient.Research limitations/implicationsThe study provides an alternate solution to effectively select base classifiers that are significantly different, so that they can provide complementary information and, as these selected classifiers have good predictive capabilities, the classification performance of the ensemble model is improved.Originality/valueThis paper introduces MIC to the correlation-based selection process to better capture nonlinear and nonfunctional relationships in a complex credit data structure and construct a novel nonlinear programming model for base classifiers selection that has not been used in other studies.

Journal

Journal of Modelling in ManagementEmerald Publishing

Published: Nov 29, 2022

Keywords: Risk analysis; Nonlinear programming; Modeling; Data analytics; Credit risk prediction; Ensemble model; Correlation-based classifier selection; Maximum information coefficient; Nonlinear optimization

There are no references for this article.