Vista completa de documento

Nº Sistema 000470761
Autor LinkAlonso-Robisco, Andres
Autor LinkCarbó, José Manuel
Título Understanding the performance of machine learning models to predict credit default [Recurso electrónico] : a novel approach for supervisory evaluation / Andrés Alonso and José Manuel Carbó.
Datos publicación LinkMadrid : Banco de España, 2021.
Descrip. física 44 p.
Serie LinkDocumentos de Trabajo / Banco de España ; 2105 ;ISSN:1579-8666
Resumen In this paper we study the performance of several machine learning (ML) models for credit default prediction. We do so by using a unique and anonymized database from a major Spanish bank. We compare the statistical performance of a simple and traditionally used model like the Logistic Regression (Logit), with more advanced ones like Lasso penalized logistic regression, Classification And Regression Tree (CART), Random Forest, XGBoost and Deep Neural Networks. Following the process deployed for the supervisory validation of Internal Rating-Based (IRB) systems, we examine the benefits of using ML in terms of predictive power, both in classification and calibration. Running a simulation exercise for different sample sizes and number of features we are able to isolate the information advantage associated to the access to big amounts of data, and measure the ML model advantage. Despite the fact that ML models outperforms Logit both in classification and in calibration, more complex ML algorithms do not necessarily predict better. We then translate this statistical performance into economic impact. We do so by estimating the savings in regulatory capital when using ML models instead of a simpler model like Lasso to compute the risk-weighted assets. Our benchmark results show that implementing XGBoost could yield savings from 12.4% to 17% in terms of regulatory capital requirements under the IRB approach. This leads us to conclude that the potential benefits in economic terms for the institutions would be significant and this justify further research to better understand all the risks embedded in ML models. [Resumen de autor] [eng]
Restricciones Acceso público y gratuito a la versión electrónica en Internet
Acceso electrónico  Ver en el Repositorio Institucional. 
Relacionado con International Review of Financial Analysis, v. 84, November 2022, 102372
Clasificación LinkC3-Métodos Econométricos y Estadísticos. 
LinkC4-Modelización econométrica. 
LinkR81-Big data e inteligencia artificial. 
LinkG2-Sistemas bancarios y actividad crediticia. 
Entidad secundaria LinkBanco de España

2013 Banco de España, Madrid, España. Reservados todos los derechos
Basado en Ex Libris (© 2009 Ex Libris)

Contacto