Machine Learning for Credit Scoring: Improving Logistic Regression with Non Linear Decision Tree Effects

European journal of operational research, 2022-03, Vol.297 (3), p.1178-1192 [Peer Reviewed Journal]

Distributed under a Creative Commons Attribution 4.0 International License; ISSN: 0377-2217; EISSN: 1872-6860; DOI: 10.1016/j.ejor.2021.06.053

  • Author: Dumitrescu, Elena Ivona ; Hué, Sullivan ; Hurlin, Christophe ; Tokpavi, Sessi
  • Subjects: Economics and Finance ; Humanities and Social Sciences
  • Description: In the context of credit scoring, ensemble methods based on decision trees, such as the random forest method, provide better classification performance than standard logistic regression models. However, logistic regression remains the benchmark in the credit risk industry, mainly because the lack of interpretability of ensemble methods is incompatible with the requirements of financial regulators. In this paper, we propose a high-performance and interpretable credit scoring method called penalised logistic tree regression (PLTR), which uses information from decision trees to improve the performance of logistic regression. Formally, rules extracted from various short-depth decision trees built with original predictive variables are used as predictors in a penalised logistic regression model. PLTR allows us to capture non-linear effects that can arise in credit scoring data while preserving the intrinsic interpretability of the logistic regression model. Monte Carlo simulations and empirical applications using four real credit default datasets show that PLTR predicts credit risk significantly more accurately than logistic regression and compares competitively with the random forest method.
  • Publisher: Elsevier
  • Language: English
  • Source: Hyper Article en Ligne (HAL) (Open Access)
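
The idea summarised in the description (rules from short decision trees used as extra predictors in a penalised logistic regression) can be illustrated with a minimal sketch. This is not the authors' implementation. It makes several assumptions that the record does not state: each tree is a depth-1 stump on a single variable, each rule is the indicator 1[x_j > t_j], the penalty is a plain L1 penalty, and the data are synthetic. The function names `best_threshold`, `add_rules`, `predict_proba` and `fit_l1_logistic` are hypothetical and chosen only for this example.

```python
import math
import random


def best_threshold(xs, ys):
    """Depth-1 tree (stump): the split on one variable with the lowest weighted Gini impurity."""
    def gini(labels):
        if not labels:
            return 0.0
        p = sum(labels) / len(labels)
        return 2.0 * p * (1.0 - p)

    best, best_score = None, float("inf")
    values = sorted(set(xs))
    for lo, hi in zip(values, values[1:]):
        t = (lo + hi) / 2.0  # candidate split halfway between neighbouring values
        left = [y for x, y in zip(xs, ys) if x <= t]
        right = [y for x, y in zip(xs, ys) if x > t]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(ys)
        if score < best_score:
            best, best_score = t, score
    return best  # None if the variable is constant


def add_rules(rows, thresholds):
    """Keep the original predictors and append one binary rule 1[x_j > t_j] per variable."""
    return [list(row) + [1.0 if x > t else 0.0 for x, t in zip(row, thresholds)]
            for row in rows]


def predict_proba(row, w, b):
    """Logistic model: probability of the positive class for one row."""
    z = b + sum(wj * xj for wj, xj in zip(w, row))
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_l1_logistic(X, y, lam=0.01, lr=0.5, n_iter=300):
    """L1-penalised logistic regression fitted by proximal gradient descent (assumed penalty)."""
    n, p = len(X), len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(n_iter):
        grad_w, grad_b = [0.0] * p, 0.0
        for row, target in zip(X, y):
            err = predict_proba(row, w, b) - target
            grad_b += err
            for j, xj in enumerate(row):
                grad_w[j] += err * xj
        b -= lr * grad_b / n  # the intercept is left unpenalised
        for j in range(p):
            step = w[j] - lr * grad_w[j] / n
            # Soft-thresholding: the L1 penalty can shrink a coefficient to exactly zero.
            w[j] = math.copysign(max(abs(step) - lr * lam, 0.0), step)
    return w, b


if __name__ == "__main__":
    random.seed(0)
    rows = [[random.random(), random.random()] for _ in range(200)]
    # Synthetic threshold effect (an assumption): default only when the first variable exceeds 0.6.
    labels = [1 if r[0] > 0.6 else 0 for r in rows]
    thresholds = [best_threshold([r[j] for r in rows], labels) for j in range(2)]
    w, b = fit_l1_logistic(add_rules(rows, thresholds), labels)
    print("thresholds:", thresholds)
    print("coefficients (x1, x2, rule1, rule2):", w, "intercept:", b)
```

Because every added predictor is a readable rule with its own coefficient, the fitted model stays a logistic regression and can be inspected term by term. Deeper trees, for example two splits on a pair of variables, would add interaction rules in the same way.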
