Semantic Scholar Open Access 2020 335 sitasi

Statistical and machine learning models in credit scoring: A systematic literature survey

X. Dastile T. Çelik M. Potsane

Abstrak

Abstract In practice, as a well-known statistical method, the logistic regression model is used to evaluate the credit-worthiness of borrowers due to its simplicity and transparency in predictions. However, in literature, sophisticated machine learning models can be found that can replace the logistic regression model. Despite the advances and applications of machine learning models in credit scoring, there are still two major issues: the incapability of some of the machine learning models to explain predictions; and the issue of imbalanced datasets. As such, there is a need for a thorough survey of recent literature in credit scoring. This article employs a systematic literature survey approach to systematically review statistical and machine learning models in credit scoring, to identify limitations in literature, to propose a guiding machine learning framework, and to point to emerging directions. This literature survey is based on 74 primary studies, such as journal and conference articles, that were published between 2010 and 2018. According to the meta-analysis of this literature survey, we found that in general, an ensemble of classifiers performs better than single classifiers. Although deep learning models have not been applied extensively in credit scoring literature, they show promising results.

Topik & Kata Kunci

Penulis (3)

X

X. Dastile

T

T. Çelik

M

M. Potsane

Format Sitasi

Dastile, X., Çelik, T., Potsane, M. (2020). Statistical and machine learning models in credit scoring: A systematic literature survey. https://doi.org/10.1016/j.asoc.2020.106263

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.1016/j.asoc.2020.106263
Informasi Jurnal
Tahun Terbit
2020
Bahasa
en
Total Sitasi
335×
Sumber Database
Semantic Scholar
DOI
10.1016/j.asoc.2020.106263
Akses
Open Access ✓