Abstract
In this paper, we compare two different approaches to estimate the credit risk for small- and mid-sized businesses (SMBs), namely a classic parametric approach, by fitting an ordered probit model, and a non-parametric approach, calibrating a machine learning historical random forest (HRF) model. The models are applied to a unique and proprietary dataset comprising granular firm-level quarterly data collected from a European investment bank and an international insurance company on a sample of 464 Italian SMBs over the period 2015–2017. Results show that the HRF approach outperforms the traditional ordered probit model, highlighting how advanced estimation methodologies that use machine learning techniques can be successfully implemented to predict SMB credit risk, i.e. when facing high asymmetries of information. Moreover, by using Shapley values, we are able to assess the relevance of each variable in predicting SMB credit risk.
| Original language | English |
|---|---|
| Article number | 101746 |
| Number of pages | 9 |
| Journal | Socio-Economic Planning Sciences |
| Volume | 90 |
| Early online date | 8 Nov 2023 |
| DOIs | |
| Publication status | Published - Dec 2023 |
Data Availability Statement
The data that has been used is confidential.Funding
This research has received funding from the European Union’s Horizon 2020 research and innovation program “PERISCOPE: Pan European Response to the ImpactS of COvid-19 and future Pandemics and Epidemics” , under the Grant Agreement No. 101016233, H2020-SC1-PHE-CORONAVIRUS-2020-2-RTD.
| Funders | Funder number |
|---|---|
| European Research Council | 101016233, H2020-SC1-PHE-CORONAVIRUS-2020-2-RTD |
Fingerprint
Dive into the research topics of 'Machine learning and credit risk: Empirical evidence from small- and mid-sized businesses'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS