Journal of Credit Risk

The effect of training set selection when predicting defaulting small and medium-sized enterprises with unbalanced data

Giovanna Menardi and Nicola Torelli


We focus on classification methods to separate defaulting small and medium sized enterprises from nondefaulting ones. In this framework, a typical problem occurs because the proportion of defaulting firms is very close to zero, leading to a class imbalance. Moreover, a form of bias may affect the classification because models are often estimated on samples of large corporations that are not randomly selected. We investigate how different criteria of sample selection may affect the accuracy of the classification and how this problem is strongly related to class imbalance.

Sorry, our subscription options are not loading right now

Please try again later. Get in touch with our customer services team if this issue persists.

New to View our subscription options

You need to sign in to use this feature. If you don’t have a account, please register for a trial.

Sign in
You are currently on corporate access.

To use this feature you will need an individual account. If you have one already please sign in.

Sign in.

Alternatively you can request an individual account here