Journal of Risk Model Validation

Risk.net

An end-to-end deep learning approach to credit scoring using CNN + XGBoost on transaction data

Lars Ole Hjelkrem, Petter Eilif de Lange and Erik Nesset

  • Open Banking APIs can prove to be an important data source for banks when assessing the creditworthiness of potential customers using application credit score models and have the potential to increase the profitability of banks when recruiting new customers.
  • We find that traditional regression models perform poorly, while machine learning methods can generate credit score models with satisfactory performance based on transaction data solely from the last 90-days before the score date.
  • The best-performing machine learning models are based on an end-to-end deep learning approach, where the machine learning algorithms create the explanatory variables based on non-aggregated data.
  • This result is in accordance with experiments in other scientific fields where deep learning has replaced shallow learning as state-of-the- art.

The performance of credit scoring models is closely linked to a bank’s profitability. Application scoring models for potential customers usually perform worse than models for existing customers. This is due to the lender not having access to the financial behavioral data of potential customers. Access to such data about potential customers could therefore increase a bank’s profitability. Open banking application programming interfaces (APIs) provide access to 90 days of historical data on potential customers’ balances and transactions. We examine the performance of credit scoring models developed using such data from a Norwegian bank. We find that traditional regression models perform poorly, while machine learning (ML) methods can provide models that perform satisfactorily based on these data alone. Further, we find that the best performing models are based on an end-to-end deep learning approach, where machine learning algorithms create explanatory variables based on non-aggregated data. These results indicate that data available through the open banking APIs can be an important data source when banks assess the creditworthiness of potential customers. In combination with end-to-end deep learning methods they have the potential to increase the performance of a bank’s application credit scoring models and thus increase the bank’s profitability.

Sorry, our subscription options are not loading right now

Please try again later. Get in touch with our customer services team if this issue persists.

New to Risk.net? View our subscription options

You need to sign in to use this feature. If you don’t have a Risk.net account, please register for a trial.

Sign in
You are currently on corporate access.

To use this feature you will need an individual account. If you have one already please sign in.

Sign in.

Alternatively you can request an individual account here