Skip to the content.

Bank Loan Default Risk Analysis

Project Objective

With this case study, we aims to understand the strong driving factors behind Loan Defaults.

A loan lending company can use this case study to ensure that the consumers capable of repaying the loan are not rejected, and certain adaptable actions can be taken on client to basis if they are identified to face in difficulty paying their installments in future.

The result of this Risk analysis would help the bank to identify the patterns, which indicate if a client has difficulty paying their installments.

This can further influence the decisions such as denying the loan, reducing the amount of loan, lending (to risky applicants) at a higher interest rate, etc.

A Loan Lending Company can utilize this case adjective for its portfolio and risk assessment.

Business Understanding:

The primary business of any bank revolves around managing the spread between the deposits. In other words, when the interest that a bank earns from loans is greater than the interest it pays on deposits, it generates income from the interest rate spread. image

"The only good loan is one that gets paid back."

Clearly, the major part of revenues of any bank is attained through the loans they give to the people. But there are chances that the loans may not be paid back by few of the customers, making it a bad loan.

Business Scenario:

When a customer applies for a loan, there are four types of decisions that could be taken by the customer/company:

1. Approved: The Company approved the loan Application.

2. Cancelled: The client cancelled the application sometime during approval.

3. Refused: The company rejected the loan.

4. Unused offer: Loan has been cancelled by the client but on different stages of the process.

Business Profitability:

The insufficient or non-existent credit history of an Urban Customer puts the bank lending company in position of dilemma about lending the loan. This dilemma revolves around the likelihood that a customer would pay back the loan or not, and can potentially result in 2 types of loss:

1. Credit Loss: If an applicant is not likely to repay the loan, then approving the loan may lead to a financial loss for the company.

2. Interest Loss: If an applicant is likely to repay the loan, then not approving the loan results in a loss of business to the company.

image

Approach:

Dataset Information and overall data information check

We used two set of Datasets,

  1. application_data.csv, containing all the information of the client at the time of application. The data is about whether a client has payment difficulties. This further includes Data related to applicant’s socio economic status.
  2. previous_application.csv contains information about the client’s previous loan data. It includes information that whether the previous application had been Approved, Cancelled, Refused or Unused offer.

Data Cleansing

Next, we inititated the Data Cleansing process which included Null Value identification and strategic data modification, Data Type Conversion, and Outlier Treatment recommendation. After we were confident of the data we had in hand, we wanted to analyse few numerical data after Binning them in acceptable intervals.

Exploratory Data Analysis

We initiated the EDA process with a sole study of application dataset. It included Univariate Analysis , Bivariate Analysis , Correlation Heat Map , Identification of top 10 most Correlated Variables.

With this we targeted to identify the pattern and data of applications.

Next, we wanted to do a Comparative Study between the current and previous application dataset. We started with merging both the dataset, and then analysed the variations.

Throughout the study, Defaulter collumn (variable) is used as target variable.

Insights

image The ratio of Non-Defaulters to Defaulters is very high, giving Imbalance ratio of = .0878, because of this we used log scale to plot the curve to better understand the variations.

Final Recommendations to Loan Lending Company

Likely Defaulter

The Below listed Variables are potential identifier for a customer to be a Likely Defaulter

Likely Non-Defaulter

The Below listed Variables are potential identifier for a customer to be a Likely Non-Defaulter