Customers Default Payments

In this project I used the support vector machine to see how many customers are not in default and how many are in default

Languages

About Dataset

The dataset used for the analysis has been downloaded by Machine Learning UCI repository. This research aimed at the case of customers default payments in Taiwan and compares the predictive accuracy of probability of default among six data mining methods. this dataset is composed by 30000 rows and 24 columns:

Limit_bal: credit limit (numeric)
Sex: customer's sex (categorical)
Marriage: civil status (categorical)
Age: customer's age (numeric)
Pay_: historical past payaments from April to September 2005 (categorical)
Bill_AMT: amount statement from April to September 2005 (numeric)
Pay_amt: past payament amount (numeric)
Default: payment, response variable (categorical)

About Analysis

First step to begin the analysis is check null values and cleaning the dataset. Next step has been to balance the observations in the training and test set and I standardized the attributes then I used nested cross validation to estimate generalization error and hyper-parameter simultaneously.
In the last step, I used the Support Vector Machine to my dataset divided into training and test set.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
charts		charts
data		data
README.md		README.md
RizzitelloSVM.ipynb		RizzitelloSVM.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Customers Default Payments

About Dataset

About Analysis

About

Releases

Packages

Languages

NicolaRizzitello/Customers-default-payments

Folders and files

Latest commit

History

Repository files navigation

Customers Default Payments

About Dataset

About Analysis

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages