Challenge: What is the future selling price of a home?

This challenge, adapted from Kaggle, asks us to predict the selling prices of a set of houses. We are given a training set with known selling prices, and each house is described by around 79 characteristics (total area, building materials, number of floors, ...).

Our approach

We studied the training dataset in depth before attempting any prediction. This analysis aimed to understand the statistics and correlations of the features, and to determine what feature engineering and transformations were needed before the data could be used for prediction. We spent a large share of our time on data analysis and feature engineering.

We use regression models for prediction. We compare different regularizations and evaluate their performance using cross-validation. We also ensemble the models with stacked generalization to reduce the overfitting of the individual regressors.

Steps:

Descriptive statistics about the data
- Skewness analysis
- Feature splitting: categorical/ordinal/numerical, with statistics for each group
- Feature correlation
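
As an illustration of this step, the following is a minimal sketch on synthetic data, not the notebook's actual code. The column names (`LotArea`, `YearBuilt`, `Quality`, `Neighborhood`, `SalePrice`) and the generated values are assumptions made for the example.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in for the training set; all column names are hypothetical.
rng = np.random.default_rng(0)
n = 200
df = pd.DataFrame({
    "LotArea": rng.lognormal(mean=9.0, sigma=0.5, size=n),    # right-skewed numerical feature
    "YearBuilt": rng.integers(1950, 2010, size=n),            # numerical feature
    "Quality": rng.choice(["Fa", "TA", "Gd", "Ex"], size=n),  # ordinal feature stored as text
    "Neighborhood": rng.choice(["A", "B", "C"], size=n),      # categorical feature
})
df["SalePrice"] = 20 * df["LotArea"] ** 0.5 * rng.lognormal(0.0, 0.2, size=n)

# Split the features by type. Ordinal columns are listed by hand, because
# pandas cannot tell them apart from unordered categories.
ordinal_cols = ["Quality"]
numerical_cols = df.select_dtypes(include="number").columns.drop("SalePrice").tolist()
categorical_cols = [c for c in df.columns
                    if c not in numerical_cols + ordinal_cols + ["SalePrice"]]

# Skewness of the numerical columns, before and after a log transform.
skew_raw = df[numerical_cols + ["SalePrice"]].skew()
skew_log = np.log1p(df[numerical_cols + ["SalePrice"]]).skew()

# Pearson correlation of each numerical feature with the target.
corr_with_target = df[numerical_cols].corrwith(df["SalePrice"])

print(skew_raw, skew_log, corr_with_target, sep="\n\n")
```

Comparing `skew_raw` with `skew_log` is one way to decide which columns might benefit from a log transform before fitting a linear model.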
Data cleaning, pre-processing, and feature engineering
- Detection of missing data, outliers, and invalid data
- Outlier removal
- Feature encoding
- Construction of new features
- Feature normalization and rescaling
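
The sketch below shows one possible way to chain these operations with scikit-learn. It is an assumption-laden example rather than the notebook's pipeline: the toy columns, the outlier threshold of 4000, the `TotalArea` feature, and the quality ordering `Fa < TA < Gd < Ex` are all hypothetical.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, OrdinalEncoder, StandardScaler

# Tiny hypothetical sample; column names and values are illustrative only.
df = pd.DataFrame({
    "GroundArea": [850.0, 1200.0, 950.0, 1500.0, 9000.0, 1100.0],
    "BasementArea": [400.0, 0.0, 600.0, 700.0, 800.0, 500.0],
    "Frontage": [60.0, np.nan, 55.0, 80.0, 70.0, np.nan],
    "Quality": ["TA", "Gd", "Fa", np.nan, "Ex", "TA"],
    "Neighborhood": ["A", "B", "A", "C", "B", np.nan],
})

# Outlier removal: drop rows with an implausibly large ground area.
# The threshold is an arbitrary choice for this toy data.
df = df[df["GroundArea"] <= 4000].reset_index(drop=True)

# New feature: total area as the sum of two existing area columns.
df["TotalArea"] = df["GroundArea"] + df["BasementArea"]

# Numerical columns: fill gaps with the median, then standardize.
numeric = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("scale", StandardScaler()),
])
# Ordinal columns: fill gaps with the mode, then map to ordered integers.
ordinal = Pipeline([
    ("impute", SimpleImputer(strategy="most_frequent")),
    ("encode", OrdinalEncoder(categories=[["Fa", "TA", "Gd", "Ex"]])),
])
# Unordered categorical columns: fill gaps with the mode, then one-hot encode.
categorical = Pipeline([
    ("impute", SimpleImputer(strategy="most_frequent")),
    ("encode", OneHotEncoder(handle_unknown="ignore")),
])

preprocess = ColumnTransformer(
    [
        ("num", numeric, ["GroundArea", "BasementArea", "Frontage", "TotalArea"]),
        ("ord", ordinal, ["Quality"]),
        ("cat", categorical, ["Neighborhood"]),
    ],
    sparse_threshold=0,  # always return a dense array
)

X = preprocess.fit_transform(df)
print(X.shape)
```

Wrapping the steps in a `ColumnTransformer` means the imputation and scaling statistics are learned from the training folds only when the whole pipeline is cross-validated, which avoids leaking information from validation data.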
Train and evaluate baseline models
- Explore the effects of different regularizations: apply Lasso, Ridge, and ElasticNet.
- Use a grid search to find the best hyperparameters.
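
A minimal sketch of this comparison, assuming scikit-learn and a synthetic regression problem in place of the house data. The `alpha` and `l1_ratio` grids and the RMSE scoring are illustrative choices, not the values used in the notebook.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import ElasticNet, Lasso, Ridge
from sklearn.model_selection import GridSearchCV, KFold

# Synthetic data: 30 features, of which only 8 carry signal.
X, y = make_regression(n_samples=200, n_features=30, n_informative=8,
                       noise=10.0, random_state=0)

cv = KFold(n_splits=5, shuffle=True, random_state=0)
alphas = np.logspace(-3, 2, 6)  # regularization strengths from 0.001 to 100

# One (estimator, parameter grid) pair per regularization type.
candidates = {
    "ridge": (Ridge(), {"alpha": alphas}),
    "lasso": (Lasso(max_iter=10_000), {"alpha": alphas}),
    "elasticnet": (ElasticNet(max_iter=10_000),
                   {"alpha": alphas, "l1_ratio": [0.2, 0.5, 0.8]}),
}

results = {}
for name, (model, grid) in candidates.items():
    search = GridSearchCV(model, grid, cv=cv,
                          scoring="neg_root_mean_squared_error")
    search.fit(X, y)
    # The scorer is negated RMSE, so flip the sign back for reporting.
    results[name] = (search.best_params_, -search.best_score_)
    print(name, search.best_params_, round(-search.best_score_, 3))
```

The features produced by `make_regression` are already on a common scale. With real data, the penalized models should be fitted on rescaled features, since the penalty treats all coefficients alike.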
Stack models to build the final model
- XGBoost
- Stacked generalization
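
The following sketch illustrates stacked generalization with scikit-learn's `StackingRegressor`. To keep the example free of extra dependencies, `GradientBoostingRegressor` is swapped in for XGBoost; the base models, their hyperparameters, the Ridge meta-learner, and the synthetic Friedman dataset are all assumptions made for the example.

```python
import numpy as np
from sklearn.datasets import make_friedman1
from sklearn.ensemble import GradientBoostingRegressor, StackingRegressor
from sklearn.linear_model import Lasso, Ridge
from sklearn.model_selection import KFold, cross_val_score

# Synthetic nonlinear regression problem standing in for the house data.
X, y = make_friedman1(n_samples=300, n_features=10, noise=1.0, random_state=0)

# Base learners: two regularized linear models and one boosted-tree model
# (a stand-in for XGBoost).
base_models = [
    ("ridge", Ridge(alpha=1.0)),
    ("lasso", Lasso(alpha=0.1, max_iter=10_000)),
    ("gbr", GradientBoostingRegressor(random_state=0)),
]

# The meta-learner is trained on out-of-fold predictions of the base models,
# so it never sees predictions made on data the base models were fitted on.
stack = StackingRegressor(estimators=base_models,
                          final_estimator=Ridge(alpha=1.0), cv=5)

outer_cv = KFold(n_splits=5, shuffle=True, random_state=0)

def cv_rmse(model):
    """Mean cross-validated RMSE of a model on (X, y)."""
    scores = cross_val_score(model, X, y, cv=outer_cv,
                             scoring="neg_root_mean_squared_error")
    return -scores.mean()

ridge_rmse = cv_rmse(Ridge(alpha=1.0))
stack_rmse = cv_rmse(stack)
print("ridge RMSE:", round(ridge_rmse, 3))
print("stack RMSE:", round(stack_rmse, 3))
```

Scoring the stack with an outer cross-validation loop, separate from the internal `cv=5` used to build the meta-features, gives an estimate of how the whole ensemble generalizes.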
Validate the outcome of the model
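
One possible validation scheme is sketched below: hold out part of the training data, fit on the log of the price, and report RMSE on the log scale. The README does not state which metric or split was used, so the 75/25 split, the log-scale RMSE, the Ridge regressor, and the synthetic data are assumptions.

```python
import numpy as np
from sklearn.compose import TransformedTargetRegressor
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

# Synthetic prices whose logarithm is linear in the features plus noise.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 5))
log_price = (12.0 + X @ np.array([0.3, 0.2, 0.0, -0.1, 0.05])
             + rng.normal(scale=0.1, size=300))
y = np.expm1(log_price)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

# Fit on log1p(price) and map predictions back to the price scale.
model = TransformedTargetRegressor(regressor=Ridge(alpha=1.0),
                                   func=np.log1p, inverse_func=np.expm1)
model.fit(X_train, y_train)
pred = model.predict(X_test)

# RMSE between the logs of predicted and observed prices.
rmse_log = np.sqrt(mean_squared_error(np.log1p(y_test), np.log1p(pred)))

# Baseline: always predict the mean log price of the training set.
baseline_pred = np.full(len(y_test), np.log1p(y_train).mean())
baseline_rmse_log = np.sqrt(mean_squared_error(np.log1p(y_test), baseline_pred))

print("model RMSE (log scale):   ", round(rmse_log, 4))
print("baseline RMSE (log scale):", round(baseline_rmse_log, 4))
```

Measuring the error on the log scale penalizes relative rather than absolute mistakes, so an error on an expensive house does not dominate the score.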

JZ-LIANG/Regression-for-House-Prices-Prediction
