hcm-house's Introduction

HO CHI MINH CITY HOUSE PRICE PREDICTION PROJECT

Introduction

Preview

(image: web_layout)

Workflow

0. Structures

├───assets  (containing files for web layout design)
│       style.css
├───data chunk (containing separated data for each district)
│
├───data  (containing data for processing)
│
├── app.py        
├── crawl_data.ipynb
├── eda_cleaning.ipynb
├── feature_engineering_selection.ipynb
├── model.ipynb
├── final_model.sav
├── Procfile           
└── requirements.txt

1. Data Scraping

2. Data Cleaning

  • Drop duplicate values
  • Extract and create new information from the text description of each house
  • Correct wrong prices and other wrong numeric values in observations
  • Correct missing values
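A minimal sketch of the first two cleaning steps, using only the standard library. The record layout, the field names (`id`, `description`, `price_billion`) and the text patterns are assumptions made for illustration; the real scraped fields are not shown in this README.

```python
import re

# Hypothetical listing records (assumed layout, not the project's real data).
listings = [
    {"id": 1, "description": "Nha 3 phong ngu, dien tich 65 m2, gan cho", "price_billion": 4.2},
    {"id": 1, "description": "Nha 3 phong ngu, dien tich 65 m2, gan cho", "price_billion": 4.2},
    {"id": 2, "description": "Ban nha 2 phong ngu, 48.5m2", "price_billion": None},
]

# Assumed patterns: "<n> phong ngu" for bedrooms, "<x> m2" for floor area.
BEDROOM_RE = re.compile(r"(\d+)\s*phong ngu", re.IGNORECASE)
AREA_RE = re.compile(r"(\d+(?:\.\d+)?)\s*m2", re.IGNORECASE)


def drop_duplicates(rows):
    """Keep the first occurrence of each (id, description) pair."""
    seen = set()
    unique = []
    for row in rows:
        key = (row["id"], row["description"])
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def extract_features(row):
    """Return a copy of the row with bedrooms and area parsed from the text."""
    text = row["description"]
    bedrooms = BEDROOM_RE.search(text)
    area = AREA_RE.search(text)
    return {
        **row,
        "bedrooms": int(bedrooms.group(1)) if bedrooms else None,
        "area_m2": float(area.group(1)) if area else None,
    }


cleaned = [extract_features(row) for row in drop_duplicates(listings)]
```

Fields that cannot be parsed are left as `None` so that they can be handled together with the other missing values.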

3. Exploratory Data Analysis

  • Examine missing values
  • Analyze numerical variables and their distribution
  • Analyze categorical variables and their cardinality
  • Detect outliers
  • Analyze the relationship between each house feature and the house price
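The checks above could be sketched with pandas roughly as follows. The sample frame and its column names are hypothetical, and the 1.5 × IQR rule is one common outlier heuristic, not necessarily the one used in `eda_cleaning.ipynb`.

```python
import numpy as np
import pandas as pd

# Hypothetical sample; column names are assumptions for illustration.
df = pd.DataFrame({
    "area_m2": [40.0, 55.0, 60.0, 52.0, 48.0, 900.0],
    "bedrooms": [2, 3, 3, np.nan, 2, 4],
    "district": ["1", "3", "3", "7", "7", "1"],
    "price_billion": [3.1, 4.5, 5.0, 4.0, 3.6, 5.2],
})

# Share of missing values per column.
missing_share = df.isna().mean()

# Cardinality of a categorical variable.
district_cardinality = df["district"].nunique()


def iqr_outlier_mask(series, k=1.5):
    """Flag values further than k * IQR outside the quartiles."""
    q1, q3 = series.quantile([0.25, 0.75])
    iqr = q3 - q1
    return (series < q1 - k * iqr) | (series > q3 + k * iqr)


area_outliers = iqr_outlier_mask(df["area_m2"])

# Linear correlation of each numeric feature with the price.
corr_with_price = df[["area_m2", "bedrooms", "price_billion"]].corr()["price_billion"]
```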

4. Feature Engineering

  • Remove outliers
  • Complete missing values
  • Transform numerical variables to reduce the skew in their distributions
  • Encode categorical variables for model building
  • Create a new feature from each listing's title
  • Oversample the data
  • Apply clustering and PCA
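Three of these steps (imputation, skew reduction and categorical encoding) might look like the sketch below. Median imputation, `log1p` and one-hot encoding are assumed choices, and the column names are hypothetical; the README does not say which transforms or encoders the notebook uses. Oversampling, clustering and PCA are not covered here.

```python
import numpy as np
import pandas as pd

# Hypothetical sample; column names are assumptions for illustration.
df = pd.DataFrame({
    "area_m2": [40.0, 55.0, np.nan, 52.0],
    "district": ["1", "3", "3", "7"],
    "price_billion": [3.1, 4.5, 5.0, 40.0],
})

# Complete missing values with the column median (robust to extreme values).
df["area_m2"] = df["area_m2"].fillna(df["area_m2"].median())

# Compress the long right tail of the price; np.expm1 inverts this later.
df["log_price"] = np.log1p(df["price_billion"])

# One-hot encode the categorical column.
encoded = pd.get_dummies(df, columns=["district"], prefix="district")
```

Predictions made on the log scale need to be passed through `np.expm1` to get back to the original price unit.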

5. Feature Selection

  • Drop redundant features
  • Remove highly correlated features
  • Examine feature importance
  • Remove anomalous observations
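As an illustration of removing highly correlated features, the sketch below drops one feature from every pair whose absolute Pearson correlation exceeds a threshold. The helper `drop_highly_correlated`, the 0.9 threshold and the synthetic columns are all assumptions, not taken from the project.

```python
import numpy as np
import pandas as pd


def drop_highly_correlated(features, threshold=0.9):
    """Drop the later column of each pair with |correlation| above threshold."""
    corr = features.corr().abs()
    # Keep only the strict upper triangle so each pair is inspected once.
    upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))
    to_drop = [col for col in upper.columns if (upper[col] > threshold).any()]
    return features.drop(columns=to_drop), to_drop


# Synthetic data: area_sqft is just a rescaled copy of area_m2.
rng = np.random.default_rng(0)
area = rng.uniform(30, 120, size=200)
features = pd.DataFrame({
    "area_m2": area,
    "area_sqft": area * 10.764,
    "bedrooms": rng.integers(1, 6, size=200),
})

reduced, dropped = drop_highly_correlated(features)
```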

6. Model Building and Tuning

  • Perform K-fold cross validation
  • Train Random Forest, XGBoost and LightGBM models on the training data
  • Run RandomizedSearchCV to tune hyperparameters and improve the score
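A rough sketch of this workflow with scikit-learn, shown for a Random Forest only. The synthetic data, the R² scoring and the parameter grid are assumptions for illustration. The XGBoost and LightGBM regressors expose a scikit-learn compatible interface, so they could presumably be swapped into the same search.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import KFold, RandomizedSearchCV, cross_val_score

# Synthetic regression data standing in for the engineered features.
rng = np.random.default_rng(42)
X = rng.uniform(0, 1, size=(120, 4))
y = 3.0 * X[:, 0] + 2.0 * X[:, 1] + rng.normal(0, 0.1, size=120)

# K-fold cross-validation of an untuned baseline model.
cv = KFold(n_splits=5, shuffle=True, random_state=42)
baseline = RandomForestRegressor(n_estimators=50, random_state=42)
baseline_scores = cross_val_score(baseline, X, y, cv=cv, scoring="r2")

# Randomized search over a small, assumed parameter space.
param_distributions = {
    "n_estimators": [50, 100],
    "max_depth": [3, 5, None],
    "min_samples_leaf": [1, 2, 4],
}
search = RandomizedSearchCV(
    RandomForestRegressor(random_state=42),
    param_distributions=param_distributions,
    n_iter=5,
    cv=cv,
    scoring="r2",
    random_state=42,
)
search.fit(X, y)

print("baseline mean R2:", baseline_scores.mean())
print("best params:", search.best_params_)
```

Reusing the same `KFold` object for the baseline and the search keeps the two scores comparable, since both are evaluated on identical splits.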

7. Web App Deployment

  • For deployment, I use Dash to design and render the web app, and Heroku to host it.

Conclusion

  • This project aims to help people estimate a selling price for their real estate and judge whether a house they intend to buy is being sold at a reasonable price. Above all, though, the main purpose of this project is to have fun playing with machine learning.

hcm-house's People

Contributors

123olala, dependabot[bot]
