Giter Site home page Giter Site logo

prat0101 / data-science-portfolio Goto Github PK

View Code? Open in Web Editor NEW
7.0 2.0 0.0 15.43 MB

Data Science Portfolio created for academic and personal projects.

Jupyter Notebook 100.00%
feature-engineering feature-scaling feature-selection gradient-boosting hyperparameter-tuning kmeans-clustering linear-regression performance-metrics python xgboost pandas sklearn

data-science-portfolio's Introduction

Data Science Projects Portfolio

1. Reduce the test bench time for vehicles

• Objective: Train the model to predict the test bench time from 378 categorical features.

• Feature engineering and Feature selection: Handled features with zero variance, handled multicollinearity and treated categorical features with a large number of categories.

• Model Training: Used Linear regression, Ridge regression, Gradient boosting, and XGBoost algorithms to train the model, and performed hyperparameter tuning to optimize performance metrics. Model used to select feature values for minimum testing time.

2.Customer Segmentation for Retail store

• Objective: Perform Customer segmentation using RFM analysis (Recency, Frequency, and Monetary value) to identify prominent customers in store.

• Exploratory data analysis (EDA): Performed Cohort analysis & built RFM segments

• Perform Clustering on RFM data: Outliers detection, selected feature scaling method, applied K means clustering algorithm on scaled data, used elbow method and Silhouette score to decide the optimum number of clusters.

• Data Visualization: Created dashboard in Tableau to show average sales in different countries, Top selling products, hourly sales, and a heatmap for RFM values.

3.Comparison of Regions based on sales (Data Visualization)

• Objective: To compare sales data between two regions using the Tableau dashboard and suggest necessary improvements to management.

• Created parameters for regions, shown the sum of sales for different products, shown variation of sales with respect to time, used maps to show states in different regions.

• Created a dashboard to compare sales characteristics of two different regions at a time.

4. Comcast Customer Project Details

• Objective: Data gives information about customers’ complaints received from different regions at different times of the year. Do the data analysis based on types of complaints, number of complaints, and region-based distribution of complaints. This will help the telecom service provider to take necessary actions for reducing the number of complaints.

• Tasks Performed: Using EDA Techniques in pandas library to for data analysis of registered complaints.

5. Marketing Mix Modeling

• Objective: To establish correlation between spend on marketing promotions and sales.

• Feature engineering and Feature selection: Log transformation, outliers’ detection, check for multicollinearity, feature scaling (Standardization).

• Model Training: Used Stats model, Linear regression, and XGBoost algorithms and performed hyperparameter tuning to optimize metrics. Generated Response curves.

• Data Visualization: Showing relation between Product, Price, Promotions and Places (4Ps) with Sales using Tableau dashboards. Optimizing spends on promotions.

data-science-portfolio's People

Contributors

prat0101 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.