Giter Site home page Giter Site logo

Hi There!πŸ‘‹

About Me

I am Nitin Joseph. I am a self-taught ML, NLP (Natural Language programming) & python developer focusing on Marketing, Ecommerce, Retail & Analytics with deep skills in SEO, Content strategy & Martech. I can be contacted at

[email protected]

I have degrees in Quantitative disciplines and Liberal arts. I embrace STEAM mindset πŸ™Œ

One thing led to another when 5 years ago, I started having some fun with Python; as Feynman said if you want to master something, teach it; I volunteered to teach in an online boot camp for Afghan students. If any, lessons were mine to learn in dedication, application, and hope. Important sentiment where the world found itself in those times.

Measuring the impact of Digital media is often a straight shoot, but OOH marketing initiatives with barely any digital trail are painful ways of torturing data and yourself. Curious about a client's marketing initiative, I used Multiple Linear Regression seeking coefficients to explain various initiatives' impact. Minitab & SPSS (statistical software) are easy-to-use statistical tools, but I was soon faced with questions such as relaxing the assumptions, regularisation, and assumption of normality of data.

Then things converged, Statistics, coding & domain expertise. Borderline alchemy and a powerful alembic

Insights are cute, out-of-sights are sexy!

Buzz around ML generally veers from one extreme to another, a magic formula; throw at it a bunch of data and hope it gives you what you want. The other reduces ML models to their outcomes - prediction & classification. Later a good summarization but misses the essence.

Accuracy of Models to predict and classify are no doubt important, but if you are like me, who values questions as much as answers, the journey of getting there is paved with a path of insights and out-of-sights.

Marketing Science Projects

In respecting privacy & client confidentiality, I have used public datasets.

1. Understand why customers churn & then predict who will.

We use Random Forest Classifier to grade factors that have the most impact on churn rate and use the model to predict customers likely to churn. Checkout the Project

2. Facebook Prophet to forecast Demand & quantify the affect of seasonality, weekends and holidays

Prophet’s advantage are β€” an analyst-in-the-loop forecaster with human-interpretable parameters, easily applied to business analytics use cases. It's however getting long in the tooth and more recent developments with models like Neural Prophet & Nixtla/statsforecast are more appealing by the day. Checkout the Project

3. Understand consumer preference with Conjoint analysis

Which involves analyzing user preference survey data, applying methods to determine how users weigh each attribute and using data to predict and map out what consumers like and dislike about a product. Checkout the Project

4. E-commerce Market Basket Analysis to make a product recommendation & calculate CLV (Customer Lifetime Value)

  • Utilize two popular algorithms to create product recommendations and perform market basket analysis. I use collaborative filtering and a priori algorithm and implement both to generate product recommendations using first-party sales data. Checkout the Project

  • Perform RFM (Recency, Frequency & Monetary Value) analysis and calculate CLV using BG|NBD Model. Checkout the Project

  • Use unsupervised ML methods to segment customers into groups and find their key characteristics as it relates to business. This can then be used downstream in business and marketing process to better serve customers and improve outcomes for business and customers alike. Checkout the Project

5. Boost Campaign Conversion Rate with Decision Tree

Leverage Decision tree to improve to identify a subset of users with a higher likelihood of conversion works with digital and non-digital datasets. Checkout the Project

6. Marketing Modeling Mix with Facebook Robyn

This measurement model provides a big-picture view of all marketing efforts. This allows us to understand the incrementality of the entire cluster of marketing spend (Online, offline, brand, performance, email, town criers, whatever we spend money on). Oh, and we can then split by channel and get incremental Sales, CPS, google, TV, Meta, Print, and everything else.

7. Measuring Incrementality with Geo Lift?

Difficult problem to solve but thanks to Bayesian statistic this helps us get a grip on incrementality! Helps identify the results the company would NOT have received but for Marketing. Owned. Earned. Paid. Read why measuring incrementality is important Think with Google Checkout the Project

NLP For SEO, Content Optimization & Customer insight

1. Content Similarity checker.

Most content similarity checkers available online use TextRank/Frequency checker or some levenshtein distance to measure the distance between two strings. In my project, I utilize Goose3 to scrape all URLs that are inputted as a list and provide similarity scores from BERT using Hugging Face transformer. BERT measures semantic relationship. Checkout the Project

2. Auto Summarizer with Google T5.

This is a project that I built to provide a robust summarizer for Meta descriptions using Google T5. Plugin in your Url's and it will output meta descriptions. Google T5 is a generative summarizer. End project use was to auto summarize Meta description and consumer reviews. Checkout the Project

Nitin Joseph's Projects

building-ecommerce-rfm-model-with-segementation icon building-ecommerce-rfm-model-with-segementation

The β€œRFM” in RFM analysis stands for recency, frequency and monetary value. RFM analysis is a way to use data based on existing customer behavior to predict how a new customer is likely to act in the future.

data-driven-customer-segementation icon data-driven-customer-segementation

For any marketing campaign, it's critical to understand different behaviours, types and interests of Customers. Especially in targeted marketing, understanding and categorizing customers is an essential step for effective marketing strategies.

data-prepration-for-ml icon data-prepration-for-ml

Most machine learning algorithms require data to be formatted in a very specific way, so datasets generally require preparation before they can yield useful insights. This repository is to document my study notes as I work through steps that I have personally found most challenging.

feature-selection-exploratory-data-analysis icon feature-selection-exploratory-data-analysis

Effort here is to identify important features in relation to our target variable using multiple Correlation methods based on data type. Feature selection is important for ML models to avoid 'curse of dimensionality' but for this dataset we will be using it build our intution that benefits our later EDA effort

google-causal-impact-inferring-the-effect-of-an-event icon google-causal-impact-inferring-the-effect-of-an-event

The Causal Impact model lets you examine ecommerce and marketing time series data to understand whether changes have led to a statistically significant performance improvement. Here's how to use PyCausalImpact to analyse changes in marketing activity or in this case on Boeing stock price

improving-conversion-rate-with-decision-trees icon improving-conversion-rate-with-decision-trees

Use Decison Tree on Bank Marketing Dataset to Identify Customer attributes that are drivers of Conversion. We will then interpret these trained decison tree models by visualizing them using the graphviz package in python.

logistic-regression icon logistic-regression

Repo contains my personal Machine Learning projects with emphasis on explanability and Insights that are relevant for various stake holders in a business.

marketing-mix-modeling-mmm-using-robyn icon marketing-mix-modeling-mmm-using-robyn

A New GeneratUsing Robyn aims to reduce human bias in the modeling process, esp. by automating modelers decisions like adstocking, saturation, trend & seasonality as well as model validation. Moreover, the budget allocator & calibration enable actionability and causality of the results

multilabel-segementation-using-xgboost icon multilabel-segementation-using-xgboost

For this Multi-class classification dataset we will be using Xgboost, multi:softprob objective which is a standard alternative to binary:logistic when the dataset includes multiple classes. It computes the probabilities of classification and chooses the highest one.

predicting-customer-lifetime-value icon predicting-customer-lifetime-value

In Marketing, the CLV is one of the key metrics to have and monitor. The CLV measures customer's total worth to the business over the course of their lifetime relationship to a business. This metric is especially important to keep track of for acquiring new customers.

predicting-defaulters-using-artificial-neural-network icon predicting-defaulters-using-artificial-neural-network

ANN or Deep Learning can be utilized for many areas of marketing. Using Neural network models by BrainMaker, Microsoft increased its direct mail response rate my 4.9% to 8.2%. This helped Microsoft bring same amount of revenue for 35% les cost.

scrapy-crawl-spider icon scrapy-crawl-spider

Project using Scrapy spider with Rules that crawls Website/Pages of Interest and extracts Product info, pagination is dealt by configuring Link Extractor method

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.