Giter Site home page Giter Site logo

abdul-aa / revenue-radar Goto Github PK

View Code? Open in Web Editor NEW

This project forked from mcgill-mma-enterpriseanalytics/revenue-radar

0.0 0.0 0.0 84.5 MB

License: MIT License

Jupyter Notebook 99.95% Dockerfile 0.01% Python 0.02% HTML 0.01% Roff 0.03%
docker fastapi mlflow stacking-ensemble

revenue-radar's Introduction

Revenue Radar

Revenue Radar

Team Members ED2

  • Abdulrahman Aroworamimo
  • Angel Oluwole-Rotimi
  • Tashfeen Ahmed
  • Xingchen Luo
  • Yvan Kammelu

Architecture

Architecture

Cybersecurity

In developing Revenue Radar for Productionization, we considered the potential cybersecurity threats and vulnerabilities crucial to ensuring the security and integrity of the application and its data. We propose two cybersecurity applications that can address attacks against machine learning systems and common web application vulnerabilities listed by OWASP.

WAF (Web Application Firewall) - AWS WAF or Cloudflare WAF

  • Justification:
    • Protection Against OWASP Threats: A WAF can provide rules to defend against OWASP threats like SQL Injection, XSS (Cross-Site Scripting), and broken authentication. It can filter, monitor, and block HTTP traffic to and from a web application, which helps protect against attacks that exploit application vulnerabilities.
    • Customizable Rulesets for ML Security: With WAFs, you can create custom rules that specifically protect against abnormal, malicious HTTP requests that might indicate attempts at evasion or impersonation attacks on ML models. By monitoring the nature and pattern of the requests, WAFs can help mitigate potential poisoning attacks by blocking requests that try to upload malicious data.

Team Members ED1

  • Abdulrahman Aroworamimo
  • Ammad Sohail
  • Angel Oluwole-Rotimi
  • Rodrigo Castro
  • Xingchen Luo
  • Yvan Kammelu

Dataset Overview

Our dataset contains transaction data from individual visits to the GStore between December 1st, 2018, and January 31st, 2019. Each line in our dataset represents a visit to the store.

The original dataset is available here: https://www.kaggle.com/competitions/ga-customer-revenue-prediction/data?select=train_v2.csv

Project Goals

For our project, we explore two major use cases related to customer conversion and revenue generated from conversions:

  1. Customer Conversion Analysis: To identify trends in user behavior that lead to conversions, enabling targeted marketing strategies.
  2. Revenue Optimization: To understand how conversions translate into revenue, providing insights for optimizing ad spend and marketing investments.

Our project aims to explore trends in user conversion, which can drive and improve our investment in marketing and ad spend for GStore, as well as any other retailers looking to leverage customer data to enhance their marketing spend.

Let's dive into the details of each use case for Revenue Radar.


Customer Conversion (Use Case 1)

Conversion is a key indicator to understand and optimize, as the number of people purchasing and supporting merchandise maps directly to branding goals.

Customer Conversion

Objectives

  • Determine which users to target for conversion nudges.
  • Increase understanding of the user journey and profile of converting customers.

Measuring Performance

Model Performance

  • Model performance should be measured with F1 Score for an appropriate balance of precision and recall.

Initiative Performance

  • The model must outperform the current rule-based process to be piloted.

Explainable models must be explored throughout the data science process.

Data Transformations

To facilitate user-centric analysis, the dataset was transformed from session level to user level.

  • Page Views -> First Session Page Views, Last Session Page Views
  • Device Category -> Number of visits by desktop, mobile, tablet

Columns were excluded based on proportions of missing data and SME (Subject Matter Expert) consultation.

See the code file in the respective labeled folder for more details on preprocessing.


Transaction Revenue (Use Case 2)

Optimizing revenue is critical for sustaining operations and funding the product innovation required for competitiveness in merchandising.

Transaction Revenue

Objectives

  • Identify which customers are likely to spend more at GStore, focusing on understanding their spending behaviors and patterns.
  • Determine the key factors contributing to higher customer spending, enabling targeted marketing strategies and product innovation.

Measuring Performance

Model Performance

  • Adopted Mean Absolute Error (MAE) as our primary metric to evaluate the accuracy of the regression model in predicting customer spending.

Data Transformations

To facilitate transaction-level analysis, the dataset was filtered for sessions with transaction revenue == 0.

Columns were excluded based on proportions of missing data and use case relevance.

See the code file in the respective labeled folder for more details on preprocessing.


Data Dictionary

Originally stored as a JSON file, after transformation, these were the columns available in the dataset:

  • channelGrouping
  • date
  • fullVisitorId
  • sessionId
  • socialEngagementType
  • visitId
  • visitNumber
  • visitStartTime
  • continent
  • subContinent
  • country
  • region
  • metro
  • city
  • cityId
  • networkDomain
  • latitude
  • longitude
  • networkLocation
  • browser
  • browserVersion
  • browserSize
  • operatingSystem
  • operatingSystemVersion
  • isMobile
  • mobileDeviceBranding
  • mobileDeviceModel
  • mobileInputSelector
  • mobileDeviceInfo
  • mobileDeviceMarketingName
  • flashVersion
  • language
  • screenColors
  • screenResolution
  • deviceCategory
  • visits
  • hits
  • pageviews
  • bounces
  • newVisits
  • transactionRevenue
  • campaign
  • source
  • medium
  • keyword
  • adwordsClickInfo.criteriaParameters
  • isTrueDirect
  • referralPath
  • adwordsClickInfo.page
  • adwordsClickInfo.slot
  • adwordsClickInfo.gclId
  • adwordsClickInfo.adNetworkType
  • adwordsClickInfo.isVideoAd
  • adContent
  • campaignCode

revenue-radar's People

Contributors

abdul-aa avatar ammadsohail99 avatar aoluwolerotimi avatar rodrigo20acc avatar tashfeenahmed12 avatar tiger7789 avatar ykamm avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.