Giter Site home page Giter Site logo

Mrinal Gupta's Projects

aipnd icon aipnd

Code and associated files for the AI Programming with Python Nanodegree Program

credit-card-application-fraud-detection-using-supervised-machine-learning-models icon credit-card-application-fraud-detection-using-supervised-machine-learning-models

The provided dataset contained application (identity) fraud cases. It was a supervised problem as the data included a column showing the application’s fraud label (whether an application was fraudulent or not). It also contained several identifying data fields about the applicant such as SSN, address, phone number, etc. The dataset had 1,000,000 records and 10 data fields. We first described and visualized each of the 10 data fields and treated all frivolous values. Then we created 634 candidate variables and performed feature selection to reduce them to 30. Finally, we used a few different machine learning algorithms (both linear and nonlinear) to predict fraudulent applications records.

credit-card-transaction-fraud-detection-using-supervised-machine-learning-with-an-imbalanced-dataset icon credit-card-transaction-fraud-detection-using-supervised-machine-learning-with-an-imbalanced-dataset

Credit card fraud is a burden for organizations across the globe. Specifically, $24.26 billion were lost due to credit card fraud worldwide in 2018, according to shiftprocessing.com. In this project, our goal was to build an effective and efficient model to predict fraud. We analyzed a real-world dataset that contained a list of government related credit card transactions over the 2010 calendar year. The data presented a supervised problem as it included a column showing the transaction’s fraud label (whether a transaction was fraudulent or not). It also contained identifying information about each transaction such as the credit card number, merchant, merchant state, etc. The dataset had 96,753 records and 10 data fields. We first described and visualized each of the 10 data fields, cleaned the dataset, and filled in missing values. Then we created many variables and performed feature selection. Finally, we created a variety of machine learning models (both linear and nonlinear) and highlighted our results.

disaster-response-pipeline-using-natural-language-processing icon disaster-response-pipeline-using-natural-language-processing

This project contains a web app that asks for a message from a potential user who is in danger during a disaster and the app categorizes that message into a particular category such as aid related, weather-related, fire or many more using natural language processing and AdaBoost classifier.

dog-breed-classifier-using-cnn icon dog-breed-classifier-using-cnn

This repo contains a Dog breed classifier algorithm using deep learning. The main functions of this algorithm are that if a dog is detected in the image, it will provide an estimate of the dog's breed. If a human is detected, it will provide an estimate of the dog breed that is most resembling.

etl-pipelines icon etl-pipelines

Contains Machine learning pipeline and ETL pipeline notebooks in order to practice and learn their working.

inf-552 icon inf-552

HW submission for INF - 552 (Machine Learning for Data Science)

job_salary_prediction icon job_salary_prediction

A web application that predicts the salary of a job posting from job description and location using machine learning algorithms.

my-first-python-package icon my-first-python-package

This repo contains the files of a python package developed by me which automates the task of getting basic insights from data.

nyc-property-tax-record-fraud-detection-using-unsupervised-learning-models icon nyc-property-tax-record-fraud-detection-using-unsupervised-learning-models

This project analyzes New York City’s (NYC’s) real estate data to specifically identify property tax fraud. The main indicators of property tax fraud were property tax assessments that were too high or too low. Given a property dataset of 1,070,994 records and 32 data fields, we first described, visualized, and filled in missing values for each variable. Second, 45 additional variables were created in order to create the most accurate algorithm. Next, we used dimensionality reduction techniques to refine our dataset. Finally, we used (principal component analysis (PCA) and an autoencoder) to obtain two separate fraud scores. The scores were combined and then ranked to get a final fraud score.

recommendation-engine-with-ibm icon recommendation-engine-with-ibm

This repo contains my first hands-on experience in developing a Recommendation engine using IBM Watson Studio dataset. The goal is to recommend the articles to the user using varius types of Recommendation engines that I studied while pursuing my Data Science Nanodegree from Udacity.

salary-predictor icon salary-predictor

Deep learning model using NLP to predict job salary based on Indeed job postings

size-limit icon size-limit

Calculate the real cost to run your JS app or lib to keep good performance. Show error in pull request if the cost exceeds the limit.

sql-leetcode-challenge icon sql-leetcode-challenge

Contains all the 117 Leetcode questions with their solutions ranging from Easy to Hard in MySQL.

tensorflow icon tensorflow

An Open Source Machine Learning Framework for Everyone

usc-dso-570 icon usc-dso-570

Course material for the class DSO 570: The Analytics Edge (Data, Models and Effective Decisions) at USC Marshall School of Business

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.