mrinal1704 Goto Github PK
Name: Mrinal Gupta
Type: User
Company: Capital One
Bio: Senior Data Scientist at Capital One
Location: Washington, DC
Blog: mrinalusc.wordpress.com
Name: Mrinal Gupta
Type: User
Company: Capital One
Bio: Senior Data Scientist at Capital One
Location: Washington, DC
Blog: mrinalusc.wordpress.com
Code and associated files for the AI Programming with Python Nanodegree Program
A curated list of awesome READMEs
The provided dataset contained application (identity) fraud cases. It was a supervised problem as the data included a column showing the application’s fraud label (whether an application was fraudulent or not). It also contained several identifying data fields about the applicant such as SSN, address, phone number, etc. The dataset had 1,000,000 records and 10 data fields. We first described and visualized each of the 10 data fields and treated all frivolous values. Then we created 634 candidate variables and performed feature selection to reduce them to 30. Finally, we used a few different machine learning algorithms (both linear and nonlinear) to predict fraudulent applications records.
Credit card fraud is a burden for organizations across the globe. Specifically, $24.26 billion were lost due to credit card fraud worldwide in 2018, according to shiftprocessing.com. In this project, our goal was to build an effective and efficient model to predict fraud. We analyzed a real-world dataset that contained a list of government related credit card transactions over the 2010 calendar year. The data presented a supervised problem as it included a column showing the transaction’s fraud label (whether a transaction was fraudulent or not). It also contained identifying information about each transaction such as the credit card number, merchant, merchant state, etc. The dataset had 96,753 records and 10 data fields. We first described and visualized each of the 10 data fields, cleaned the dataset, and filled in missing values. Then we created many variables and performed feature selection. Finally, we created a variety of machine learning models (both linear and nonlinear) and highlighted our results.
This project contains a web app that asks for a message from a potential user who is in danger during a disaster and the app categorizes that message into a particular category such as aid related, weather-related, fire or many more using natural language processing and AdaBoost classifier.
This repo contains a Dog breed classifier algorithm using deep learning. The main functions of this algorithm are that if a dog is detected in the image, it will provide an estimate of the dog's breed. If a human is detected, it will provide an estimate of the dog breed that is most resembling.
Contains Machine learning pipeline and ETL pipeline notebooks in order to practice and learn their working.
HW submission for INF - 552 (Machine Learning for Data Science)
A web application that predicts the salary of a job posting from job description and location using machine learning algorithms.
Content for Udacity's Machine Learning curriculum
MLFlow Spark Summit 2019 Presentation
This repo contains the files of a python package developed by me which automates the task of getting basic insights from data.
This project analyzes New York City’s (NYC’s) real estate data to specifically identify property tax fraud. The main indicators of property tax fraud were property tax assessments that were too high or too low. Given a property dataset of 1,070,994 records and 32 data fields, we first described, visualized, and filled in missing values for each variable. Second, 45 additional variables were created in order to create the most accurate algorithm. Next, we used dimensionality reduction techniques to refine our dataset. Finally, we used (principal component analysis (PCA) and an autoencoder) to obtain two separate fraud scores. The scores were combined and then ranked to get a final fraud score.
clone:https://github.com/REMitchell/python-scraping
This repo contains my first hands-on experience in developing a Recommendation engine using IBM Watson Studio dataset. The goal is to recommend the articles to the user using varius types of Recommendation engines that I studied while pursuing my Data Science Nanodegree from Udacity.
Deep learning model using NLP to predict job salary based on Indeed job postings
Calculate the real cost to run your JS app or lib to keep good performance. Show error in pull request if the cost exceeds the limit.
Contains all the 117 Leetcode questions with their solutions ranging from Easy to Hard in MySQL.
An Open Source Machine Learning Framework for Everyone
Course material for the class DSO 570: The Analytics Edge (Data, Models and Effective Decisions) at USC Marshall School of Business
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.