Giter Site home page Giter Site logo

Arpita Parmar

I am a passionate data scientist who has broad and in-depth data engineering, programming, statistics skills. I am using these skills to solve various business problems by using machine learning, data mining, and other types of data analytics and data visualization tools such as Python, Spark, Databricks, Azure suite of data tools, TensorFlow, Karas, Tableau, Hive, Power BI , Azure Synapse etc. I have more than 12 years of experience in data analytics, data mining and predictive modeling. I did my Masters in Computer Science at National University.

In order to showcase my work in my portfolio, all outputs contain anonymized,synthesized sample data. There is no sensitive or proprietary information contained in any of the outputs.

Click on each project title below to view github repository.

For this project, I built a supervised classification model which predicts which employee will stay with company and which employee will leave the company. This also estimates the probability of an employee leaving. The project also has presciptive solution where it shows what are the reasons behind employee atrition so an organization can take appropriate action to avoid employee attrition.

For this project, I used Facebook's Prophet package which predicts passenger Seats availability based on historic trends. I also created a flavour of algorithm where it predicts availability by different category in a loop. I added extra regressors for missing dates/data so that the model is not underfitting. These same extra regressors can be used for other variables, so the same code can be used for multivariate analysis. Plotly intercative charts are used for all forecasts so one can switch between different time periods in same chart without having to create multiple forecasts for multiple time periods.

This Project uses NLP libraries to analyze sentiments and tags them as positive , negative and neutral sentiments. It uses NLTK libraries to tokenize the words, and it also has word cloud to see what are most used words in a comment or conversation.

This projects aims to look at climate change by examining hurricane data from NOAA (National Oceanic and Atmospheric Administration) regarding the Atlantic basin. It does Geospatial analysis using python library folium to analyze hurricane tracks, landfalls and their impact over the years in US region. Folium maps used for this project shows the heatmap effect over the years theough sliders. This project can be used to understand geographical impact of any factor by analysing latitude and longitude data.

Inspired by paper https://arxiv.org/pdf/1506.06579.pdf. This project visualizes Neural Network Activation, Weights, Gradients to understand and interpret neural networks and what goes behind network activation and other aspects.

For this experiment, I built an unsupervised clustering algorithm which segments retail transactions in groups based on similarity and also detencts anamolous transactions.

This project contains analysis of Covid-19 cases worldwide and in US. It uses geospatial libraries to visualize data through interactive maps and it also uses Logistic regression sigmoid model to predict cases in future. It is an end to end machine learning solution.

My exposure to vision AI and tensorflow is limited, but in this project I attempted to create a binary classifier for Image and image segmentation.

This project uses shell and open source utility called "DATMO" to track tensoeflow models. Datmo is an open source production model management tool for data scientists

ArpitaisAn0maly's Projects

azure-search-openai-demo icon azure-search-openai-demo

A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure Cognitive Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.

geospatialanalysis_folium icon geospatialanalysis_folium

This repository contains Geospatial analysis files using python library folium to analyze hurricane tracks, landfalls and their impact over the years in US region

machine-learning-covid19 icon machine-learning-covid19

This Repository contains analysis of covid 19 cases worldwide and in US. It uses geospatial libraries to visualize data through interactive maps and it also uses sigmoid model to predict cases in future. It is an end to end machine learning solution.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.