sandy4321 Goto Github PK
Type: User
Type: User
US census data analysis
Spring 2017
Resources for Data Centric AI
A reference for primarily data cleaning. The dataset we work with is a sample of the data used in this data challenge: https://community.fico.com/s/explainable-machine-learning-challenge?tabset-3158a=2
Dashboard for Data Drift Detection in Python with Evidently and Mercury
Applying Bayesian methods to Moscow Sex Workers
4th place Data Fusion Contest solution
Многоклассовая классификация товаров в чеках
This project offers free activities to practise reproducible data presentation. Pablo Bernabeu organises these events in the context of a Software Sustainability Institute fellowship.
Data journalism and easy to replicate notebooks using Python, R, and Web visualisations
Free, open source data set describing McDonalds Nutrition Facts for popular menu items in SQL, SQLite, JSON, Excel, OpenOffice Spreadsheet, Google Sheets, TSV, etc. Updated November 2015. :hamburger: :fries: :cookie: :cake:
Time series analysis, Sequential pattern mining of IBM stock data set and Classification, Outlier Detection of UCI Abalone data set.
Project was designed in QT Designer and coded in Python using twitter API to fetch data from twitter. Overcoming the issue of limited fetching from twitter. Also introduced multithreading in order to fetch data based on multiple keywords. After the fetching of the tweets, they were analysed based on the words in the tweets whether positive, negative or neutral. And these analysed tweets were further represented in graphical format using MATLAB.
Eclat algorithm written in python used to analyze data. Measures: Confidence, Lift, Leverage, Jaccard, Conviction, Odds Ratio
This is the code part of the data mining module, mainly for predicting the outcome of categorical variables, the model mainly involves logistic regression, decision tree, SVM, random forest and naive bayes.
This is an implementation of MS GSP algorithm with multiple minimum supports that is used for mining sequential patterns
Git repo for the 2021 Citadel Data Open Central Region. Team member: Chuqi Bian, Minglun Pan, Shutong Li, Yuantao Shi
Source code of DPSA lecture notes
This is a data quality assurance and exploration project for Exactbid's bidding platform
Assessing risk ratings from insurance client data
data-science-from-scratch of joelgrus
Data Science Projects, mostly a part of Prof. Steven Skiena's CSE 519 : Data Science Fundamentals
Ipython notebook presentations for getting starting with basic programming, statistics and machine learning techniques
Carefully curated resource links for data science in one place
A curated list of data science blogs
Code examples of one of the best solutions for some Kaggle and DrivenData competitions
code for Data Science From Scratch book
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.