msellamitn Goto Github PK
Type: User
Type: User
📝 A compilation of everything that I learn; Computer Science, Software Development, Engineering, Math, and Coding in General.
Open Content for self-directed learning in data science
Code repo for Learning Data Mining with Python, published by Packt Publishing
⚡ The process of learning DevOps engineering
Slides and Jupyter notebooks for the Deep Learning lectures at M2 Data Science Université Paris Saclay
DATAQUEST October's Monthly Challenge
Parallel implementation of the Effective Fragment Potential Method
Prediction of loan defaulter based on more than 5L records using Python, Numpy, Pandas and XGBoost
Generates more or less realistic log data for testing simple aggregation queries.
Log processing system using Flume and Cassandra
LookML for ecommerce product demonstration
VS Code syntax highlighting for Looker Model files (.lkml)
This project goal is to demonstrate how to use LSTM networks and apply in some real data.
My code for various Lynda.com courses.
Realtime M2M Traffic corridor creation for emergency service
Classical equations and diagrams in machine learning
Monthly Series - Machine Learning Top 10 Open Source Projects
Python & R Data Science Projects
Source code about machine learning and security.
Geo Spatial Data Analytics on Spark
Predict whether a mammogram mass is benign or malignant We'll be using the "mammographic masses" public dataset from the UCI repository (source: https://archive.ics.uci.edu/ml/datasets/Mammographic+Mass) This data contains 961 instances of masses detected in mammograms, and contains the following attributes: 1. BI-RADS assessment: 1 to 5 (ordinal) 2. Age: patient's age in years (integer) 3. Shape: mass shape: round=1 oval=2 lobular=3 irregular=4 (nominal) 4. Margin: mass margin: circumscribed=1 microlobulated=2 obscured=3 ill-defined=4 spiculated=5 (nominal) 5. Density: mass density high=1 iso=2 low=3 fat-containing=4 (ordinal) 6. Severity: benign=0 or malignant=1 (binominal) BI-RADS is an assesment of how confident the severity classification is; it is not a "predictive" attribute and so we will discard it. The age, shape, margin, and density attributes are the features that we will build our model with, and "severity" is the classification we will attempt to predict based on those attributes. Although "shape" and "margin" are nominal data types, which sklearn typically doesn't deal with well, they are close enough to ordinal that we shouldn't just discard them. The "shape" for example is ordered increasingly from round to irregular. A lot of unnecessary anguish and surgery arises from false positives arising from mammogram results. If we can build a better way to interpret them through supervised machine learning, it could improve a lot of lives. we will apply several different supervised machine learning techniques to this data set, and see which one yields the highest accuracy as measured with K-Fold cross validation (K=10). we will apply: * Decision tree * Random forest * KNN * Naive Bayes * SVM * Logistic Regression * And, as a bonus challenge, a neural network using Keras.
Applied unsupervised learning technique to group customers that exhibit similar behaviors for further marketing campaigns. Develop a hypothesis for purchasing trends, analyzed purchasing behavior of each segment of to understand true buying habits
Mastering Apache Spark 2
From the book of the sample title, implementing the solution based on my understanding rather than just copying them 1-to-1
Code repository for the Mastering Kafka Streams and ksqlDB book
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.