Gary Waiyaki's Projects
Work through this exercise to hone your visualization skills and your understanding of Bayesian hyperparameter optimization in Python for a LightGBM model.
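As a minimal sketch of the idea, assuming the bayesian-optimization package (`bayes_opt`) and a synthetic dataset standing in for the project's data, the tuning loop might look like this:

```python
# Hedged sketch: tune two LightGBM hyperparameters with Bayesian optimization.
# Assumes `pip install lightgbm bayesian-optimization`; the data is synthetic.
from bayes_opt import BayesianOptimization
from lightgbm import LGBMClassifier
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, n_features=20, random_state=42)

def lgbm_cv(num_leaves, learning_rate):
    # Objective: mean cross-validated accuracy for a given hyperparameter pair.
    model = LGBMClassifier(
        num_leaves=int(num_leaves),
        learning_rate=learning_rate,
        n_estimators=100,
        random_state=42,
    )
    return cross_val_score(model, X, y, cv=3, scoring="accuracy").mean()

optimizer = BayesianOptimization(
    f=lgbm_cv,
    pbounds={"num_leaves": (8, 64), "learning_rate": (0.01, 0.3)},
    random_state=42,
)
optimizer.maximize(init_points=5, n_iter=10)
print(optimizer.max)  # best score and hyperparameters found
```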
In this project you will: clean and transform data (handling missing values, removing duplicates, etc.); visualize relationships in the data (correlation heatmaps, pairplots, etc.); pre-process the data and split it into training and testing sets; and finally present and share your findings.
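A hedged sketch of that workflow, with a placeholder `data.csv` and a hypothetical `target` column rather than the project's actual dataset:

```python
# Sketch of the cleaning -> visualization -> split workflow; the file and
# column names are placeholders.
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split

df = pd.read_csv("data.csv")

# Cleaning: drop duplicates, fill missing numeric values with column medians
df = df.drop_duplicates()
df = df.fillna(df.median(numeric_only=True))

# Visualization: correlation heatmap and pairplot
sns.heatmap(df.corr(numeric_only=True), annot=True, cmap="coolwarm")
plt.show()
sns.pairplot(df)
plt.show()

# Split into training and testing sets (assumes a 'target' column)
X = df.drop(columns=["target"])
y = df["target"]
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
```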
Time Series Project on UK Electricity Consumption 2009-2023
Kenny's Source Control with Git Public Repo
Practice what you've learned about cosine similarity by completing this exercise. While working through it, you'll see how cosine similarity is calculated on a numeric dataset and explore its utility for record matching and NLP projects.
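For reference, a minimal sketch of the calculation itself, on two made-up numeric vectors:

```python
# Cosine similarity between numeric records, both by hand with NumPy and
# with scikit-learn; the two vectors are invented examples.
import numpy as np
from sklearn.metrics.pairwise import cosine_similarity

a = np.array([1.0, 2.0, 3.0])
b = np.array([2.0, 4.0, 6.5])

# By definition: dot product divided by the product of the norms
manual = a.dot(b) / (np.linalg.norm(a) * np.linalg.norm(b))

# scikit-learn expects 2-D arrays (rows = records)
sk = cosine_similarity(a.reshape(1, -1), b.reshape(1, -1))[0, 0]

print(manual, sk)  # both close to 1.0: the vectors point in nearly the same direction
```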
This case study explores K-Means clustering: you'll find a suitable value for K using the Elbow method, the Silhouette method, and the Gap statistic, and visualize the clusters with Principal Component Analysis (PCA), working with real data containing information on marketing newsletters and email campaigns as well as transaction-level customer data.
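A minimal sketch of the elbow and silhouette steps plus the PCA visualization (the gap statistic has no scikit-learn implementation, so it's omitted here); synthetic blobs stand in for the customer data:

```python
# Choose K via elbow and silhouette, then project the clusters to 2-D with PCA.
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.metrics import silhouette_score

X, _ = make_blobs(n_samples=300, centers=4, n_features=5, random_state=42)

inertias, silhouettes = [], []
ks = range(2, 9)
for k in ks:
    km = KMeans(n_clusters=k, n_init=10, random_state=42).fit(X)
    inertias.append(km.inertia_)                         # elbow method input
    silhouettes.append(silhouette_score(X, km.labels_))  # silhouette method

best_k = ks[silhouettes.index(max(silhouettes))]
labels = KMeans(n_clusters=best_k, n_init=10, random_state=42).fit_predict(X)

# Visualize clusters in the first two principal components
coords = PCA(n_components=2).fit_transform(X)
plt.scatter(coords[:, 0], coords[:, 1], c=labels)
plt.xlabel("PC1")
plt.ylabel("PC2")
plt.show()
```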
The case study will involve your use of the full data science pipeline, from importing, loading, and cleaning the data right through to modeling and drawing conclusions. In it, you'll use decision trees to implement the supervised learning method of classification.
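A minimal sketch of that classification step, with the Iris dataset standing in for the case study's data:

```python
# Supervised classification with a decision tree on a stand-in dataset.
from sklearn.datasets import load_iris
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=42)

tree = DecisionTreeClassifier(max_depth=3, random_state=42).fit(X_train, y_train)
print("test accuracy:", accuracy_score(y_test, tree.predict(X_test)))
```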
Keen to put what you've learned about Euclidean and Manhattan distance to the test? This exercise asks you to apply these two distance metrics to the same dataset and visualize the results.
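As a quick sketch of how the two metrics differ on the same pair of points, using SciPy's distance functions:

```python
# Euclidean vs Manhattan distance on one made-up pair of points.
import numpy as np
from scipy.spatial.distance import cityblock, euclidean

p = np.array([1.0, 2.0])
q = np.array([4.0, 6.0])

print(euclidean(p, q))  # sqrt((4-1)^2 + (6-2)^2) = 5.0
print(cityblock(p, q))  # |4-1| + |6-2| = 7.0
```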
In this case study, you'll learn more about frequentist inference. There are two parts to the case study. In Part A, you'll learn the Pythonic implementation of the concepts underlying frequentist inference. In Part B, you'll apply those implementations to a real-world scenario.
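As one small example of the kind of building block Part A covers, a one-sample t-test on simulated data (the sample here is synthetic):

```python
# Frequentist building block: a one-sample t-test with SciPy.
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
sample = rng.normal(loc=12.0, scale=3.0, size=50)

# H0: the population mean is 10; the p-value tells us how surprising the
# observed sample mean would be if H0 were true.
t_stat, p_value = stats.ttest_1samp(sample, popmean=10.0)
print(t_stat, p_value)
```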
In this exercise, you will gain a full understanding of how gradient boosting works to improve predictions based on information from the residuals. First, you'll apply this method to a regression problem, then to a classification problem using the Titanic dataset.
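To make the residual idea concrete, here is a hand-rolled sketch of two boosting rounds for regression, on synthetic data rather than the exercise's:

```python
# Gradient boosting by hand: each new tree fits the previous model's residuals.
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(42)
X = rng.uniform(0, 10, size=(200, 1))
y = np.sin(X.ravel()) + rng.normal(scale=0.1, size=200)

learning_rate = 0.5
pred = np.full_like(y, y.mean())  # round 0: constant prediction

for _ in range(2):
    residuals = y - pred                         # what's left to explain
    tree = DecisionTreeRegressor(max_depth=2).fit(X, residuals)
    pred += learning_rate * tree.predict(X)      # correct toward the residuals

print("MSE after boosting:", np.mean((y - pred) ** 2))
```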
In this exercise, you'll use grid search to identify the optimal number of neighbors for a K-nearest neighbors model.
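A minimal sketch of that search with scikit-learn, using the Iris dataset as a stand-in:

```python
# Grid search over the number of neighbors for KNN.
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)
grid = GridSearchCV(
    KNeighborsClassifier(),
    param_grid={"n_neighbors": list(range(1, 21))},
    cv=5,
    scoring="accuracy",
)
grid.fit(X, y)
print(grid.best_params_, grid.best_score_)
```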
Sharpen your data wrangling skills by completing this mini-project.
In this case study, you'll use Random Forest and logistic regression to understand the scope of the coronavirus outbreak, using data from December and January of 2020.
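A hedged sketch of the comparison pattern, fitting both models on the same split; synthetic data stands in for the case study's dataset:

```python
# Fit Random Forest and logistic regression on the same split and compare.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, n_features=10, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

for model in (LogisticRegression(max_iter=1000), RandomForestClassifier(random_state=42)):
    model.fit(X_train, y_train)
    print(type(model).__name__, model.score(X_test, y_test))
```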
In this case study, you'll analyze whether there is a significant difference between the ratings on these two platforms that would justify choosing one over the other. If there's not, you can always just flip a coin to pick which platform to use at random.
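One common way to make such a comparison is a two-sample t-test; a minimal sketch on invented rating arrays:

```python
# Two-sample t-test on ratings from two platforms (data invented here).
import numpy as np
from scipy import stats

ratings_a = np.array([4.1, 3.9, 4.5, 4.2, 3.8, 4.0, 4.3])
ratings_b = np.array([3.7, 4.0, 3.9, 4.1, 3.6, 3.8, 4.0])

t_stat, p_value = stats.ttest_ind(ratings_a, ratings_b, equal_var=False)
print(p_value)  # p >= 0.05 -> no significant difference; a coin flip will do
```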
This case study explores which properties of red wines are associated with higher alcohol content.
We are going to scrape some financial data (stock prices) from Yahoo Finance, using requests and Beautiful Soup to fetch and parse the information.
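A minimal sketch of the requests-plus-parsing pattern; the URL, headers, and what you ultimately select are assumptions, since Yahoo's markup changes often:

```python
# Fetch a quote page and parse it with Beautiful Soup. Treat this as the
# general pattern, not a recipe tied to Yahoo's current page structure.
import requests
from bs4 import BeautifulSoup

url = "https://finance.yahoo.com/quote/AAPL"  # hypothetical target page
headers = {"User-Agent": "Mozilla/5.0"}       # many sites reject bare clients

response = requests.get(url, headers=headers, timeout=10)
response.raise_for_status()

soup = BeautifulSoup(response.text, "html.parser")
print(soup.title.text)  # inspect the page, then select the price element
```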
In this case study, you'll use MySQL, phpMyAdmin, Jupyter Notebook, and SQLite to tackle a series of challenges on a database containing information about a country club.
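As a sketch of the SQLite side from a notebook; the database file, table, and column names here are hypothetical:

```python
# Run a challenge-style query against SQLite with Python's standard library.
import sqlite3

conn = sqlite3.connect("country_club.db")  # placeholder database file
cur = conn.cursor()
cur.execute("""
    SELECT name, membercost
    FROM Facilities
    WHERE membercost > 0
""")
for row in cur.fetchall():
    print(row)
conn.close()
```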
In this exercise, you will make like your great data storyteller forebears and tell a compelling story about a dataset of interest to you.
The King County, Washington House dataset is a collection of records about single-family homes sold in King County, Washington, between 2014 and 2015.
As a US government data scientist, you'll analyze historical sales data from Cowboy Cigarettes (est. 1890) spanning 1949-1960. Your goal is to predict sales trends in the early 1960s for a report on public health and cigarette companies.
Time Series Analysis and Forecasting for sales data
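For time-series forecasting work like these last two projects, a minimal sketch of a seasonal ARIMA fit with statsmodels; the monthly series and the model orders are placeholders, not fitted choices:

```python
# Seasonal ARIMA forecast sketch; synthetic monthly sales with trend + seasonality.
import numpy as np
import pandas as pd
from statsmodels.tsa.statespace.sarimax import SARIMAX

idx = pd.date_range("1949-01", periods=144, freq="MS")
rng = np.random.default_rng(42)
sales = pd.Series(
    np.linspace(100, 200, 144)
    + 10 * np.sin(2 * np.pi * np.arange(144) / 12)
    + rng.normal(scale=3, size=144),
    index=idx,
)

results = SARIMAX(sales, order=(1, 1, 1), seasonal_order=(1, 1, 1, 12)).fit(disp=False)
forecast = results.forecast(steps=24)  # predict the next two years
print(forecast.head())
```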