Giter Site home page Giter Site logo

Hi there, I'm Caron

My AWS Certified Cloud Practitioner Badge

I am a Mathematical & Computer Science graduate, with experience as an intern Data Scientist at ExploreAI Academy.

Proficient with SQL, Python, Power BI, Exploratory Data Analysis, Machine Learning, AWS EC2.

Highly Skilled In: Team Leading, Project Management, Time Management, Communication, Data Science Life Cycle.

Projects I have worked on:

  • Regression: Used regression machine learning models to predict the three-hourly electricity load shortfall.
  • Classification: Using Natural Language Processing (NLP) and classification machine learning models to classify tweets into negative, positive, neutral, or factual news.
  • Unsupervised learning: Built a movie recommender system that recommends movies to a user, and hosted the web application with the help of AWS.
  • Credit card fraud detection project where I built machine learning models to predict whether a credit transaction is fraudulent or not.
  • Loan default prediction project where the models predict whether a person is most likely to default on their loan or not.

Machine Learning models I have worked on:

  • Regression: Linear Regression, Decision Tree, Random Forest, XGBoost, Voting Regressor, Stacking Regressor.
  • Classification: Logistic Regression, Random Forest Classifier, KNN, Naive Bayes Classifier.
  • Unsupervised: Content-based and collaborative filtering using Singular Value Decomposition, Non-negative Matrix Factorization, Clustering to group movies together based on their similarity in terms of genre, director, actors, and other features, and Principal Component Analysis(PCA) to identify the most important features that contribute to a user's movie preferences.

Languages and tools ⚙️

AWS, Jupyter Notebook, Git and GitHub, Virtual Studio Code, PowerBI, Trello, Discord, Slack.

Python Logo Bash Logo AWS Logo VSCode Logo


Feel free to view more on Linkedin 😄 .

A picture of Caron Sathekge

Caron Sathekge's Projects

powerbi icon powerbi

Power BI Data Analysis, Cleaning, Reporting, Visualization

project-walkthroughs icon project-walkthroughs

Data science, machine learning, and web development project code for https://www.youtube.com/c/Dataquestio .

real_time_stock_streaming_with_azure_soark-cashtag icon real_time_stock_streaming_with_azure_soark-cashtag

My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on ​lambda architecture​, that aggregates Twitter and US stock market data for user sentiment analysis using open source tools - ​Apache Kafka ​for data ingestions, Apache Spark ​& ​Spark Streaming ​for batch & real-time processing, ​Apache Cassandra

sql-code-ssms icon sql-code-ssms

All SQL Code or Documents from AlexTheAnalyst YouTube videos

tesla-clone icon tesla-clone

Used React Native to clone the front end part of the Tesla App

twitter-sentiment-analysis-nlp-hackathon icon twitter-sentiment-analysis-nlp-hackathon

Problem Statement: Given the tweets from customers about various tech firms who manufacture and sell mobiles, computers, laptops, etc, the task is to identify if the tweets have a negative sentiment towards such companies or products.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.