Giter Site home page Giter Site logo

Hi there, I'm Caron

My AWS Certified Cloud Practitioner Badge

I am a Mathematical & Computer Science graduate, with experience as an intern Data Scientist at ExploreAI Academy.

Proficient with SQL, Python, Power BI, Exploratory Data Analysis, Machine Learning, AWS EC2.

Highly Skilled In: Team Leading, Project Management, Time Management, Communication, Data Science Life Cycle.

Projects I have worked on:

  • Regression: Used regression machine learning models to predict the three-hourly electricity load shortfall.
  • Classification: Using Natural Language Processing (NLP) and classification machine learning models to classify tweets into negative, positive, neutral, or factual news.
  • Unsupervised learning: Built a movie recommender system that recommends movies to a user, and hosted the web application with the help of AWS.
  • Credit card fraud detection project where I built machine learning models to predict whether a credit transaction is fraudulent or not.
  • Loan default prediction project where the models predict whether a person is most likely to default on their loan or not.

Machine Learning models I have worked on:

  • Regression: Linear Regression, Decision Tree, Random Forest, XGBoost, Voting Regressor, Stacking Regressor.
  • Classification: Logistic Regression, Random Forest Classifier, KNN, Naive Bayes Classifier.
  • Unsupervised: Content-based and collaborative filtering using Singular Value Decomposition, Non-negative Matrix Factorization, Clustering to group movies together based on their similarity in terms of genre, director, actors, and other features, and Principal Component Analysis(PCA) to identify the most important features that contribute to a user's movie preferences.

Languages and tools ⚙️

AWS, Jupyter Notebook, Git and GitHub, Virtual Studio Code, PowerBI, Trello, Discord, Slack.

Python Logo Bash Logo AWS Logo VSCode Logo


Feel free to view more on Linkedin 😄 .

A picture of Caron Sathekge

Caron Sathekge's Projects

credit_card_fraud_detection icon credit_card_fraud_detection

The data set contains transactions made by credit cards in September 2013 by European cardholders. It consists of transactions that occured in 2 days, where 492 of the transactions are fraud, out of 284 807 transactions.

credit_risk_modeling icon credit_risk_modeling

Credit Risk : The possibility of a loss resulting from a borrower's failure to repay a loan or meet contractual obligations.

deeplearningexplorations2 icon deeplearningexplorations2

Exploring different Deep learning problems primarily (but not limited to) using FastAI2, HuggingFace and Pytorch Library

filestore icon filestore

A repo where files will be stored, and accessed via the internet.

language_identification_hackathon icon language_identification_hackathon

In this challenge, we will take text which is in any of South Africa's 11 Official languages and identify which language the text is in. This is an example of NLP's Language Identification, the task of determining the natural language that a piece of text is written in.

myresources icon myresources

A repo where I saved my Data Science Resources in one place

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.