Giter Site home page Giter Site logo

Hi there and welcome! 👋

About Me:

I am a passionate and dedicated machine learning engineer with a proven track record of developing high-scale machine learning workflows. Leveraging a robust blend of data science, data engineering, and software engineering skills, I specialize in creating innovative solutions to complex problems. My expertise spans various machine learning techniques, including linear regression, classification, clustering, and Natural Language Processing (NLP) tasks. I have extensive experience designing and implementing predictive models that drive actionable insights and business value. Whether it is building sophisticated ETL and Reverse ETL pipelines or optimizing models for predictive accuracy, I thrive on tackling challenging projects and delivering exceptional results.

In addition to my technical skills, I am committed to continuous learning and staying abreast of the latest advancements in the field. I advocate for best practices in machine learning and software development practices, ensuring that my workflows are efficient, scalable, and maintainable. I enjoy collaborating with cross-functional teams to translate business requirements into technical specifications, and I am adept at communicating complex technical concepts to non-technical stakeholders. My goal is to harness the power of machine learning to solve real-world problems and contribute to impactful projects.

Languages and Tools

Cloud:

   Azure       AWS          GCP      
Azure AWS GCP

Languages:

Python JavaScript    SQL   
Python JavaScript SQL SQL SQL

Favorite Frameworks and Libraries:

Hugging Face Selenium Numpy Pandas PySpark Sklearn Openpyxl
Hugging Face Selenium Numpy Pandas Spark Scikitlearn Openpyxl
Django React Pytorch Tensorflow
Django React Pytorch Tensorflow

Environments, Testing, Other:

   Git       Docker       Pytest       Postman       Vite   
Git Docker Pytest Postman Vite

Samuel Rodriguez's Projects

airbnb_ml icon airbnb_ml

Followed the CRISP-DM process to clean, transform, and evaluate two algorithms (multiple linear regression and random forest regression) to predict AIRBNB listing prices

cvsarimax icon cvsarimax

CVSARIMAX is a Python package designed to facilitate robust forecasting by implementing sequential cross-validation techniques with SARIMAX models. This package provides an essential tool for analysts and data scientists who need to evaluate time series forecasting models accurately, ensuring that the temporal integrity of the data is maintained.

disaster_response_webapp icon disaster_response_webapp

Created a web app to display disaster response categories predicted based on a random forest classifier trained over an NLP pipeline that cleans, stems, lemmatize, and vectorize the provided data

enron_ml icon enron_ml

Predicting point of interest in the Enron fraud case using scikit-learn

global-temp icon global-temp

Final Capstone project of Udacity Data Engineering Nanodegree.

identifying_customer_segments icon identifying_customer_segments

Applied unsupervised learning techniques to identify segments of the population that form the core customer base for a real-life mail-order sales company in Germany to aid the direct marketing campaigns towards audiences that will have the highest expected rate of returns.

metamorph icon metamorph

A package for de-identifying data, transforming it for privacy.

predicting_charity_donors icon predicting_charity_donors

Employed several supervised algorithms and chose the best candidate algorithm from preliminary results using data collected from the 1994 U.S. Census and further optimize this algorithm to best model the data to accurately predict whether an individual makes more than $50,000.

pystickyn icon pystickyn

A Python package for organizing your thoughts in sticky notes within a Jupyter Notebook.

sakeoflearning icon sakeoflearning

Cheat sheets, tutorials, and more... from Python topics I have learned over the years.

sparkify icon sparkify

Predict when users are about to churn or cancel the services. So basically it is a warning detection to prevent possible revenue loss due to service cancelling. It uses a Random Forest Classifier to as the model of choice.

titanic-analysis icon titanic-analysis

Posed a question about a dataset, then used NumPy and Pandas to answer that question based on the data and created a report to share the results.

udemy-practice icon udemy-practice

This is where I refine my skills by constantly taking Udemy courses. I push the contents of the course here, including my practice files, to be able to clone them from wherever I am. I also practice git this way.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.