Alex Furrier's Projects
š Papers & tech blogs by companies sharing their work on data science & machine learning in production.
:memo: An awesome Data Science repository to learn and apply for real world problems.
A Libgen Fiction store plugin for Calibre
A module with boilerplate code for computing and plotting common classification metrics. Flexible to multiclass problems.
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Intro to Computing for Data Analysis
Various code to aid in data science projects for tasks involving data cleaning, ETL, EDA, NLP, viz, feature engineering, feature selection, etc.
Various code to aid in data science projects for tasks involving data cleaning, ETL, EDA, NLP, viz, feature engineering, feature selection, model validation, etc.
T2S4FWF -> (Text to Speech for Fun With Friends)
A default project structure for data projects with a focus on repoducible research and build automation
Pipeline for extracting sentiment towards entities
Fiddling with GPT2 and other NLP models for interesting corpus text generation
The Open Source Data Science Masters
Scraping UA salary database with python script
NBA hackathon 2018
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Free MLOps course from DataTalks.Club
Practice your pandas skills!
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine Learning with Theano
š Python project template with unit tests, code coverage, linting, type checking, Makefile wrapper, and GitHub Actions.
Rec Center Count data for finding the optimal time to visit
:art: A ridiculously elegant Jekyll theme.
A Discord bot for LLM chain apps
Using the stats to illustrate the Sean Miller Era of UA basketball
Example reproducible analysis project for a supervised learning data science problem
š¤ Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.