Giter Site home page Giter Site logo

airflow-mlflow-ml's Introduction

Airflow 2.x for ML Pipeline

This is the git repo for learning airflow pipeline implementation for machine learning use case. The code has been improvised from the source repositories:

  1. e2e-ml-pipeline-airflow - https://github.com/NicoloAlbanese/airflow-ml-pipeline-mvp
  2. pycon-sweden-airflow-ml-pipelines - https://github.com/pycon-ml/airflow_workshop

Pre-requisites

You should have docker and docker-compose installed on your machine !

The easiest way to have everything ready for the workshop is to install Docker Desktop

Docker resource requirement

Minimum resource requirement for docker to start all the services is mentioned below:

Resource Recommendation
Memory 3 GB
CPU 2 CPU

Getting started

1. Clone this repo

Clone the repo and cd into corresponding folder.

2. Use docker-compose to start the applications:

docker-compose up

4. Access services from browser

Airflow

UI: http://localhost:8080

Username: airflow

Password: airflow

MLflow

http://localhost:5000

Celery Flower

http://localhost:5555

Tear down

Stop and remove containers, networks, images, and volumes

docker-compose down

Supplements for troubleshooting

  1. https://stackoverflow.com/questions/66172375/docker-desktop-is-using-12-gb-ram-to-run-one-container-with-24-mb-ram
  2. https://stackoverflow.com/questions/66699394/airflow-how-to-get-pip-packages-installed-via-their-docker-compose-yml/66701128#66701128
  3. helm/charts#23589 (comment)

References

  1. https://pretalx.com/pycon-sweden-2021/talk/JRCLRG/
  2. https://towardsdatascience.com/end-to-end-machine-learning-pipeline-with-docker-and-apache-airflow-from-scratch-35f6a75f57ad

airflow-mlflow-ml's People

Watchers

Shrikanth Singh avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.