Giter Site home page Giter Site logo

hivenetes / kubernetes-for-machine-learning Goto Github PK

View Code? Open in Web Editor NEW

This project forked from bikram20/kubernetes-for-machine-learning

0.0 0.0 0.0 524 KB

MLOps with Kubernetes: Harnessing GPU Power. Hands-on concepts including NVIDIA GPUs, Microk8s, and Kubeflow.

License: MIT License

Python 80.29% Dockerfile 8.45% Jinja 11.26%

kubernetes-for-machine-learning's Introduction

Setting up Machine Learning Infrastructure on Kubernetes

This tutorial is tailored for Kubernetes and Devops engineers looking to deepen their understanding of machine learning operations with a focus on Kubernetes and containers.

To put this in perspective, there are 3 different personas.

  • Application developers using APIs (eg. Openai api) to build applications. This is not relevant to you.
  • Data Scientists building and finetuning models as a product. This is not relevant to you.
  • Engineers looking to build/operate/learn about ML infrastructure and processes. This is for you.

We'll use MicroK8s on Ubuntu 22.04, running on Paperspace cloud by DigitalOcean, giving you real-world experience in managing ML workflows in Kubernetes.

Why MicroK8s?

  • Thin footprint and low operational overhead to run on a single VM. Hence useful for learning.
  • Highly available and production-ready

Table of Contents

1. Basics of GPU

Learn the fundamentals of GPU technology, with a focus on NVIDIA GPUs. We'll cover hardware checks, driver installation, and dive into GPU compute, memory, and scheduling, including hands-on command-line examples.

2. MLOps

Compare between DevOps and MLOps for Machine learning. While MLOps tooling can be very broad, start with developer view and add the tool only when there is a need.

3. GPU + Container

Set up container environments and deploying GPU-based applications.

4. GPU + Kubernetes

Explore how GPUs integrate with Kubernetes, using Microk8s. We self-host Microk8s/Ubuntu on Paperspace.

5. Multi-instance GPU

Optimize Kubernetes for complex scenarios like multi-instance GPUs, enhancing resource utilization and ML workload performance.

6. Ansible Setup

Set up a GPU-ready, multi-node Microk8s cluster with shared storage on Paperspace using Ansible with opinionated settings.

kubernetes-for-machine-learning's People

Contributors

bikram20 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.