Comments (20)

savingoyal avatar savingoyal commented on July 24, 2024 2

#1256 adds formal support for Airflow in Metaflow. Docs & release announcement to follow soon!

from metaflow.

danieltahara avatar danieltahara commented on July 24, 2024 1

I would second @impredicative's comment that this is probably too broad.

In particular, I think there's potential independent value in having a plugin implementation of a k8s CLI, compute environment, and decorator. Based on a quick scan, it doesn't seem like there's too much functionality to implement there: just make a kube job definition, come up with an annotation scheme (probably something similar to what Airflow does), and handle cleanup. Drop in some example RBAC templates and you're probably good to go.

I think it would probably be fine to stop at container/job orchestration, and leave things like cluster autoscaling to pointers to existing k8s docs and tools.

The scheduler (e.g., Airflow, AWS Step Functions, or Argo) seems like a separate discussion that's out of scope for a question about Kubernetes.
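
To make the "kube job definition plus annotation scheme plus cleanup" idea concrete, here is a minimal sketch in Python. It only builds a `batch/v1` Job manifest as a dict; it is not Metaflow's actual implementation. The function name, the `example.org/...` annotation keys, and the image and command in the usage lines are all hypothetical. Cleanup is delegated to the Job's `ttlSecondsAfterFinished` field rather than handled by hand.

```python
import json


def build_job_manifest(flow_name, run_id, step_name, image, command):
    """Return a batch/v1 Job manifest (as a dict) for one step of a flow.

    Hypothetical helper for illustration; the annotation keys are made up.
    """
    # Kubernetes object names must be lowercase and may not contain "_".
    job_name = f"{flow_name}-{run_id}-{step_name}".lower().replace("_", "-")
    return {
        "apiVersion": "batch/v1",
        "kind": "Job",
        "metadata": {
            "name": job_name,
            # Assumed annotation scheme: enough to trace a Job back to its step.
            "annotations": {
                "example.org/flow-name": flow_name,
                "example.org/run-id": str(run_id),
                "example.org/step-name": step_name,
            },
        },
        "spec": {
            "backoffLimit": 0,  # do not let Kubernetes retry the step itself
            "ttlSecondsAfterFinished": 600,  # let the cluster clean up finished Jobs
            "template": {
                "spec": {
                    "restartPolicy": "Never",
                    "containers": [
                        {"name": "step", "image": image, "command": command}
                    ],
                }
            },
        },
    }


manifest = build_job_manifest(
    "HelloFlow", 42, "train_model", "python:3.11", ["python", "flow.py"]
)
print(json.dumps(manifest, indent=2))
```

The printed JSON could be submitted with `kubectl apply -f -` or through a Kubernetes client library; RBAC and autoscaling are left out, in line with the scope suggested above.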

talebzeghmi avatar talebzeghmi commented on July 24, 2024 1

Why not compile to Kubeflow Pipelines via an intermediate representation (IR) [1]?

[1] kubeflow/pipelines#3703

savingoyal avatar savingoyal commented on July 24, 2024 1

For folks following this thread, we recently announced equivalent support for AWS Step Functions. Here is an article with more details.

SteNicholas avatar SteNicholas commented on July 24, 2024

@JoshZastrow I thought that Metaflow could integrate with Kubeflow, which is the machine learning toolkit for Kubernetes.

savingoyal avatar savingoyal commented on July 24, 2024

@JoshZastrow Thanks for opening the issue! Yes, we are evaluating and prioritizing our roadmap currently.

nlaille avatar nlaille commented on July 24, 2024

And what about using Argo instead of Airflow? (https://github.com/argoproj/argo)
Can it be included in this issue, or should it be a separate one?

savingoyal avatar savingoyal commented on July 24, 2024

@nlaille Let's track that as a separate issue so that people can vote and weigh in with their opinions.

impredicative avatar impredicative commented on July 24, 2024

IMHO this issue is too broad. Let me separate the use of Airflow with and without Kubernetes. You probably don't need Metaflow if you're using Airflow with Kubernetes. You may need Metaflow as an Airflow executor and an Airflow operator if you're using Airflow without Kubernetes.

Admittedly, I'm not entirely familiar with everything Metaflow offers just yet.

I love open source software and solutions, including Airflow, which I use, but I believe this issue should be closed unless the OP can substantiate what Metaflow would meaningfully add to the Airflow-with-Kubernetes combo.

Jeffwan avatar Jeffwan commented on July 24, 2024

The orchestration part could be a cloud solution like AWS Step Functions, a container-based orchestration solution like Argo, or another orchestrator like Airflow.

One reasonable option is to map the Metaflow DAG to a Step Functions/Argo/Airflow DAG and execute it remotely. The compute resources would need to change correspondingly. I totally agree with @impredicative's point: unless users have clear requirements, this integration is not that meaningful.
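
As a rough illustration of what "mapping the DAG" could look like, the sketch below turns a step graph into the `tasks` list of an Argo Workflows DAG template, where each task names its `dependencies`. The input format (a dict from step name to downstream step names) and the function name are assumptions made for this example; this is not how Metaflow represents or compiles flows internally.

```python
def to_argo_dag_tasks(graph):
    """Convert {step: [downstream steps]} into Argo-style DAG tasks.

    Hypothetical helper: each task depends on the steps that point to it.
    """
    # Invert the edges: Argo tasks list their parents, not their children.
    parents = {step: [] for step in graph}
    for step, children in graph.items():
        for child in children:
            parents.setdefault(child, []).append(step)
    return [
        {"name": step, "template": step, "dependencies": sorted(deps)}
        for step, deps in parents.items()
    ]


# A small branch-and-join flow, written by hand for the example.
flow_graph = {
    "start": ["a", "b"],
    "a": ["join"],
    "b": ["join"],
    "join": ["end"],
    "end": [],
}

for task in to_argo_dag_tasks(flow_graph):
    print(task)
```

The same inverted-edge structure would map onto Airflow task dependencies or Step Functions states with different syntax; the per-step compute settings mentioned above are not modeled here.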

savingoyal avatar savingoyal commented on July 24, 2024

@talebzeghmi Yes, an IR for KFP would be great. Is there an RFC for it? We are happy to contribute our thoughts.

impredicative avatar impredicative commented on July 24, 2024

"There are existing mechanisms for triggering workflows based on external events."

For clarity, what are these? What if I want to trigger it on a schedule like Airflow allows me to do?

savingoyal avatar savingoyal commented on July 24, 2024

For AWS Step Functions, we provide time-based triggers out of the box right now. You can very easily configure other triggers (say, on data availability in S3, using Amazon EventBridge).
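
For the S3 example, a data-availability trigger comes down to an EventBridge rule whose event pattern matches S3 "Object Created" events and whose target is the Step Functions state machine. The sketch below only builds that event pattern as JSON. The bucket name, key prefix, and function name are hypothetical, and it assumes EventBridge notifications are enabled on the bucket; creating the rule and its target is left out.

```python
import json


def s3_object_created_pattern(bucket, key_prefix):
    """Build an EventBridge event pattern for new objects under a prefix.

    Hypothetical helper; bucket and prefix are supplied by the caller.
    """
    return {
        "source": ["aws.s3"],
        "detail-type": ["Object Created"],
        "detail": {
            "bucket": {"name": [bucket]},
            # Prefix matching restricts the trigger to one "folder".
            "object": {"key": [{"prefix": key_prefix}]},
        },
    }


pattern = s3_object_created_pattern("my-data-bucket", "incoming/")
print(json.dumps(pattern, indent=2))
```

The resulting JSON is what would go into the rule's event pattern, with the deployed state machine attached as the rule's target.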

lucianoviola avatar lucianoviola commented on July 24, 2024

@savingoyal Is there any support for event-based triggers (e.g., a REST API)?

savingoyal avatar savingoyal commented on July 24, 2024

@lucianoviola Yes, you can use Amazon EventBridge to do event-based triggering of Step Functions workflows.

savingoyal avatar savingoyal commented on July 24, 2024

#50 (comment) If you would like to try out and give feedback on our Kubernetes integration, please reach out at http://slack.outerbounds.co

savingoyal avatar savingoyal commented on July 24, 2024

#992 provides GA support for Kubernetes. https://github.com/outerbounds/metaflow/tree/airflow is tracking the Airflow integration on top of Kubernetes.

wanghong230 avatar wanghong230 commented on July 24, 2024

Kubernetes support was added by way of Argo Workflows support, great!
#992 (Dispatch Metaflow flows to Argo Workflows)

savingoyal avatar savingoyal commented on July 24, 2024

This branch tracks the work for this issue.

savingoyal avatar savingoyal commented on July 24, 2024

https://outerbounds.com/blog/better-airflow-with-metaflow/
