This project forked from googlecloudplatform/dataflowtemplates

Google-provided Cloud Dataflow template pipelines for solving simple in-Cloud data tasks

Home Page: https://goo.gl/BPX2zL

License: Apache License 2.0

Google Cloud Dataflow Template Pipelines

This fork exists to "fix" one shortcoming of Google's original repo: it does not maintain any tags or versions. So far, the only part changed is this README file.

It only holds instructions on how to build Google's templates in a way that Google does not currently provide.

Prerequisites

On your local machine you need to have:

  • Java 8. The build fails with some newer versions, but works fine with Java 8. Make sure that JAVA_HOME points to this version.
  • Maven 3
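
The prerequisites above can be checked before building. The following is a minimal sketch, not part of the original instructions: the helper name `is_java8` is hypothetical, and it assumes that a Java 8 runtime reports a version string starting with `1.8.` (as Oracle and OpenJDK 8 builds do).

```shell
# Hypothetical helper: succeeds when the given `java -version` output
# reports a 1.8.x (Java 8) runtime.
is_java8() {
  case "$1" in
    *'version "1.8.'*) return 0 ;;
    *) return 1 ;;
  esac
}

# Use the java binary under JAVA_HOME when it is set, else the one on PATH.
java_bin="${JAVA_HOME:+$JAVA_HOME/bin/}java"

# `java -version` writes to stderr, so redirect it into the substitution.
if is_java8 "$("$java_bin" -version 2>&1)"; then
  echo "Java 8 found"
else
  echo "Java 8 not found: check JAVA_HOME"
fi

# Only report whether Maven is available; the major version is not verified here.
command -v mvn >/dev/null 2>&1 || echo "Maven not found on PATH"
```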

I propose using roughly the same versioning convention as Google does for the bucket of built jobs: when you make a new build, place it in gs://dataflow-staging-europe-west1-105942741667/templates/google/YYYY-MM-DD/Job_Name
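
As an illustration of this convention, the dated path can be derived in the shell instead of being typed by hand. This is only a sketch: the function name `template_location` and the `BUCKET` variable are hypothetical, and the date defaults to today's date on the local machine.

```shell
# Bucket taken from the convention above.
BUCKET="gs://dataflow-staging-europe-west1-105942741667"

# Hypothetical helper: print the template path for a job name.
#   $1 = job name (e.g. PubsubToAvro)
#   $2 = optional build date in YYYY-MM-DD (defaults to today)
template_location() {
  job_name="$1"
  build_date="${2:-$(date +%Y-%m-%d)}"
  printf '%s/templates/google/%s/%s\n' "$BUCKET" "$build_date" "$job_name"
}

# Example with an explicit, made-up date:
template_location PubsubToAvro 2021-03-15
```

The printed value can then be passed as the `--templateLocation` argument when building a template.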

PubsubToAvro in subscription mode

PubsubToAvro can be built in 2 ways:

  • With an input topic: you define the input topic, and the job automatically creates a subscription every time it starts. Google builds this variant, but we don't want it.
  • With an input subscription: you define the input subscription yourself. This is the variant we need. Since Google does not publish it in its public bucket, the instructions below explain how to build it.

To build a new version and upload it to GCS, run the following command (put the actual date inside!):

mvn compile exec:java \
-Dexec.mainClass=com.google.cloud.teleport.templates.PubsubToAvro \
-Dexec.cleanupDaemonThreads=false \
-Dexec.args=" \
--project=epidemic-data-infra \
--stagingLocation=gs://dataflow-staging-europe-west1-105942741667/staging \
--tempLocation=gs://dataflow-staging-europe-west1-105942741667/temp \
--templateLocation=gs://dataflow-staging-europe-west1-105942741667/templates/google/[DATE in YYYY-MM-DD]/PubsubToAvro \
--runner=DataflowRunner \
--region=europe-west1 \
--useSubscription=true"

And then just the the folder.
