mgloder / stream-processing-workshop

Demo project for GDELT

Home Page: https://machinedoll-wonderland.com/

License: MIT License

Scala 80.32% Dockerfile 7.98% Shell 11.69%
pravega flink demo-app gdelt-events

Stream-processing-workshop

Current Status

Trello Board

Data Source: GDELT

The GDELT Project

"Supported by Google Jigsaw, the GDELT Project monitors the world's broadcast, print, and web news from nearly every corner of every country in over 100 languages and identifies the people, locations, organizations, themes, sources, emotions, counts, quotes, images and events driving our global society every second of every day, creating a free open platform for computing on the entire world." (Quoted from the official GDELT website.)

Workflow

Requirements

Install and run Pravega

[Option 1] from installation package

[Option 2] from docker

docker run -it -e HOST_IP=<ip> -p 9090:9090 -p 12345:12345 pravega/pravega:latest standalone
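
After starting the container, it can help to confirm that the two published ports (9090 and 12345) accept connections before running any jobs. The following is a minimal sketch of such a check. `wait_for_port` is a hypothetical helper, not part of Pravega, and it relies on bash's `/dev/tcp` redirection, so it assumes the script is run with bash.

```shell
#!/usr/bin/env bash
# Hypothetical helper: poll a TCP port once per second until it accepts
# a connection or MAX_SECONDS attempts have been made.
# Usage: wait_for_port HOST PORT [MAX_SECONDS]
wait_for_port() {
  local host="$1"
  local port="$2"
  local max="${3:-30}"
  local i=0
  while [ "$i" -lt "$max" ]; do
    # bash-only: opening /dev/tcp/HOST/PORT attempts a TCP connect.
    # The subshell keeps the file descriptor from leaking into the caller.
    if (exec 3<>"/dev/tcp/${host}/${port}") 2>/dev/null; then
      return 0
    fi
    sleep 1
    i=$((i + 1))
  done
  return 1
}
```

For example, `wait_for_port localhost 9090 60 && wait_for_port localhost 12345 60` returns success only once both ports are reachable.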

Install and run Flink

Run on docker

  • docker pull flink:scala_2.11
  • cd references/flink-docker
  • docker-compose up

Install and run Kafka

Quick start

Create a new topic

  • Example
    • kafka-topics.sh --create --bootstrap-server localhost:9092 --replication-factor 1 --partitions 1 --topic test

Start a producer

  • Example
    • kafka-console-producer.sh --broker-list localhost:9092 --topic test

Start a consumer

  • Example
    • kafka-console-consumer.sh --bootstrap-server localhost:9092 --topic test --from-beginning
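
The three steps above can be chained into a quick smoke test that sends one record through the broker and reads it back. This is a sketch only. It assumes a single local broker at localhost:9092, the topic `test` used by the producer and consumer examples, and the Kafka `bin` directory on `PATH`. `run` and `kafka_smoke_test` are hypothetical helper names, and `DRY_RUN=1` prints the commands instead of executing them.

```shell
#!/usr/bin/env bash
# Defaults match the examples above; override through the environment.
BROKER="${BROKER:-localhost:9092}"
TOPIC="${TOPIC:-test}"

# Hypothetical helper: print the command when DRY_RUN=1, otherwise run it.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# Hypothetical helper: create the topic, produce one record, consume one record.
kafka_smoke_test() {
  local message="${1:-hello}"
  # One broker is assumed, so the replication factor cannot exceed 1.
  run kafka-topics.sh --create --if-not-exists --bootstrap-server "$BROKER" \
    --replication-factor 1 --partitions 1 --topic "$TOPIC"
  # The console producer reads one record per line from stdin.
  # (Kafka 2.5 and later also accept --bootstrap-server here.)
  run kafka-console-producer.sh --broker-list "$BROKER" --topic "$TOPIC" <<EOF
$message
EOF
  # Read a single record back, giving up after 10 seconds of silence.
  run kafka-console-consumer.sh --bootstrap-server "$BROKER" --topic "$TOPIC" \
    --from-beginning --max-messages 1 --timeout-ms 10000
}
```

Running `DRY_RUN=1 kafka_smoke_test ping` first is a cheap way to inspect the exact commands before pointing them at a real broker.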

Install and run Druid

Download Druid

Run Druid

bin/supervise -c quickstart/tutorial/conf/tutorial-cluster.conf

Modify tutorial-cluster.conf as needed.

Run Jobs

Option 4

Export Data Producer

--class com.machinedoll.projectdemo.jobs.option4.ExportDataProducer

Export Data Consumer

--class com.machinedoll.projectdemo.jobs.option4.ExportDataConsumer
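
The `--class` values above are arguments to a job submission command, which this README does not spell out. As a rough sketch, assuming the jobs are packaged into one jar and submitted with the Flink CLI (`flink run`), they could be launched as shown below. `submit_job` is a hypothetical helper, and `JOB_JAR` must be set to wherever the built jar actually lives, since that path is not given here.

```shell
#!/usr/bin/env bash
# Hypothetical helper: submit one job class with the Flink CLI.
# Assumption: the jobs are Flink jobs; JOB_JAR points at the project's built jar.
submit_job() {
  local job_class="$1"
  # Abort with a message if the jar path has not been provided.
  : "${JOB_JAR:?set JOB_JAR to the path of the built job jar}"
  if [ "${DRY_RUN:-0}" = "1" ]; then
    # Print the command instead of running it.
    echo "flink run --class $job_class $JOB_JAR"
  else
    flink run --class "$job_class" "$JOB_JAR"
  fi
}

# Usage:
#   JOB_JAR=<path-to-jar> submit_job com.machinedoll.projectdemo.jobs.option4.ExportDataProducer
#   JOB_JAR=<path-to-jar> submit_job com.machinedoll.projectdemo.jobs.option4.ExportDataConsumer
```

Because `flink run` normally stays attached to a streaming job, each submission may need its own terminal, or the `-d` (detached) flag.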

Contributors

dependabot[bot], machine-agent, mgloder
