Giter Site home page Giter Site logo

ranglang / hydra Goto Github PK

View Code? Open in Web Editor NEW

This project forked from pluralsight/hydra

0.0 2.0 0.0 2.41 MB

A real-time data replication platform that "unbundles" the receiving, transforming, and transport of data streams.

License: Apache License 2.0

Scala 95.22% Java 4.76% TSQL 0.02%

hydra's Introduction

Hydra

Build Status codecov Join the chat at https://gitter.im/pluralsight/hydra

Hydra is a real-time streaming and data replication platform that "unbundles" the receiving, transforming, and production of data streams.

It does so by abstracting these phases independently from each other while providing a unifying API across them.

Why Replication?

The goal behind Hydra's replication protocol is to separate the ingestion of events from any transformation and storage.

This replication paradigm allows a single event to be ingested, transformed, and then replicated in real-time to several different data systems (Kafka, Postgres, etc.)

Replication Phases

Receive

The Receive phase receives "raw" data, converts it to a Hydra request containing a payload and metadata, and broadcasts the request to the underlying Hydra Ingestors.

Ingestion

This phase involves matching the request with one or more ingestors, performing any data transformation and sending the request to a Transport. Complex streaming operations, such as joins, cross-stream aggregations are outside the scope of the ingestion phase.

Transport

The transport phase of the protocol the step at which the events are actually sent (produced) to the underlying data system.

Hydra Modules

Common

A set of common util-based classes meant to be shared across the entire Hydra ecosystem, including hydra-spark and dispatch modules.

Core

Core, shared traits and classes that define both the ingestion and transport protocols in Hydra. This module is a dependency to any ingestor or transport implementation.

This also includes Avro schema resolution, validation, and management.

Ingest

The ingestion implementation, including HTTP endpoints, actors, communication protocols, registration of ingestors, and a lot of other stuff.

Kafka

A Transport implementation that replicates messages into Kafka.

JDBC

A transport implementation that replicates messages into databases via JDBC.

Building Hydra

Hydra is built using SBT. To build Hydra, run:

sbt clean compile

Running Hydra

We provide a sandbox module with some simple ingestors that can be run from SBT to test Hydra's main functionality.

The walk-through below shows you how to use Hydra with some example Ingetors, by running Hydra in local development mode in SBT. This is not an example of usage in production.

From SBT

You need to have SBT installed.

We are using Spray's SBT Revolver plugin. In order to run the example sandbox project, fire up sbt and then type:

sbt 
> sandbox/reStart

SBT Revolver forks Hydra in a separate process. If you make a code change, simply type re-start again at the SBT shell prompt, it will compile your changes and restart the process. It enables very fast turnaround cycles.

Once Hydra is up and running you can send an HTTP request:

curl localhost:8080/ingest -d 'this is a test' -H "logging-enabled:true" -H "hydra-file-stream:default"

You should see a 'this is a test' message transported (or replicated) to both the sbt console and the /tmp/hydra-sandbox.txt file.

Using Docker

We provide a docker compose file that can be used to bootstrap Hydra.

Online Documentation

We highly recommend checking out the project documentation here. There you can find the latest documentation about the ingestion protocol, Akka actors, including examples, API endpoints, and a lot more info on how to get started.

This README file only contains basic definitions and set up instructions.

Main Features

Message Delivery Guarantees

Contribution and Development

Contributions via Github Pull Request are welcome.

Profiling software provided by

YourKit supports open source projects with its full-featured Java Profiler. YourKit, LLC is the creator of YourKit Java Profiler and YourKit .NET Profiler, innovative and intelligent tools for profiling Java and .NET applications.

Contact

Try using the gitter chat link above!

License

Apache 2.0, see LICENSE.md

hydra's People

Contributors

roarking avatar flybonzai avatar ivanwakeup avatar lewisjkl avatar bretrbs avatar bpetersen avatar seanag0234 avatar dvannoy avatar w-aldrich avatar josep2 avatar znickle24 avatar sullis avatar slynnda avatar james-hemphill avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.