Giter Site home page Giter Site logo

zhuohuwu0603 / azure-event-hubs-spark Goto Github PK

View Code? Open in Web Editor NEW

This project forked from azure/azure-event-hubs-spark

0.0 0.0 0.0 19.56 MB

Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs

License: Apache License 2.0

Scala 99.99% Shell 0.01%

azure-event-hubs-spark's Introduction

Azure Event Hubs + Apache Spark Connector

Azure Event Hubs Connector for Apache Spark

chat on gitter build status star our repo

This is the source code of the Azure Event Hubs Connector for Apache Spark.

Azure Event Hubs is a highly scalable publish-subscribe service that can ingest millions of events per second and stream them into multiple applications. Spark Streaming and Structured Streaming are scalable and fault-tolerant stream processing engines that allow users to process huge amounts of data using complex algorithms expressed with high-level functions like map, reduce, join, and window. This data can then be pushed to filesystems, databases, or even back to Event Hubs.

By making Event Hubs and Spark easier to use together, we hope this connector makes building scalable, fault-tolerant applications easier for our users.

Latest Releases

Spark

Spark Version Package Name Package Version
Spark 2.4 azure-eventhubs-spark_2.11 Maven Central
Spark 2.4 azure-eventhubs-spark_2.12 Maven Central
Spark 2.3 azure-eventhubs-spark_2.11 Maven Central
Spark 2.2 azure-eventhubs-spark_2.11 Maven Central
Spark 2.1 azure-eventhubs-spark_2.11 Maven Central

Databricks

Databricks Runtime Version Artifact Id Package Version
Databricks Runtime 6.X azure-eventhubs-spark_2.11 Maven Central
Databricks Runtime 6.X azure-eventhubs-spark_2.12 Maven Central
Databricks Runtime 5.X azure-eventhubs-spark_2.11 Maven Central
Databricks Runtime 5.X azure-eventhubs-spark_2.12 Maven Central
Databricks Runtime 4.X azure-eventhubs-spark_2.11 Maven Central
Databricks Runtime 3.5 azure-eventhubs-spark_2.11 Maven Central

Roadmap

There is an open issue for each planned feature/enhancement.

Usage

Linking

For Scala/Java applications using SBT/Maven project definitions, link your application with the artifact below. Note: See Latest Releases to find the correct artifact for your version of Apache Spark (or Databricks)!

groupId = com.microsoft.azure
artifactId = azure-eventhubs-spark_2.11
version = 2.3.15

or

groupId = com.microsoft.azure
artifactId = azure-eventhubs-spark_2.12
version = 2.3.15

Documentation

Documentation for our connector can be found here. The integration guides there contain all the information you need to use this library.

If you're new to Apache Spark and/or Event Hubs, then we highly recommend reading their documentation first. You can read Event Hubs documentation here, documentation for Spark Streaming here, and, the last but not least, Structured Streaming here.

FAQ

We maintain an FAQ - reach out to us via gitter if you think anything needs to be added or clarified!

Further Assistance

If you need additional assistance, please don't hesitate to ask! General questions and discussion should happen on our gitter chat. Please open an issue for bug reports and feature requests! Feedback, feature requests, bug reports, etc are all welcomed!

Contributing

If you'd like to help contribute (we'd love to have your help!), then go to our Contributor's Guide for more information.

Build Prerequisites

In order to use the connector, you need to have:

More details on building from source and running tests can be found in our Contributor's Guide.

Build Command

// Builds jar and runs all tests
mvn clean package

// Builds jar, runs all tests, and installs jar to your local maven repository
mvn clean install

azure-event-hubs-spark's People

Contributors

ammills01 avatar arijitt avatar basilhariri avatar codingcat avatar danmmsft avatar edouardpoitras avatar fokko avatar gison93 avatar gordontyler avatar jaceklaskowski avatar jgiardin avatar lenadroid avatar lucarosellini avatar myasuka avatar nyaghma avatar ppatierno avatar qaemma avatar romitgirdhar avatar sabeegrewal avatar sardinois avatar shanyu avatar sjkwak avatar slyons avatar spacerangerwes avatar sreeramgarlapati avatar tilumi avatar vjrantal avatar xaviergeerinck avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.