Giter Site home page Giter Site logo

zzzhacker / twitter-kafka-streaming-service Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 32 KB

This repository showcases a Kafka-based data streaming application that streams Twitter data using various related technologies such as Kafka brokers, schema registry, and Kafka Proxy. It includes custom stream classes, Consumer and Producer classes, with and without Avro serialization.

Python 100.00%
avro-kafka avro-schema-registry kafka kafka-consumer kafka-producer kafka-streams

twitter-kafka-streaming-service's Introduction

Twitter-Kafka-Streaming-Service

This project is a demonstration of how to stream Twitter data using Kafka and various related technologies such as ZooKeeper, Kafka Proxy, multiple Kafka brokers, Schema Registry, and Schema Registry-UI services. In addition, I developed custom Twitter stream classes, Consumer and Producer classes, with and without Avro serialization and connected it with Confluent schema registry.

What I Have Done

  • Created custom Twitter stream classes, Consumer and Producer classes, to stream Twitter data into Kafka
  • Set up a Kafka cluster with multiple brokers for efficient processing of Twitter data
  • Implemented schema registry for storing and managing data schemas, and ensured compatibility between the producer and consumer schemas
  • Configured Kafka Proxy for secure and reliable communication between the Kafka cluster and other services
  • Connected a user interface for Schema Registry using the Schema Registry-UI service

Technologies Used

  • Python
  • Kafka
  • ZooKeeper
  • Kafka Proxy
  • Schema Registry
  • Schema Registry-UI
  • Docker

What I Learned

  • Working with various components of a Kafka-based data streaming system such as brokers, schema registry, Kafka Proxy, and custom stream classes
  • Efficient processing of high volumes of data using a distributed messaging system
  • Ensuring data compatibility and consistency with the help of schema registry
  • Working with Docker to containerize Kafka streaming applications and dependencies for easy deployment and scaling
  • Insights into the nuances of working with Twitter's API and real-time data streaming from social media platforms
  • Becoming a more proficient data engineer with a solid understanding of data streaming using Kafka.

twitter-kafka-streaming-service's People

Contributors

zzzhacker avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.