
vadymurupa / apache-pinot-from-docker


Apache Pinot is a real-time distributed online analytical processing (OLAP) datastore. Use Pinot to ingest and immediately query data from streaming or batch data sources (including Apache Kafka, Amazon Kinesis, Hadoop HDFS, Amazon S3, Azure ADLS, and Google Cloud Storage).

apache-pinot-from-docker's Introduction

Apache Pinot

Original sources:

https://www.youtube.com/watch?v=cNnwMF0pOJ8&list=PLihIrF0tCXdeimVCZwuejXb7FkjsyN9_k&index=8

https://github.com/npawar/pinot-tutorial

This repository was created to build and gather all services in one docker-compose file.

Apache Pinot is a real-time OLAP data store that can provide ultra-low latency even at high throughput. It can ingest data from batch data sources such as Hadoop, S3, Azure, and Google Cloud Storage, or from streaming data sources such as Kafka, EventHub, and Kinesis.

How to set up a Pinot cluster

In the tutorial, we will set up a Pinot cluster with the following components:

  • 1 zookeeper
  • 3 controllers
  • 2 brokers
  • 2 servers

Once the cluster is up and running, we will see how to load data into Pinot and query it. At the end, we show how Pinot is resilient to failures.

sudo docker-compose exec pinot-controller-1 bin/pinot-admin.sh AddTable -tableConfigFile /rawData/transcript-table-offline.json -schemaFile /rawData/transcript-schema.json -controllerHost pinot-controller-1 -controllerPort 9001 -exec

Here -controllerHost pinot-controller-1 indicates that the command should access the pinot-controller-1 service, which is one of the services defined in your docker-compose.yml file.

This command will start the process of adding the table and schema to Pinot using the files located in the mounted /rawData folder inside the container.
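Because -controllerHost names a docker-compose service, the same command can in principle be pointed at another controller. The sketch below only builds and prints the command string so it can be reviewed before running. The function name build_add_table_cmd is hypothetical, and any service name or port other than pinot-controller-1 and 9001 is an assumption that has to match your docker-compose.yml.

```shell
#!/usr/bin/env bash
# Sketch: build the AddTable command for a chosen controller service.
# Only pinot-controller-1 and port 9001 come from the README; any other
# service name or port is an assumption about your docker-compose.yml.

build_add_table_cmd() {
  # $1: controller service name, $2: controller port (defaults from the README)
  local controller="${1:-pinot-controller-1}"
  local port="${2:-9001}"
  # echo joins its arguments with single spaces, giving a one-line command
  echo "docker-compose exec ${controller} bin/pinot-admin.sh AddTable" \
    "-tableConfigFile /rawData/transcript-table-offline.json" \
    "-schemaFile /rawData/transcript-schema.json" \
    "-controllerHost ${controller} -controllerPort ${port} -exec"
}

# Print the command for review; nothing is executed against the cluster here.
build_add_table_cmd pinot-controller-1 9001
```

To actually run the printed command, copy it into a terminal (prefixing sudo if your Docker setup requires it, as in the command above).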
