Giter Site home page Giter Site logo

jacobwgillespie / hydra Goto Github PK

View Code? Open in Web Editor NEW

This project forked from hydradatabase/hydra

0.0 1.0 0.0 4.1 MB

A modern, open source replacement for enterprise data warehouses

License: Apache License 2.0

Shell 0.95% Ruby 0.07% Python 4.78% C 74.36% Go 3.06% Makefile 1.21% PLpgSQL 13.97% HCL 0.91% M4 0.47% Dockerfile 0.21%

hydra's Introduction

Hydra - the open source data warehouse

The world’s fastest Postgres for analytics

Hydra is a modern, open source replacement for enterprise data warehouses. It’s fast and feature-rich so devs can build better analytics, quicker.

Hydra implements an open source columnar engine to Postgres, driving 23X query performance, better cache hit rates, and scalability over basic Postgres. When comparing to traditional warehouses, Hydra delivers 1500X more throughput to enable realtime analytical workloads.

Contents

💪 Benchmarks

Results in seconds, smaller is better.
Hydra - the open source data warehouse

Review Clickbench for comprehensive results and the list of 42 queries tested.

This benchmark represents typical workload in the following areas: clickstream and traffic analysis, web analytics, machine-generated data, structured logs, and events data. It covers the typical queries in ad-hoc analytics and real-time dashboards.

Hydra - the open source data warehouse

Transactions / Second (TPS)

Hydra delivers 1500X more throughput than traditional warehouses to enable realtime analytical workloads. This is accomplished with transactional heap tables.

Hydra Redshift
TPS 21988 15

View detailed results

🚀 Quick Start

Run Hydra locally

The Hydra Docker image is a drop-in replacement for postgres Docker image.

You can also try out Hydra locally using docker-compose.

git clone https://github.com/hydradatabase/hydra && cd hydra
cp .env.example .env
docker compose up
psql postgres://postgres:[email protected]:5432

Use Hydra Cloud

Hydra Cloud is the fastest and most reliable way to run Hydra. It is a cloud-based data warehouse that allows you to consolidate data from various sources into a single, unified system. It provides a user-friendly interface for automated data ingestion and transformation.

Hydra Cloud provides a scalable and secure cloud environment where automatic backups, resource scaling, high availability, point-in-time recovery, and more is available instantly with new databases.

Sign up for Hydra Cloud and get a free, managed database.

🎁 Features

🐘 hosted postgres database - docs
📊 columnar store with updates and deletes- docs
🔀 query parallelization
🔍 vectorized execution of WHERE clauses
🌐 external tables - docs

Hydra - the open source data warehouse

Read documentation on using Hydra’s columnar table access method.

🤝 Community and Status

DEVELOPERS.md for contributing and building the image.
Discord discussion with the Community and Hydra team
GitHub Discussions for longer topics
GitHub Issues for bugs and missing features
Blog for latest announcements, tutorials, product updates
@hydradatabase for the tweets, memes, and social posts
Docs for Hydra features and warehouse ops

follow the repo

  • Private Alpha: Limited to select design partners
  • Public Beta: Talk with Hydra team to learn more
  • Hydra 1.0 Release: Generally Available (GA) and ready for production use

Coming Soon

Watch releases of this repo to get notified of updates.

  • 🧹 vacuum stripe optimizations and space reclamation
  • 🏎️ vectorized execution of aggregate functions
  • 🚅 use of SIMD in vectorized execution
  • ↔️ separation of compute and storage

📝 License

Hydra is only possible by building on the shoulders of giants.

The code in this repo is licensed under:

The docker image is built on the Postgres docker image, which contains a large number of open source projects, including:

  • Postgres - the Postgres license
  • Debian or Alpine Linux image, depending on the image used
  • Hydra includes the following additional software in the image:
    • multicorn - BSD license
    • mysql_fdw - MIT-style license
    • parquet_s3_fdw - MIT-style license
    • pgsql-http - MIT license

As for any pre-built image usage, it is the image user's responsibility to ensure that any use of this image complies with any relevant licenses for all software contained within.

hydra's People

Contributors

0xflotus avatar amirzahre avatar georglink avatar jerrysievert avatar jhydra12 avatar mkaruza avatar nikolays avatar owenthereal avatar wuputah avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.