Topic: bigdata
Something interesting about bigdata
bigdata,Upserts, Deletes And Incremental Processing on Big Data.
Organization: apache
Home Page: https://hudi.apache.org/
bigdata,Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
Organization: apache
Home Page: https://celeborn.apache.org/
bigdata,Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
Organization: apache
Home Page: https://livy.apache.org/
bigdata,Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.
Organization: apache
bigdata,Fast topic modeling platform
Organization: bigartm
Home Page: http://bigartm.org/
bigdata,Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.
Organization: byzer-org
Home Page: https://www.byzer.org
bigdata,This is a repo with links to everything you'd ever want to learn about data engineering
Organization: dataexpert-io
bigdata,Data, Analytics & AI. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com
Organization: datafuselabs
Home Page: https://docs.databend.com
bigdata,.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Organization: dotnet
Home Page: https://dot.net/spark
bigdata,A data integration framework
Organization: dtstack
Home Page: https://dtstack.github.io/chunjun/
bigdata,A book about running Elasticsearch
User: fdv
Home Page: https://fdv.github.io/running-elasticsearch-fun-profit/
bigdata,Lightweight real-time big data streaming engine over Akka
Organization: gearpump
Home Page: https://gearpump.github.io/gearpump/
bigdata,GridDB is a next-generation open source database that makes time series IoT and big data fast and easy.
Organization: griddb
Home Page: https://griddb.org/
bigdata,A beginner's guide to big data :star:
User: heibaiying
bigdata,:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Organization: hi-primus
Home Page: https://hi-optimus.com
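bigdata,100+ eye-catching HTML5 templates for big data visualization dashboards, covering industries such as community, property management, government, transportation, and finance and banking. Billed as the newest, largest, most complete, and most striking collection of big data visualization templates available. Updated continuously.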
User: igaowei
Home Page: https://igaowei.github.io/BigDataView/
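bigdata,The Lazy Squirrel (懒松鼠) Flink-Boot scaffold brings Flink fully into the Spring ecosystem, so developers can write distributed stream-processing programs in the style of Java web development. It aims to lower the cost of getting started (no need to understand distributed computing theory or the details of the Flink framework) so that developers can quickly write business code. To make large projects more agile, the scaffold integrates the Spring framework for bean management by default, along with frameworks commonly used in microservices and web development, such as the MyBatis ORM framework, the Hibernate Validator validation framework, and the Spring Retry framework. See the scaffold features below for details.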
User: intsmaze
bigdata,An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
User: jadianes
bigdata,Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
User: jadianes
Home Page: http://jadianes.github.io/spark-py-notebooks
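bigdata,Study notes, plus e-books and video resources I have gone through and a collection of blogs, websites, and tools I consider good. Covers the major big data components, Python machine learning and data analysis, Linux, operating systems, algorithms, networking, and more.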
User: josonle
bigdata,JuiceFS is a distributed POSIX file system built on top of Redis and S3.
Organization: juicedata
Home Page: https://juicefs.com
bigdata,A batch scheduler for Kubernetes for high-performance workloads, e.g. AI/ML, BigData, HPC
Organization: kubernetes-retired
bigdata,🔨 Generate structured SQL statements from JSON. Built with Vue3 + TypeScript + Vite + Ant Design + MonacoEditor. The project is simple (heavy on logic, light on pages) and well suited for practice.
User: liyupi
Home Page: http://sql.yupi.icu
bigdata,Distributed Big Data Orchestration Service
Organization: netflix
Home Page: https://netflix.github.io/genie
bigdata,A curated list of awesome big data frameworks, resources and other awesomeness.
User: newtendermint
Home Page: https://github.com/onurakpolat/awesome-bigdata
bigdata,First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
Organization: opendatadiscovery
Home Page: https://opendatadiscovery.org
bigdata,CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Organization: seagate
Home Page: https://github.com/Seagate/cortx
bigdata,An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.
User: shzlw
Home Page: https://shzlw.github.io/poli
bigdata,TDengine is an open source, high-performance, cloud native time-series database optimized for Internet of Things (IoT), Connected Cars, Industrial IoT and DevOps.
Organization: taosdata
Home Page: https://tdengine.com
bigdata,TensorBase is a new big data warehousing with modern efforts.
Organization: tensorbase
Home Page: https://tensorbase.io/
bigdata,Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
Organization: vaexio
Home Page: https://vaex.io
bigdata,GUI-based Python code generator for data science, extension to Jupyter Lab, Jupyter Notebook and Google Colab.
User: visualpython
Home Page: https://www.visualpython.ai
bigdata,A Cloud Native Batch System (Project under CNCF)
Organization: volcano-sh
Home Page: https://volcano.sh
bigdata,Focused on big data learning and interview preparation; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...
User: wangzhiwubigdata
bigdata,WeDataSphere is a financial grade, one-stop big data platform suite.
Organization: webankfintech
bigdata,Google, Naver multiprocess image web crawler (Selenium)
User: yoongikim
bigdata,Data syncing in golang for ClickHouse.
Organization: zeromicro