Topic: hadoop Goto Github
Some thing interesting about hadoop
Some thing interesting about hadoop
hadoop,Alluxio, data orchestration for analytics and machine learning in the cloud
Organization: alluxio
Home Page: https://www.alluxio.io
hadoop,Apache Calcite
Organization: apache
Home Page: https://calcite.apache.org/
hadoop,High performance data store solution
Organization: apache
Home Page: carbondata.apache.org
hadoop,Apache Hadoop
Organization: apache
Home Page: https://hadoop.apache.org/
hadoop,Apache Ignite
Organization: apache
Home Page: https://ignite.apache.org/
hadoop,Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Organization: apache
Home Page: https://kyuubi.apache.org/
hadoop,Apache Nutch is an extensible and scalable web crawler
Organization: apache
Home Page: https://nutch.apache.org/
hadoop,Scalable, redundant, and distributed object store for Apache Hadoop
Organization: apache
Home Page: https://ozone.apache.org
hadoop,Apache Hadoop docker image
Organization: big-data-europe
hadoop,Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Organization: cerndb
Home Page: http://joerihermans.com/work/distributed-keras/
hadoop,Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learn...
Organization: deeplearning4j
Home Page: http://deeplearning4j.konduit.ai
hadoop,Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
User: donnemartin
hadoop,Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display
Organization: dtstack
Home Page: https://dtstack.github.io/Taier/
hadoop,The GIS Tools for Hadoop are a collection of GIS tools for spatial analysis of big data.
Organization: esri
Home Page: http://esri.github.io/gis-tools-for-hadoop/
hadoop,深圳地铁大数据客流分析系统🚇🚄🌟
User: geekyouth
Home Page: https://github.com/geekyouth/SZT-bigdata
hadoop,H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Organization: h2oai
Home Page: http://h2o.ai
hadoop,1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..
User: harisekhon
Home Page: https://www.linkedin.com/in/HariSekhon
hadoop,80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
User: harisekhon
Home Page: https://www.linkedin.com/in/HariSekhon
hadoop,50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian, Fedora, Ubuntu, Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak
User: harisekhon
Home Page: https://www.linkedin.com/in/HariSekhon
hadoop,450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
User: harisekhon
Home Page: https://www.linkedin.com/in/HariSekhon
hadoop,大数据入门指南 :star:
User: heibaiying
hadoop,At LinkedIn, we are using this curriculum for onboarding our entry-level talents into the SRE role.
Organization: linkedin
Home Page: https://linkedin.github.io/school-of-sre/
hadoop,基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
User: luckyzxl2016
hadoop,MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
Organization: moosefs
Home Page: https://moosefs.com
hadoop,More than 2000+ Data engineer interview questions.
User: obenner
hadoop,电商用户行为分析大数据平台
User: oeljeklaus-you
hadoop,AI on Hadoop
Organization: qihoo360
hadoop,定期更新Hadoop生态圈中常用大数据组件文档 重心依次为: Flink Solr Sparksql ES Scala Kafka Hbase/phoenix Redis Kerberos (项目包含hadoop思维导图 印象笔记 Scala版本简单demo 常用工具类 去敏后的train code 持续更新!!!)
User: realguoshuai
hadoop,Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Organization: spotify
hadoop,🏆 实时 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构 🏆 Real-Time coding-free, powerful and secure ORM 🚀 providing APIs and Docs without coding by Backend, and the returned JSON of API can be customized by Frontend(Client) users
Organization: tencent
Home Page: http://apijson.cn
hadoop,Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
Organization: teradata
Home Page: http://kylo.io
hadoop,Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
User: tomwhite
Home Page: http://www.hadoopbook.com/
hadoop,TonY is a framework to natively run deep learning frameworks on Apache Hadoop.
Organization: tony-framework
Home Page: https://tony-project.ai
hadoop,Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Organization: trinodb
Home Page: https://trino.io
hadoop,专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
User: wangzhiwubigdata
hadoop,DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Organization: webankfintech
Home Page: https://github.com/WeBankFinTech/DataSphereStudio-Doc
hadoop,WeDataSphere is a financial grade, one-stop big data platform suite.
Organization: webankfintech
hadoop,Addax is a versatile open-source ETL tool that can seamlessly transfer data between various RDBMS and NoSQL databases, making it an ideal solution for data migration.
User: wgzhao
Home Page: https://wgzhao.github.io/Addax/
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.