Oswaldo Fuenmayor's Projects
Materials for a 3-day instructor-led course on applying machine learning
Docker Apache Airflow
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
Distributed Big Data Orchestration Service
Code for Head First Design Patterns book (2014)
Example code from Learning Spark book
Mastering Apache Spark 2
Mirror of Apache Spark
Benchmark Suite for Apache Spark
Wordcount example using Spark with Scala
Native, optimized access to HBase data through Spark SQL/DataFrame interfaces
New branch for compatibility with CDH, based on this repo: https://github.com/Huawei-Spark/Spark-SQL-on-HBase
Base classes to use when writing tests with Spark
A tool for monitoring and tuning Spark jobs for efficiency
Sqoop on Apache Spark Engine