YouTube Dataset Analysis - MapReduce Design Patterns
University project for the Advanced Hadoop MapReduce Programming (ADBMS) course.
• Used the Apache Hadoop framework to analyze a YouTube dataset with HDFS, MapReduce design patterns, Pig, and HBase.
• Extended the MapReduce job with additional file-merging code, a combiner optimization, and a partitioning step.
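
To illustrate how a combiner and a partitioner fit into a MapReduce job, here is a minimal in-memory sketch in plain Java. It is not the project's code and does not use the Hadoop API. It assumes a hypothetical tab-separated record layout (videoId, category) and counts videos per category. The class name, method names, and sample records are all invented for illustration. The partition function uses the same hash-modulo idea as Hadoop's default HashPartitioner.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// In-memory sketch of map -> combine -> partition -> reduce (no Hadoop dependency).
class CombinerPartitionerSketch {

    // Map + combine: instead of emitting (category, 1) for every record,
    // pre-aggregate counts per input split so less data reaches the reducers.
    // Assumed record layout (hypothetical): "videoId<TAB>category".
    static Map<String, Integer> mapAndCombine(List<String> split) {
        Map<String, Integer> partial = new HashMap<>();
        for (String line : split) {
            String[] fields = line.split("\t");
            if (fields.length < 2) {
                continue; // skip malformed records
            }
            partial.merge(fields[1], 1, Integer::sum);
        }
        return partial;
    }

    // Partition: choose a reducer index for a key via non-negative hash modulo.
    static int partition(String key, int numReducers) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReducers;
    }

    // Driver: route each split's combined output to a reducer, which sums
    // the partial counts, then merge all reducer outputs into one sorted map.
    static Map<String, Integer> run(List<List<String>> splits, int numReducers) {
        List<Map<String, Integer>> reducers = new ArrayList<>();
        for (int i = 0; i < numReducers; i++) {
            reducers.add(new HashMap<>());
        }
        for (List<String> split : splits) {
            for (Map.Entry<String, Integer> e : mapAndCombine(split).entrySet()) {
                int target = partition(e.getKey(), numReducers);
                reducers.get(target).merge(e.getKey(), e.getValue(), Integer::sum);
            }
        }
        Map<String, Integer> result = new TreeMap<>();
        for (Map<String, Integer> reducerOutput : reducers) {
            result.putAll(reducerOutput); // each key lives in exactly one reducer
        }
        return result;
    }

    public static void main(String[] args) {
        // Two hypothetical input splits with made-up records.
        List<List<String>> splits = Arrays.asList(
                Arrays.asList("v1\tMusic", "v2\tComedy", "v3\tMusic"),
                Arrays.asList("v4\tMusic", "v5\tSports"));
        System.out.println(run(splits, 2)); // prints {Comedy=1, Music=3, Sports=1}
    }
}
```

In this sketch the first split sends two pairs to the reducers (Music=2, Comedy=1) instead of three raw (category, 1) pairs. That reduction in shuffled data is the saving a combiner is meant to provide. Because every occurrence of a key hashes to the same reducer index, the final counts do not depend on the number of reducers.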