Ajit Prasad's Projects

bigdata_casestudyii_hive-hbase

This case study assignment consolidates the concepts learnt during the Hive & HBase sessions of the course.

bigdata_casestudyiv_sparkstreaming

Demo of a Spark application that reads data from the local file system and applies transformations & actions on the fly. As new files arrive in the directory, the application picks them up automatically and applies successive transformations & actions, producing the desired output as it goes. The application also moves the files onto HDFS, performs transformations & actions there as well, and then compares the two results to produce a final success or failure message depending on whether the output files match.

bigdata_session12assignment12.1

Analysis & import of Twitter data onto HDFS. Tweets are searched by keywords that need to be added to the configuration file and are then streamed in using a Flume agent.

bigdata_session14assignment14.1

Scala basics. Exploring filter, apply, map, lambdas, count, length & various String functions. Exploring lists, tuples, arrays, sets, nested tuples & more.

bigdata_session15assignment15.1

Scala applications to find the GCD (Greatest Common Divisor) of two numbers, the Fibonacci series using a standard loop & recursion, and the square root of a number using the Babylonian method of divide & average.
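
The three calculations named above can be sketched as follows. This is a minimal illustration in plain Python rather than the repository's Scala code, which is not shown on this page. The use of Euclid's algorithm for the GCD, the series starting at 0, the tolerance value, and all function names are assumptions made for the example.

```python
def gcd(a, b):
    """Greatest common divisor via Euclid's algorithm (assumed variant)."""
    a, b = abs(a), abs(b)
    while b:
        a, b = b, a % b
    return a


def fib_loop(n):
    """First n Fibonacci numbers using a standard loop, starting at 0."""
    series = []
    current, following = 0, 1
    for _ in range(n):
        series.append(current)
        current, following = following, current + following
    return series


def fib_recursive(n):
    """n-th Fibonacci number (0-indexed) using plain recursion."""
    if n < 2:
        return n
    return fib_recursive(n - 1) + fib_recursive(n - 2)


def babylonian_sqrt(x, tolerance=1e-12):
    """Square root by the Babylonian method: divide, then average."""
    if x < 0:
        raise ValueError("square root of a negative number is not real")
    if x == 0:
        return 0.0
    guess = x if x >= 1 else 1.0
    while True:
        # Divide x by the guess, then average the quotient with the guess.
        next_guess = (guess + x / guess) / 2
        if abs(next_guess - guess) <= tolerance * next_guess:
            return next_guess
        guess = next_guess
```

The recursive Fibonacci version recomputes subproblems and is only practical for small n; the loop version runs in linear time.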

bigdata_session19assignment19.1

Analysis of a college student dataset using Spark RDDs. Demo of various RDD operations such as countByValue, groupBy, groupByKey, reduceByKey, etc. Demo of map, flatMap, split, explicit, filter, type conversion, and finding the sum, count, distinct, aggregate & length of an RDD. Demo of String & Int comparisons, union & intersection on RDDs.
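
To make the semantics of a few of these operations concrete, here is a small sketch that mimics countByValue, reduceByKey, groupByKey, union and intersection on in-memory Python collections. It does not use Spark, and the sample rows, field layout and helper names are hypothetical; the actual dataset is not shown on this page.

```python
from collections import Counter, defaultdict

# Hypothetical rows of (name, subject, marks); not the real dataset.
records = [
    ("Asha", "maths", 80),
    ("Ravi", "maths", 65),
    ("Asha", "science", 90),
    ("Meena", "science", 70),
]


def count_by_value(items):
    """Count how often each distinct value occurs (like countByValue)."""
    return dict(Counter(items))


def reduce_by_key(pairs, func):
    """Merge the values of each key with func (like reduceByKey)."""
    result = {}
    for key, value in pairs:
        result[key] = func(result[key], value) if key in result else value
    return result


def group_by_key(pairs):
    """Collect all values of each key into a list (like groupByKey)."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


subject_counts = count_by_value([subject for _, subject, _ in records])
total_marks = reduce_by_key(
    [(name, marks) for name, _, marks in records], lambda a, b: a + b
)
marks_by_subject = group_by_key((subject, marks) for _, subject, marks in records)

# Set operations stand in for union and intersection on two collections.
maths_students = {name for name, subject, _ in records if subject == "maths"}
science_students = {name for name, subject, _ in records if subject == "science"}
in_either = maths_students | science_students
in_both = maths_students & science_students
```

One difference worth noting: an RDD union keeps duplicate elements, whereas the set union used here drops them.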

bigdata_session1assignment1.1

This assignment helps consolidate the concepts learned in the opening session of BigData Engineering with Hadoop & Spark.

bigdata_session22assignment22.1

Understanding Kafka: starting a broker/leader that pushes messages to be consumed by a consumer. Demonstrating ways to create multi-node replicas with multiple partitions. Different ways to create a topic (keyed/keyless) based on the key property. Formatting messages with a custom key-value separator & reading the consumed messages at the consumer in a custom format using the key separator property.
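
The key separator idea can be illustrated without a running broker. The sketch below is plain Python, not Kafka client code: it shows how a typed line might be split into key and value on a custom separator, and how a consumed record might be printed back with one. The function names, the tab default, the error on a missing separator, and the "null" rendering of a keyless record are assumptions made for this example.

```python
def parse_keyed_line(line, separator="\t"):
    """Split an input line into (key, value) at the first separator."""
    key, found, value = line.partition(separator)
    if not found:
        # Assumed behaviour for the sketch: a keyed line must contain the separator.
        raise ValueError(f"no separator {separator!r} found in line: {line!r}")
    return key, value


def format_record(key, value, separator="\t"):
    """Render a consumed record as key + separator + value."""
    shown_key = "null" if key is None else key
    return f"{shown_key}{separator}{value}"


# A keyed message typed with ":" as separator, then printed with "-".
key, value = parse_keyed_line("user1:hello world", separator=":")
keyed_output = format_record(key, value, separator="-")

# A keyless message has no key to show.
keyless_output = format_record(None, "hello world", separator="-")
```

Only the first occurrence of the separator splits the line, so the value itself may contain the separator character.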

bigdata_session4assignment4.1

This assignment consolidates the concepts learned during the MapReduce Introduction session of the course.

bigdata_session5assignment5.1

This assignment consolidates the concepts learned during the Advanced MapReduce session of the course.

bigdata_session6assignment6.1

This assignment consolidates the concepts learnt during the Data Ingestion Tool Sqoop session of the course.

bigdata_session7assignment7.1

This assignment consolidates the concepts learnt during the Exploring Apache Pig session of the course.
