iajitprasad Goto Github PK
Name: Ajit Prasad
Type: User
Bio: Learner
Location: India
Name: Ajit Prasad
Type: User
Bio: Learner
Location: India
This assignment is aimed at consolidating the concepts that was learnt during the MapReduce & Apache Pig.
This case study assignment is aimed at consolidating the concepts that was learnt during the various session of Hive & HBase of the course.
Working with Sensor using Apache Spark, Spark SQL
Demo of Spark application that reads the data from local file system & does some transformation & action on the fly. As & when files keep coming within the directory the application fetches the file automatically & performs successive transformation & action there by providing the desired output simultaneously. Application also moves the files onto HDFS perform some transformations & actions on HDFS as well & then compares both the results to produce a final message of success/failure in case the output file matches/does_not_match.
Analysis of Hospital Data from United Stated using Apache Spark, Spark SQL
This assignment is aimed at consolidating the concepts that was learnt during the HBase Basics session of the course.
Advance HBase concepts. Importing large files from HDFS into HBase tables directly. Understanding HBase read/write & Architecture.
Analysis & import of Twitter Data onto HDFS. Tweet search based on certain keys that needs to added within configuration file & them Streaming Data in using Flume agent
Scala Basics. Exploring filter, apply, map, Lambda, count, length & various String Functions. Exploring list, tuples, arrays, sets, nested tuples & many more
Scala applications to find GCD (Greatest Common Divisor ) of two numbers, Fibonacci Series using Standard Loop & Recursion Method, and Square Root of a number using Babylonian Method of Divide & Average.
Creating a simple calculator using Scala
Inheritance & Multiple Inheritance, Partial Function and Match Case in Scala
Solving exercise problems using Spark RDD. Limitations of MapReduce and Features & Operations of Spark RDD.
Analysis of a college student dataset using Spark RDD. Demo of various operations on RDD such as countByValue, groupBy, groupByKey, reduceByKey,etc. Demo of map, flatmap, split, explicit, filter, type conversion, finding sum, count, distinct, aggregate, length of RDD. Demo of String, int comparisons, UNION & intersetion on RDD's.
This assignment helps to consolidate the concept learned in the opening session of BigData Engineering with Hadoop & Spark.
Travel Data Exercise using Scala SQL
Sports Data Exercise using Scala SQL and UDFs.
Understanding Kafka, starting a broker/leader that pushes messages to be consumed at consumer. Demonstrating ways to create multi-node replicas with multiple partitions. Different ways to create a topic(keyed/keyless) based on the key property. Formatting messages with custom values of key value separator & reading the same consumed messages at consumer using custom format using key separator property.
Apache Kafka. Working with Datasets using Java Classes.
Apache Spark Streaming examples using Netcat. Read streams of data to find sum, also find out the offensive words.
This assignment is aimed at consolidating the concepts that was learned during the HDFS session of the course.
This assignment is aimed at consolidating the concepts that was learned during the YARN session of the course.
This assignment is aimed at consolidating the concepts that was learned during the MapReduce Introduction session of the course.
This assignment is aimed at consolidating the concepts that was learned during the Advance MapReduce session of the course.
This assignment is aimed at consolidating the concepts that was learnt during the Data Indigestion Tool Sqoop session of the course.
This assignment is aimed at consolidating the concepts that was learnt during the Exploring Apache Pig session of the course.
This assignment is aimed at consolidating the concepts that was learnt during the Hive Basics session of the course.
This assignment is aimed at consolidating the concepts that was learnt during the Advance Hive session of the course.
Twitter's Effective Scala Guide
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.