Datasets:
1. department_dataset
2. employee_bonus
3. employee_dataset
Required JARs:
1. commons-csv-1.5
2. spark-csv_2.10-1.5.0
Start the Spark shell with both JARs:
spark-shell --master yarn --jars commons-csv-1.5.jar,spark-csv_2.10-1.5.0.jar
-
Create a relation from the employee dataset with an explicit schema (column names and data types) and name it employee. Once created, describe the relation.
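One way to approach this in the shell started above is to pass an explicit `StructType` to the spark-csv reader. The following is only a sketch: it assumes Spark 1.4 or later (for `sqlContext.read`), the `sqlContext` that the Spark 1.x shell predefines, a header line in the file, and a hypothetical file path. The column names and types are also made up, since the dataset layout is not given here.

```scala
import org.apache.spark.sql.types.{StructType, StructField, IntegerType, StringType}

// Hypothetical columns: replace with the real layout of employee_dataset.
val employeeSchema = StructType(Seq(
  StructField("emp_id", IntegerType, nullable = false),
  StructField("emp_name", StringType, nullable = true),
  StructField("dept_id", IntegerType, nullable = true)
))

// Hypothetical path; a bare path resolves against the cluster's default filesystem.
val employeePath = "employee_dataset.csv"

val employee = sqlContext.read
  .format("com.databricks.spark.csv")
  .option("header", "true")   // assumption: the first line holds column names
  .schema(employeeSchema)     // explicit schema instead of inferring types
  .load(employeePath)

employee.registerTempTable("employee")  // name the relation so SQL queries can use it
employee.printSchema()                  // "describe": prints column names and types
```

The same pattern, with a different schema and path, would apply to the department and bonus datasets.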
-
Select the columns from the employee relation. Display 10 of these records on your screen.
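A minimal sketch of selecting columns and showing 10 rows, assuming the Spark 1.x shell and its predefined `sqlContext`. The in-memory rows and the column names are hypothetical stand-ins so the snippet runs on its own; in the exercise, `employee` would be the relation loaded from the dataset.

```scala
import sqlContext.implicits._  // enables toDF; already in scope in the Spark 1.x shell

// Stand-in rows with hypothetical columns.
val employee = Seq(
  (1, "Ann", 10),
  (2, "Bob", 20),
  (3, "Cy", 10)
).toDF("emp_id", "emp_name", "dept_id")

// Project the columns of interest, then print at most 10 rows.
val selected = employee.select("emp_id", "emp_name", "dept_id")
selected.show(10)
```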
-
Create a relation from the department dataset with an explicit schema (column names and data types) and name it departments. Once created, describe the relation.
-
Select the columns from the department relation. Display 10 of these records on your screen.
-
Create a relation from the bonus dataset with an explicit schema (column names and data types) and name it bonus. Once created, describe the relation.
-
Select the columns from the bonus relation. Display 10 of these records on your screen.
-
Join the employee and department datasets to display the department name and department address (as three separate columns: street, city, and state) for each row in employee. Display 10 of these records on your screen.
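The join could be sketched as below, assuming the Spark 1.x shell and its predefined `sqlContext`. The sample rows, the column names, and the idea that street, city, and state are already stored as separate columns are all assumptions. A left outer join is used here so that every employee row is kept even if its department is missing.

```scala
import sqlContext.implicits._  // enables toDF; already in scope in the Spark 1.x shell

// Hypothetical stand-ins for the employee and departments relations.
val employee = Seq((1, "Ann", 10), (2, "Bob", 20)).toDF("emp_id", "emp_name", "dept_id")
val departments = Seq(
  (10, "Sales", "1 Main St", "Austin", "TX"),
  (20, "Ops", "2 Oak Ave", "Denver", "CO")
).toDF("dept_id", "dept_name", "street", "city", "state")

// Keep every employee row and attach the matching department columns.
val empWithDept = employee
  .join(departments, employee("dept_id") === departments("dept_id"), "left_outer")
  .select(
    employee("emp_id"), employee("emp_name"),
    departments("dept_name"),
    departments("street"), departments("city"), departments("state"))

empWithDept.show(10)
```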
-
Display all the employees who received a bonus. The result should not include any employee who did not receive one. Display 10 of these records on your screen.
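An inner join is one natural fit here, because it drops employees with no matching bonus row. This sketch assumes the Spark 1.x shell and its predefined `sqlContext`; the sample rows and the column names (`emp_id`, `bonus`) are hypothetical.

```scala
import sqlContext.implicits._  // enables toDF; already in scope in the Spark 1.x shell

// Hypothetical stand-ins: employee 2 has no bonus row.
val employee = Seq((1, "Ann", 10), (2, "Bob", 20), (3, "Cy", 10))
  .toDF("emp_id", "emp_name", "dept_id")
val bonus = Seq((1, 500.0), (3, 250.0)).toDF("emp_id", "bonus")

// Inner join keeps only employees that appear in the bonus relation.
val withBonus = employee
  .join(bonus, employee("emp_id") === bonus("emp_id"), "inner")
  .select(employee("emp_id"), employee("emp_name"), bonus("bonus"))

withBonus.show(10)
```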
-
Display the average bonus by department.
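A possible sketch of the aggregation, assuming the Spark 1.x shell and its predefined `sqlContext`. The rows and column names are hypothetical, and so is the assumption that the bonus data links to a department only through the employee relation. Note that this version averages over employees who received a bonus; employees without one are not counted as zero.

```scala
import sqlContext.implicits._  // enables toDF; already in scope in the Spark 1.x shell
import org.apache.spark.sql.functions.avg

// Hypothetical stand-ins for the employee and bonus relations.
val employee = Seq((1, "Ann", 10), (2, "Bob", 20), (3, "Cy", 10))
  .toDF("emp_id", "emp_name", "dept_id")
val bonus = Seq((1, 500.0), (2, 200.0), (3, 300.0)).toDF("emp_id", "bonus")

// Attach each bonus to its employee's department, then average per department.
val avgBonus = employee
  .join(bonus, employee("emp_id") === bonus("emp_id"))
  .groupBy(employee("dept_id"))
  .agg(avg(bonus("bonus")).as("avg_bonus"))

avgBonus.show()
```

To show department names instead of IDs, the result could be joined to the departments relation on `dept_id`.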
-
Display the number of employees in each department.
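Counting rows per department can be sketched with `groupBy` and `count`, assuming the Spark 1.x shell and its predefined `sqlContext`. The sample rows and the `dept_id` column name are hypothetical.

```scala
import sqlContext.implicits._  // enables toDF; already in scope in the Spark 1.x shell

// Hypothetical stand-in for the employee relation.
val employee = Seq((1, "Ann", 10), (2, "Bob", 20), (3, "Cy", 10))
  .toDF("emp_id", "emp_name", "dept_id")

// One output row per department, with a "count" column holding the headcount.
val headcount = employee.groupBy("dept_id").count()
headcount.show()
```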