Objective: Building a ML system using PySpark by analysing the employee data and build a model to predict employee compensation.
mandira-sawkar / getpaid Goto Github PK
View Code? Open in Web Editor NEWCreated a SparkML RandomForest model to predict total employee compensation. Queried data with SparkSQL, ran PySpark scripts to run EDA, pre-process data, and train model achieving with 0.98 R2 score.