Scala and spark for statistics data of I/O
sudo apt-get update
sudo apt-get install default-jdk
sudo apt-get install scala
Download latest Spark https://spark.apache.org/downloads.html
Create a spark folder in /usr/local/ after untar the tgz:
sudo tar xvf spark-2.X.X-bin-hadoop2.7.tgz -C /usr/local/spark
Add Spark path to bash file:
vim ~/.bashrc
Add below code snippet to the bash file:
SPARK_HOME=/usr/local/spark
export PATH=$SPARK_HOME/bin:$PATH
source ~/.bashrc