Installing Spark
Installing Spark
Installing Spark
MANAGEMENT
Installing Spark
Steps for Installing spark
1.Download and Install Hadoop
Hadoop can be downloaded from the link below:
http://hadoop.apache.org/releases.html
Because this version does not include windows 10 binary, refer the
to tutorial for installing hadoop in Windows environment:
Steps for Installing spark
2. Download and Install Java virtual machine.
The latest version can be downloaded from
http://www.oracle.com/technetwork/java/javase/downloads/inde
x.html
Spark folder within drive and be used to storing the unzipped file.
Steps for Installing spark
5. Create SPARK HOME variable
This is an example.
Steps for Installing spark
6. edit the Path and add SPARK_HOME variable as follows
%SPARK_HOME%\bin
Example
Steps for Installing spark
5. Grant permission to temp folder
Create a temp folder under c:\tmp
Execute below command as administrator.
winutils.exe chmod 777 C:\tmp
Steps for Installing spark
6. Test spark by typing "spark-shell“ command on the prompt
Example
Closing Spark Shell
To close Spark shell, press Ctrl+D or type in :q
Example
Connecting spark with python
Pyspark command is used to connect spark with python
Example
References for installing Spark
https://hernandezpaul.wordpress.com/2016/01/24/apache-spark-
installation-on-windows-10/
https://github.com/Cheng-Lin-Li/Spark/wiki/How-to-install-Spark-
2.1.0-in-Windows-10-environment