* Update macos.md added a useful information regarding the path where java is installed, if someone has problems to find it * Update macos.md again, just made clear how to get the path to the installation (spark)
1.6 KiB
MacOS
Here we'll show you how to install Spark 3.2.1 for MacOS. We tested it on MacOS Monterey 12.0.1, but it should work for other MacOS versions as well
Installing Java
Ensure Brew and Java installed in your system:
xcode-select –install
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/master/install.sh)"
brew install java
Add the following environment variables to your .bash_profile or .zshrc:
export JAVA_HOME=/usr/local/Cellar/openjdk@11/11.0.12
export PATH="$JAVA_HOME/bin/:$PATH"
Make sure Java was installed to /usr/local/Cellar/openjdk@11/11.0.12: Open Finder > Press Cmd+Shift+G > paste "/usr/local/Cellar/openjdk@11/11.0.12". If you can't find it, then change the path location to appropriate path on your machine. You can also run brew info java to check where java was installed on your machine.
Installing Spark
- Install Scala
brew install scala@2.11
- Install Apache Spark
brew install apache-spark
- Add environment variables:
Add the following environment variables to your .bash_profile or .zshrc. Replace the path to SPARK_HOME to the path on your own host. Run brew info apache-spark to get this.
export SPARK_HOME=/usr/local/Cellar/apache-spark/3.2.1/libexec
export PATH="$SPARK_HOME/bin/:$PATH"
Testing Spark
Execute spark-shell and run the following in scala:
val data = 1 to 10000
val distData = sc.parallelize(data)
distData.filter(_ < 10).collect()
PySpark
It's the same for all platforms. Go to pyspark.md.