Deployment of Spark on a separate machine
MashZone NextGen Explorer integrates Apache Spark via Spark JobServer. As a result, MashZone NextGen Explorer and Spark do not have to be deployed on the same machine: you can install MashZone NextGen Explorer on server A and Spark JobServer on server B, where B can be a more capable machine than A.
Procedure
1. Copy the folder <MashZone NextGen Explorer installation>/va-sjs to the target machine where you want to run Spark JobServer.
2. Start Spark JobServer on the target machine by running one of the following:
va-sjs\bin\va-sjs.bat (Windows)
va-sjs/bin/va-sjs (Linux / Unix derivatives)
3. On the MashZone NextGen Explorer machine, open the config.json configuration file in a text editor.
4. Add the following lines and save your changes.
"spark.enabled" : true,
"spark.jobserver.url" : "https://<target machine IP>:8090",
Here, <target machine IP> is the IPv4 address of the machine running Spark JobServer. For example, use "spark.jobserver.url" : "https://172.16.0.1:8090", if the server has the IP address 172.16.0.1.
Your settings are applied.
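Steps 3 and 4 can also be scripted. The following Python sketch is illustrative only and not part of the product: the helper names jobserver_url and with_spark_settings are hypothetical, and it assumes that config.json is plain JSON and that the two settings belong at the top level of the JSON object. Check both assumptions against your installation before using it.

```python
import ipaddress
import json


def jobserver_url(ip: str, port: int = 8090) -> str:
    """Build the Spark JobServer URL from an IPv4 address.

    Raises ValueError if `ip` is not a valid IPv4 address.
    """
    address = ipaddress.IPv4Address(ip)  # validates the address
    return f"https://{address}:{port}"


def with_spark_settings(config_text: str, ip: str) -> str:
    """Return the config text with the two Spark settings added.

    Assumption: the settings are top-level keys of a plain JSON object.
    Existing keys are preserved; existing Spark settings are overwritten.
    """
    config = json.loads(config_text)
    config["spark.enabled"] = True
    config["spark.jobserver.url"] = jobserver_url(ip)
    return json.dumps(config, indent=2)


if __name__ == "__main__":
    # Example with an in-memory config; "some.other.setting" is made up.
    original = '{"some.other.setting": 1}'
    print(with_spark_settings(original, "172.16.0.1"))
```

To apply this to a real installation, read config.json into a string, pass it to with_spark_settings together with the IP address of the target machine, and write the result back. Keep a backup copy of the original file first.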