In this post I will show you how to submit a Spark job from Java code.
Typically, we submit Spark jobs to "Spark Cluster" and Hadoop/YARN by
using the $SPARK_HOME/bin/spark-submit shell script. Submitting
Spark job from a shell script limits programmers when they want to submit
Spark jobs from Java code (such as Java servlets or other Java code such
as REST servers).
spark-2.0.0
This package contains the following programs, which submit Spark jobs to Spark Cluster or to Hadoop/YARN.
| Program/File | Description |
|---|---|
| ConfigurationManager.java | Creates an Hadoop Configuration object |
| SubmitSparkJobToClusterFromJavaCode.java | Submits a Spark job to Spark cluster from Java code |
| SubmitSparkJobToYARNFromJavaCode.java | Submits a Spark job to Hadoop/YARN from Java code |
| SubmitSparkPiToClusterFromJavaCode.java | Submits a SparkPi to Spark cluster from Java code |
| SubmitSparkPiToClusterFromJavaCode.log | log file |
| SubmitSparkPiToYARNFromJavaCode.java | Submits a SparkPi to Hadoop/YARN from Java code |
| SubmitSparkPiToYARNFromJavaCode.log | log file |
| SubmitSparkPiToYARNFromJavaCode.stderr.html | stderr file |
| SubmitSparkPiToYARNFromJavaCode.stdout.html | stdout file |
Before running your Java code, make sure that the HDFS output directory does not exist:
hadoop fs -rm -R /friends/output
best regards,
Mahmoud Parsian
