
[SPARK-46912] Use worker JAVA_HOME and SPARK_HOME instead of from submitter #44943


Closed
wants to merge 1 commit

Conversation

thanhdanh1803

What changes were proposed in this pull request?

Replace the submitter's JAVA_HOME and SPARK_HOME with the worker's own values when building the localCommand.
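
For context, a minimal sketch of the idea, with illustrative names (this is not the actual patch, and the helper below is not Spark's real API): the worker overlays the environment captured from the submitter with its own JAVA_HOME and SPARK_HOME before launching the driver.

// Minimal sketch of the proposed behavior; all names are illustrative,
// not the actual change to Spark's command-building code.
object WorkerEnvOverride {
  /** Prefer the worker's own JAVA_HOME and SPARK_HOME over the submitter's. */
  def withWorkerPaths(submitterEnv: Map[String, String]): Map[String, String] = {
    val overrides = Seq("JAVA_HOME", "SPARK_HOME").flatMap { key =>
      sys.env.get(key).map(key -> _) // the worker's value wins when it is set
    }
    submitterEnv ++ overrides
  }

  def main(args: Array[String]): Unit = {
    // Environment as captured on the submitter (machine A):
    val fromSubmitter = Map("JAVA_HOME" -> "/usr/java/default")
    // After the overlay, the worker's JAVA_HOME (if set) is used instead.
    println(withWorkerPaths(fromSubmitter))
  }
}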

Why are the changes needed?

There is a problem when submitting a job in cluster mode to a standalone cluster: the worker starts the driver's Java process using the submitter's JAVA_HOME instead of its own.

Does this PR introduce any user-facing change?

No

Was this patch authored or co-authored using generative AI tooling?

No

@github-actions github-actions bot added the CORE label Jan 30, 2024
@srowen
Member

srowen commented Jan 30, 2024

Hm, how does JAVA_HOME get here from the 'submitter'? What do you mean, the application submitter? But the worker is already running by that point.

@thanhdanh1803
Author

> Hm, how does JAVA_HOME get here from the 'submitter'? What do you mean, the application submitter? But the worker is already running by that point.

  • The submitter is the client machine that runs the spark-submit command (with --deploy-mode cluster).
  • The worker is already running at this point, but the driver is not. When the master receives a submit request, it starts a driver on a worker; at that point, the driver takes the command and environment variables from the submit command and uses them in its session (see the sketch below). It sounds weird, but that is what I am facing in my case.
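
To illustrate the flow described above, a self-contained sketch; the case classes are simplified stand-ins for Spark's internal driver-description messages, not Spark's actual code:

// Illustrative sketch only: the environment map captured on the submitter
// travels inside the driver description and is later used verbatim by the
// worker when it launches the driver JVM.
case class Command(mainClass: String, environment: Map[String, String])
case class DriverDescription(command: Command)

object SubmitFlowSketch {
  def main(args: Array[String]): Unit = {
    // Captured on the submitter (machine A): JAVA_HOME is the client's path.
    val submitted = DriverDescription(
      Command("org.apache.spark.examples.SparkPi",
              Map("JAVA_HOME" -> "/usr/java/default")))

    // Later, on the worker: java is resolved from the carried-over
    // environment, i.e. the client's path on the worker's filesystem.
    val javaBin = submitted.command.environment("JAVA_HOME") + "/bin/java"
    println(javaBin) // prints /usr/java/default/bin/java, which may not exist on the worker
  }
}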


We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable.
If you'd like to revive this PR, please reopen it and ask a committer to remove the Stale tag!

@github-actions github-actions bot added the Stale label May 11, 2024
@github-actions github-actions bot closed this May 12, 2024
@jywjyw

jywjyw commented Dec 6, 2024

It's a bug. Try this command on machine A:

bin/spark-submit --master spark://{REMOTE}:7077 --deploy-mode cluster  --class org.apache.spark.examples.SparkPi  file:///opt/spark/examples/jars/spark-examples_2.12-3.5.3.jar 

It will submit the application to the standalone cluster (important: deploy-mode=cluster), and then this error occurs:

Exception from cluster was: java.io.IOException: Cannot run program "/usr/java/default//bin/java" (in directory "/opt/bitnami/spark/work/driver-20241206013526-0001"): error=2, No such file or directory

This happens because the worker node looked for Java at "/usr/java/default//bin/java", but that is machine A's Java path, not the worker's.
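
A small sketch of why the failure only surfaces when the driver is launched (the path and the check are illustrative; the real error is raised by the OS when the worker starts the process):

import java.io.File

// Illustrative only: the worker builds the java path from the carried-over
// JAVA_HOME and fails at exec time when that path does not exist locally.
object JavaPathCheck {
  def main(args: Array[String]): Unit = {
    val carriedJavaHome = "/usr/java/default/"          // machine A's JAVA_HOME
    val javaExe = new File(carriedJavaHome, "bin/java") // resolved on the worker
    if (!javaExe.exists()) {
      // Same condition behind "error=2, No such file or directory".
      println(s"Cannot run program: ${javaExe.getPath}")
    }
  }
}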

@MeltonSmith

I've opened a new PR: #51314
