[jira] [Created] (FLINK-1556) JobClient does not wait until a job failed completely if submission exception

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-1556) JobClient does not wait until a job failed completely if submission exception

Shang Yuanchun (Jira)
Till Rohrmann created FLINK-1556:
------------------------------------

             Summary: JobClient does not wait until a job failed completely if submission exception
                 Key: FLINK-1556
                 URL: https://issues.apache.org/jira/browse/FLINK-1556
             Project: Flink
          Issue Type: Bug
            Reporter: Till Rohrmann


If an exception occurs during job submission the {{JobClient}} received a {{SubmissionFailure}}. Upon receiving this message, the {{JobClient}} terminates itself and returns the error to the {{Client}}. This indicates to the user that the job has been completely failed which is not necessarily true.

If the user directly after such a failure submits another job, then it might be the case that not all slots of the formerly failed job are returned. This can lead to a {{NoRessourceAvailableException}}.

We can solve this problem by waiting for the completion of the job failure in the {{JobClient}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)