Till Rohrmann created FLINK-16866:
-------------------------------------
Summary: Make job submission non-blocking
Key: FLINK-16866
URL:
https://issues.apache.org/jira/browse/FLINK-16866 Project: Flink
Issue Type: Improvement
Components: Runtime / Coordination
Affects Versions: 1.10.0, 1.9.2, 1.11.0
Reporter: Till Rohrmann
Fix For: 1.11.0
Currently, Flink waits to acknowledge a job submission until the corresponding {{JobManager}} has been created. Since its creation also involves the creation of the {{ExecutionGraph}} and potential FS operations, it can take a bit of time. If the user has configured a too low {{web.timeout}}, the submission can time out only reporting a {{TimeoutException}} to the user.
I propose to change the notion of job submission slightly. Instead of waiting until the {{JobManager}} has been created, a job submission is complete once all job relevant files have been uploaded to the {{Dispatcher}} and the {{Dispatcher}} has been told about it. Creating the {{JobManager}} will then belong to the actual job execution. Consequently, if problems occur while creating the {{JobManager}} it will result into a job failure.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)