Till Rohrmann created FLINK-5197:
------------------------------------
Summary: Late JobStatusChanges can interfere with running jobs
Key: FLINK-5197
URL:
https://issues.apache.org/jira/browse/FLINK-5197 Project: Flink
Issue Type: Bug
Components: JobManager
Affects Versions: 1.1.3, 1.2.0
Reporter: Till Rohrmann
Assignee: Till Rohrmann
Priority: Minor
Fix For: 1.2.0, 1.1.4
When the {{JobManager}} receives a {{JobStatusChanged}} message, it will look up the {{ExecutionGraph}} for the given {{JobID}}. If there is no {{ExecutionGraph}}, then a {{RemoveJob}} message is sent to itself. In the general case, this is not problematic, because the {{RemoveJob}} message won't do anything if there is no {{ExecutionGraph}}. However, since this is an asynchronous call, it can happen that the corresponding job of the {{JobID}} is recovered before receiving the {{RemoveJob}} message. In this case, the newly recovered job would be removed.
I propose to change the behaviour such that a {{JobStatusChanged}} for a non-existing {{ExecutionGraph}} will be simply ignored.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)