Zhu Zhu created FLINK-20626:
-------------------------------
Summary: Canceling a job when it is failing will result in job hanging in CANCELING state
Key: FLINK-20626
URL: https://issues.apache.org/jira/browse/FLINK-20626
Project: Flink
Issue Type: Bug
Components: Runtime / Coordination
Affects Versions: 1.11.2, 1.12.0
Reporter: Zhu Zhu
Assignee: Zhu Zhu
Fix For: 1.13.0, 1.11.4, 1.12.1
If a user manually cancels a job while it is failing (here, failing means the job has encountered an unrecoverable failure and is about to fail), the job will hang in the `CANCELING` state and cannot terminate. The cause is that DefaultScheduler currently always tries to transition from `FAILING` to `FAILED` to terminate the job. However, canceling the job changes its status to `CANCELING`, so the transition to `FAILED` does not succeed.
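The sequence described above can be illustrated with a minimal, self-contained sketch. This is not Flink's actual code: the class `JobStatusSketch`, its `Status` enum, and the methods `transition`, `cancel`, and `simulateCancelWhileFailing` are hypothetical names chosen for illustration, and the compare-and-set style transition is an assumption about how such a state machine might guard its status.

```java
import java.util.concurrent.atomic.AtomicReference;

/** Hypothetical, simplified model of job status transitions; not Flink's real classes. */
class JobStatusSketch {

    /** Status names follow the spelling used in this issue. */
    enum Status { RUNNING, FAILING, FAILED, CANCELING, CANCELED }

    private final AtomicReference<Status> status = new AtomicReference<>(Status.RUNNING);

    /** Moves to {@code next} only if the current status equals {@code expected}. */
    boolean transition(Status expected, Status next) {
        return status.compareAndSet(expected, next);
    }

    Status current() {
        return status.get();
    }

    /** Assumed cancel behavior: any non-terminal status is moved to CANCELING. */
    void cancel() {
        Status s = status.get();
        while (s != Status.FAILED && s != Status.CANCELED) {
            if (status.compareAndSet(s, Status.CANCELING)) {
                return;
            }
            s = status.get();
        }
    }

    /** Replays the reported sequence and returns the status the job ends up in. */
    static Status simulateCancelWhileFailing() {
        JobStatusSketch job = new JobStatusSketch();
        job.transition(Status.RUNNING, Status.FAILING); // unrecoverable failure occurs
        job.cancel();                                   // user cancels while FAILING
        // The scheduler still expects FAILING, so this transition is rejected
        // and nothing else moves the job to a terminal status.
        job.transition(Status.FAILING, Status.FAILED);
        return job.current();
    }

    public static void main(String[] args) {
        System.out.println("Final status: " + simulateCancelWhileFailing());
    }
}
```

In this sketch the job is left in a non-terminal status because the only transition attempted after the cancel assumes the status is still `FAILING`. A fix along these lines would need the terminating step to also handle the case where the status has already changed to `CANCELING`; how the actual patch does this is not stated in this message.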
--
This message was sent by Atlassian Jira
(v8.3.4#803005)