[jira] [Created] (FLINK-20626) Canceling a job when it is failing will result in job hanging in CANCELING state

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-20626) Canceling a job when it is failing will result in job hanging in CANCELING state

Shang Yuanchun (Jira)
Zhu Zhu created FLINK-20626:
-------------------------------

             Summary: Canceling a job when it is failing will result in job hanging in CANCELING state
                 Key: FLINK-20626
                 URL: https://issues.apache.org/jira/browse/FLINK-20626
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Coordination
    Affects Versions: 1.11.2, 1.12.0
            Reporter: Zhu Zhu
            Assignee: Zhu Zhu
             Fix For: 1.13.0, 1.11.4, 1.12.1


If user manually cancels a job when the job is failing(here failing means the job encounters unrecoverable failure and is about to fail),  the job will hang in CANCELING state and cannot terminate. The cause is that DefaultScheduler currently will always try to transition from `FAILING` to `FAILED` to terminate the job. However, job canceling will change job status to `CANCELING` so that the transition to `FAILED` will not success.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)