[jira] [Created] (FLINK-16728) Taskmanager dies after job got stuck and canceling fails



Shang Yuanchun (Jira)
Leonid Ilyevsky created FLINK-16728:
---------------------------------------

             Summary: Taskmanager dies after job got stuck and canceling fails
                 Key: FLINK-16728
                 URL: https://issues.apache.org/jira/browse/FLINK-16728
             Project: Flink
          Issue Type: Bug
    Affects Versions: 1.10.0
            Reporter: Leonid Ilyevsky
         Attachments: taskmanager.log.20200323.gz

At some point I noticed that a few jobs had gotten stuck: they stopped processing messages, which I could detect by watching the expected output. I then tried to cancel them.

The cancel operation failed, reporting that the job was stuck at

StreamTaskActionExecutor$SynchronizedStreamTaskActionExecutor.run(StreamTaskActionExecutor.java:86)

and then the whole taskmanager shut down.

See the attached log.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)