[jira] [Created] (FLINK-1518) Laggy task cancelling on task managers

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-1518) Laggy task cancelling on task managers

Shang Yuanchun (Jira)
Ufuk Celebi created FLINK-1518:
----------------------------------

             Summary: Laggy task cancelling on task managers
                 Key: FLINK-1518
                 URL: https://issues.apache.org/jira/browse/FLINK-1518
             Project: Flink
          Issue Type: Improvement
          Components: Distributed Runtime
    Affects Versions: master
            Reporter: Ufuk Celebi


When cancelling a task on a cluster setup via the web interface I've noticed that task cancellation is laggy. The tasks switch to the CANCELING state, but the cancelling of the tasks takes some time.

There are also corresponding log messages:

{code}
14:29:29,281 WARN  org.apache.flink.runtime.execution.RuntimeEnvironment         - Task CHAIN DataSource (at getTextDataSet(WordCount.java:141) (org.apache.flink.api.java.io.TextInputF
/02/11/2015 14:53:22    CHAIN DataSource (at getTextDataSet(WordCount.java:141) (org.apache.flink.api.java.io.TextInputFormat)) -> FlatMap (FlatMap at main(WordCount.java:67)) -> │ormat)) -> FlatMap (FlatMap at main(WordCount.java:67)) -> Combine(SUM(1), at main(WordCount.java:67) (7/24) did not react to cancelling signal. Sending repeated interrupt.
{code}

Tasks are cancelled at the task managers by repeatedly sending an interrupt signal to the task threads.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)