Ufuk Celebi created FLINK-1518:
----------------------------------
Summary: Laggy task cancelling on task managers
Key: FLINK-1518
URL:
https://issues.apache.org/jira/browse/FLINK-1518 Project: Flink
Issue Type: Improvement
Components: Distributed Runtime
Affects Versions: master
Reporter: Ufuk Celebi
When cancelling a task on a cluster setup via the web interface I've noticed that task cancellation is laggy. The tasks switch to the CANCELING state, but the cancelling of the tasks takes some time.
There are also corresponding log messages:
{code}
14:29:29,281 WARN org.apache.flink.runtime.execution.RuntimeEnvironment - Task CHAIN DataSource (at getTextDataSet(WordCount.java:141) (org.apache.flink.api.java.io.TextInputF
/02/11/2015 14:53:22 CHAIN DataSource (at getTextDataSet(WordCount.java:141) (org.apache.flink.api.java.io.TextInputFormat)) -> FlatMap (FlatMap at main(WordCount.java:67)) -> │ormat)) -> FlatMap (FlatMap at main(WordCount.java:67)) -> Combine(SUM(1), at main(WordCount.java:67) (7/24) did not react to cancelling signal. Sending repeated interrupt.
{code}
Tasks are cancelled at the task managers by repeatedly sending an interrupt signal to the task threads.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)