Stephan Ewen created FLINK-5230:
-----------------------------------
Summary: Safety nets against leaving dysfunctional JobManagers
Key: FLINK-5230
URL:
https://issues.apache.org/jira/browse/FLINK-5230 Project: Flink
Issue Type: Improvement
Components: Distributed Coordination
Reporter: Stephan Ewen
There are certain ways that a {{JobManager}} can become dysfunctional.
If the JobManager process continues to exist (not restarted by YARN / Mesos) etc, but is not doing its work properly and more, it makes the Streaming Job unavailable.
There some safety nets to bring into place for that, see sub issues.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)