[jira] [Created] (FLINK-5230) Safety nets against leaving dysfunctional JobManagers

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-5230) Safety nets against leaving dysfunctional JobManagers

Shang Yuanchun (Jira)
Stephan Ewen created FLINK-5230:
-----------------------------------

             Summary: Safety nets against leaving dysfunctional JobManagers
                 Key: FLINK-5230
                 URL: https://issues.apache.org/jira/browse/FLINK-5230
             Project: Flink
          Issue Type: Improvement
          Components: Distributed Coordination
            Reporter: Stephan Ewen


There are certain ways that a {{JobManager}} can become dysfunctional.

If the JobManager process continues to exist (not restarted by YARN / Mesos) etc, but is not doing its work properly and more, it makes the Streaming Job unavailable.

There some safety nets to bring into place for that, see sub issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)