[jira] [Created] (FLINK-14708) Introduce full restarts failover strategy for NG scheduler

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-14708) Introduce full restarts failover strategy for NG scheduler

Shang Yuanchun (Jira)
Zhu Zhu created FLINK-14708:
-------------------------------

             Summary: Introduce full restarts failover strategy for NG scheduler
                 Key: FLINK-14708
                 URL: https://issues.apache.org/jira/browse/FLINK-14708
             Project: Flink
          Issue Type: Sub-task
          Components: Runtime / Coordination
    Affects Versions: 1.10.0
            Reporter: Zhu Zhu
             Fix For: 1.10.0


This strategy allows users to force all the tasks to be restarted if any task fails.
It may help if it's non-sense or even worse if only part of the tasks can acquire resources and process data. In such cases, checkpoints cannot be completed and progress cannot be persisted.

Supporting this strategy also enables the NG scheduler to be backward compatible for existing failover strategy configuration.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)