[jira] [Created] (FLINK-8360) Implement task-local state recovery

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-8360) Implement task-local state recovery

Shang Yuanchun (Jira)
Stefan Richter created FLINK-8360:
-------------------------------------

             Summary: Implement task-local state recovery
                 Key: FLINK-8360
                 URL: https://issues.apache.org/jira/browse/FLINK-8360
             Project: Flink
          Issue Type: New Feature
          Components: State Backends, Checkpointing
            Reporter: Stefan Richter
            Assignee: Stefan Richter
             Fix For: 1.5.0


This issue tracks the development of recovery from task-local state. The main idea is to have a secondary, local copy of the checkpointed state, while there is still a primary copy in DFS that we report to the checkpoint coordinator.

Recovery can attempt to restore from the secondary local copy, if available, to save network bandwidth. This requires that the assignment from tasks to slots is as sticky is possible.

For starters, we will implement this feature for all managed keyed states and can easily enhance it to all other state types (e.g. operator state) later.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)