Stephan Ewen created FLINK-17674:
------------------------------------
Summary: OperatorCoordinator state in checkpoints should always be a ByteStreamStateHandle
Key: FLINK-17674
URL:
https://issues.apache.org/jira/browse/FLINK-17674 Project: Flink
Issue Type: Bug
Components: Runtime / Checkpointing
Reporter: Stephan Ewen
Assignee: Stephan Ewen
Fix For: 1.11.0
State restore to the task vertices and coordinators (even after loading the Checkpoint Metadata) happens in the JobManager's main thread and must consequently not do any potentially blocking I/O operations.
The OperatorCoordinator state is a generic {{StreamStateHandle}} whose state might require I/O to retrieve. This never happens in the current implementation (we always use {{ByteStreamStateHandle}}) the signatures and contracts don't guarantee that and leave this open for a potential future bug.
Typing the OperatorCoordinator state to ByteStreamStateHandle makes sure that we can always retrieve the data directly without I/O and clarifies that no arbitrary StreamStateHandle is supported at that point.
If state restoring becomes an asynchronous operation we can relax this restriction.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)