[jira] [Created] (FLINK-17122) Support InputSelectable and BoundedMultiInput operators with checkpointing

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-17122) Support InputSelectable and BoundedMultiInput operators with checkpointing

Shang Yuanchun (Jira)
Piotr Nowojski created FLINK-17122:
--------------------------------------

             Summary: Support InputSelectable and BoundedMultiInput operators with checkpointing
                 Key: FLINK-17122
                 URL: https://issues.apache.org/jira/browse/FLINK-17122
             Project: Flink
          Issue Type: Wish
          Components: Runtime / Checkpointing, Runtime / Network
    Affects Versions: 1.10.0, 1.9.2
            Reporter: Piotr Nowojski


Currently when user defined some {{InputSelectable}} or {{BoundedMultiInput}} operators, checkpointing is not supported. Main problem is the that combination of {{InputSelectable}} and barrier alignment can lead to deadlocks (checkpoint barrier stuck on not selected channel).

Problem could be somehow mitigated via unaligned checkpoints (FLINK-14551), but not fully. Even with unaligned checkpoints, checkpoint barriers can be stuck in the job graph if there is a {{flatMap}} operator (or non {{flatMap}} operator but if records are spanning multiple buffers), blocked in the middle of processing by some down stream input selection. In such case we are not able to perform unaligned checkpoint.

Potential solution could be using persistent communication channels or detecting before mentioned situations and avoid the dead lock by spilling excess data.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)