[jira] [Created] (FLINK-12058) Cancel checkpoint operations belonging to a discarded/aborted checkpoint

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-12058) Cancel checkpoint operations belonging to a discarded/aborted checkpoint

Shang Yuanchun (Jira)
Till Rohrmann created FLINK-12058:
-------------------------------------

             Summary: Cancel checkpoint operations belonging to a discarded/aborted checkpoint
                 Key: FLINK-12058
                 URL: https://issues.apache.org/jira/browse/FLINK-12058
             Project: Flink
          Issue Type: Improvement
          Components: Runtime / Checkpointing
    Affects Versions: 1.7.2, 1.8.0
            Reporter: Till Rohrmann


In order to save CPU cycles and reduce disk and network I/O, we should try to cancel local checkpoint operations belonging to discarded aborted or subsumed checkpoints. For example, if a {{Task}} declines a checkpoint, the {{CheckpointCoordinator}} will discard this checkpoint. However, other checkpointing operations belonging to this checkpoint won't be necessarily notified and canceled.

The notification mechanism could piggy back on the existing {{CancelCheckpointMarker}} or be a separate signal sent to all participating {{Tasks}}.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)