Zhijiang created FLINK-18063:
--------------------------------
Summary: Fix the race condition for aborting current checkpoint in CheckpointBarrierUnaligner#processEndOfPartition
Key: FLINK-18063
URL: https://issues.apache.org/jira/browse/FLINK-18063
Project: Flink
Issue Type: Bug
Components: Runtime / Checkpointing
Affects Versions: 1.11.0
Reporter: Zhijiang
Assignee: Zhijiang
Fix For: 1.11.0, 1.12.0
When handling CheckpointBarrierUnaligner#processEndOfPartition, the current checkpoint is aborted only if a checkpoint is pending from the point of view of the task thread. This misses the scenario where the checkpoint was triggered by notifyBarrierReceived from the netty thread.
The proper fix should also check for a pending checkpoint inside ThreadSafeUnaligner, so that it can be aborted and the internal variables reset in that case.
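
To make the race easier to see, here is a minimal, self-contained sketch. It is not the Flink code: the class UnalignerAbortSketch, the nested ThreadSafeState, and every field and method other than the names processEndOfPartition and notifyBarrierReceived are hypothetical stand-ins. It assumes the task thread and the netty thread each track their own view of the pending checkpoint, and contrasts a check on the task-thread view alone with a check that also consults the synchronized state.

```java
public class UnalignerAbortSketch {
    static final long NONE = -1L;

    /** Hypothetical stand-in for the state guarded inside ThreadSafeUnaligner. */
    static final class ThreadSafeState {
        private long pendingCheckpointId = NONE;
        private long lastAbortedId = NONE;
        private int barriersReceived = 0;

        /** Called from the netty thread when a barrier arrives. */
        synchronized void notifyBarrierReceived(long checkpointId) {
            if (checkpointId <= lastAbortedId) {
                return; // barrier belongs to a checkpoint that was already aborted
            }
            if (checkpointId > pendingCheckpointId) {
                pendingCheckpointId = checkpointId;
                barriersReceived = 1;
            } else if (checkpointId == pendingCheckpointId) {
                barriersReceived++;
            }
        }

        synchronized boolean isPending() {
            return barriersReceived > 0;
        }

        /** Aborts the pending checkpoint, resets the counters, returns its id or NONE. */
        synchronized long abortPendingIfAny() {
            if (barriersReceived == 0) {
                return NONE;
            }
            long aborted = pendingCheckpointId;
            lastAbortedId = aborted;
            barriersReceived = 0;
            return aborted;
        }
    }

    // Task-thread view: only updated when the task thread itself consumes a barrier.
    private long consumedCheckpointId = NONE;
    private int barriersConsumed = 0;
    final ThreadSafeState threadSafeState = new ThreadSafeState();

    /** Problematic variant: looks only at the task-thread view. */
    long processEndOfPartitionTaskOnly() {
        long aborted = barriersConsumed > 0 ? consumedCheckpointId : NONE;
        barriersConsumed = 0;
        return aborted;
    }

    /** Sketch of the proposed fix: also abort what the netty thread has started. */
    long processEndOfPartitionFixed() {
        long abortedByTask = processEndOfPartitionTaskOnly();
        long abortedByNetty = threadSafeState.abortPendingIfAny();
        return Math.max(abortedByTask, abortedByNetty);
    }

    public static void main(String[] args) {
        UnalignerAbortSketch sketch = new UnalignerAbortSketch();
        // The netty thread triggers checkpoint 7 before the task thread has seen any barrier.
        sketch.threadSafeState.notifyBarrierReceived(7L);
        System.out.println("task-only abort: " + sketch.processEndOfPartitionTaskOnly());
        System.out.println("fixed abort:     " + sketch.processEndOfPartitionFixed());
    }
}
```

In this sketch the task-only variant reports nothing to abort while the synchronized state still holds a pending checkpoint, which is the missed scenario described above. The second variant aborts it and resets the counters under the same lock.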
--
This message was sent by Atlassian Jira
(v8.3.4#803005)