[jira] [Created] (FLINK-13256) Periodical checkpointing is stopped after failovers


Shang Yuanchun (Jira)
Zhu Zhu created FLINK-13256:
-------------------------------

             Summary: Periodical checkpointing is stopped after failovers
                 Key: FLINK-13256
                 URL: https://issues.apache.org/jira/browse/FLINK-13256
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Checkpointing
    Affects Versions: 1.9.0
            Reporter: Zhu Zhu
         Attachments: 15_15_20__07_15_2019.jpg, jm_no_cp_after_failover.log

In this case, we observed that the job initially triggers periodic checkpoints as expected.

However, after 2 region failovers, no further checkpoints are triggered, even after all tasks are RUNNING again.

A sample log (jm_no_cp_after_failover.log) is attached, along with a picture describing the related topology.

The issue is not reproducible every time.

 



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)