Zhinan Cheng created FLINK-19010:
------------------------------------
Summary: Add a system metric to show the checkpoint restore time
Key: FLINK-19010
URL: https://issues.apache.org/jira/browse/FLINK-19010
Project: Flink
Issue Type: Improvement
Components: Runtime / Metrics
Affects Versions: 1.11.1
Reporter: Zhinan Cheng
Currently, the system metrics only show the downtime when a failure happens. It would be useful to also expose the time taken to restore from the checkpoint, so that users can better understand the bottleneck of failure recovery.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)