Mark Cho created FLINK-11133:
--------------------------------
Summary: FsCheckpointStorage is unaware about S3 entropy when creating directories
Key: FLINK-11133
URL:
https://issues.apache.org/jira/browse/FLINK-11133 Project: Flink
Issue Type: Bug
Affects Versions: 1.7.0
Reporter: Mark Cho
We currently use S3 for our checkpoint storage with S3 entropy enabled.
Entropy seems to be working correctly when writing out checkpoint metadata file (entropy key is correctly stripped from `state.checkpoints.dir`) and when writing out checkpoint data file (entropy key is correctly replaced with random string).
However, from the logs, it seems like entropy key is not stripped or replaced when `FsCheckpointStorage` creates directories in the following class:
[
https://github.com/apache/flink/blob/master/flink-runtime/src/main/java/org/apache/flink/runtime/state/filesystem/FsCheckpointStorage.java#L83-L85]
Should FsCheckpointStorage skip initializing mkdir calls if object store like S3 is used since S3 doesn't have directory concept?
If we want to keep the `mkdir` calls in `FsCheckpointStorage`, we should handle the entropy key specified in `state.checkpoints.dir`. Currently, folder markers in S3 are being created by Hadoop FileSystem with the entropy key in the path as a result of `mkdir` calls in `FsCheckpointStorage`).
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)