[jira] [Created] (FLINK-11318) [Regression] StreamingFileSink can overwrite existing files

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-11318) [Regression] StreamingFileSink can overwrite existing files

Shang Yuanchun (Jira)
Edward Rojas created FLINK-11318:
------------------------------------

             Summary: [Regression] StreamingFileSink can overwrite existing files
                 Key: FLINK-11318
                 URL: https://issues.apache.org/jira/browse/FLINK-11318
             Project: Flink
          Issue Type: Bug
          Components: filesystem-connector
    Affects Versions: 1.7.1
            Reporter: Edward Rojas


StreamingFileSink does not validate if a file with the same name of the new part file already exists and this could result in overwriting a file.

The BucketingSink perform this kind of validations in the "openNewPartFile" method here: 
https://github.com/apache/flink/blob/master/flink-connectors/flink-connector-filesystem/src/main/java/org/apache/flink/streaming/connectors/fs/bucketing/BucketingSink.java#L549-L561

So this seems to be a regression an in the "old" BucketingSink this works.

 

This can be problematic for example when migrating a job using Bucketing to to use the StreamingFileSink, file could be overwritten.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)