[jira] [Created] (FLINK-4329) Streaming File Source Must Correctly Handle Timestamps/Watermarks


Shang Yuanchun (Jira)
Aljoscha Krettek created FLINK-4329:
---------------------------------------

             Summary: Streaming File Source Must Correctly Handle Timestamps/Watermarks
                 Key: FLINK-4329
                 URL: https://issues.apache.org/jira/browse/FLINK-4329
             Project: Flink
          Issue Type: Bug
          Components: Streaming Connectors
    Affects Versions: 1.1.0
            Reporter: Aljoscha Krettek
             Fix For: 1.1.1


The {{ContinuousFileReaderOperator}} does not handle watermarks correctly: it simply passes them through. As a result, when the {{ContinuousFileMonitoringFunction}} closes and emits a {{Long.MAX_VALUE}} watermark, that watermark can "overtake" records that the {{ContinuousFileReaderOperator}} has yet to emit. Combined with the new "allowed lateness" setting in the window operator, this can cause elements to be dropped as late.

Also, {{ContinuousFileReaderOperator}} does not correctly assign ingestion timestamps, because it is not technically a source even though it looks like one to the user.
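
To illustrate the watermark problem described above, here is a toy model in plain Java. It is not Flink code, and it is not the proposed fix: the class name, the string-encoded events, and the simplified lateness rule (an element is late when timestamp + allowed lateness <= current watermark) are all assumptions made for this sketch. It compares a reader that forwards the final watermark immediately with one that holds it back until its pending records have been emitted.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Toy model only: all names here are hypothetical and none of this is Flink API.
class WatermarkHoldbackSketch {

    // Simplified lateness rule, assumed for this sketch.
    static boolean isLate(long timestamp, long allowedLateness, long watermark) {
        return timestamp + allowedLateness <= watermark;
    }

    // Simulates a reader that still has three records pending when the
    // upstream monitoring function closes and sends a Long.MAX_VALUE watermark.
    // Events are encoded as "W:<watermark>" or "R:<record timestamp>".
    static List<String> run(boolean holdBack) {
        List<String> out = new ArrayList<>();
        Deque<Long> pending = new ArrayDeque<>();
        pending.add(1L);
        pending.add(2L);
        pending.add(3L);

        long incomingWatermark = Long.MAX_VALUE;
        Long heldWatermark = null;

        if (holdBack) {
            // Remember the watermark instead of forwarding it right away.
            heldWatermark = incomingWatermark;
        } else {
            // Pass-through behaviour: the watermark overtakes pending records.
            out.add("W:" + incomingWatermark);
        }

        // The reader now emits the records it had buffered.
        while (!pending.isEmpty()) {
            out.add("R:" + pending.poll());
        }

        // Only after all pending records are out is the watermark released.
        if (heldWatermark != null) {
            out.add("W:" + heldWatermark);
        }
        return out;
    }

    // Counts the records a downstream window operator would drop as late.
    static int countDropped(List<String> events, long allowedLateness) {
        long watermark = Long.MIN_VALUE;
        int dropped = 0;
        for (String event : events) {
            long value = Long.parseLong(event.substring(2));
            if (event.startsWith("W:")) {
                watermark = Math.max(watermark, value);
            } else if (isLate(value, allowedLateness, watermark)) {
                dropped++;
            }
        }
        return dropped;
    }

    public static void main(String[] args) {
        System.out.println("pass-through dropped: " + countDropped(run(false), 0L));
        System.out.println("hold-back dropped:    " + countDropped(run(true), 0L));
    }
}
```

In this model the pass-through variant delivers the watermark before any record, so every record counts as late, while the hold-back variant delivers the records first and drops none.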



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)