[jira] [Created] (FLINK-16929) Session Window produces sessions randomly

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-16929) Session Window produces sessions randomly

Shang Yuanchun (Jira)
Ori Popowski created FLINK-16929:
------------------------------------

             Summary: Session Window produces sessions randomly
                 Key: FLINK-16929
                 URL: https://issues.apache.org/jira/browse/FLINK-16929
             Project: Flink
          Issue Type: Bug
    Affects Versions: 1.9.1
            Reporter: Ori Popowski
         Attachments: image-2020-04-01-19-43-07-288.png, image-2020-04-01-19-44-35-714.png

We have a Flink job which keyBys session ID (sId), and uses a session window with 30 minutes gap:
{code:java}
inputStream
    .keyBy(keySelector)
    .window(EventTimeSessionWindows.withGap(Time.minutes(30)))
    .allowedLateness(Time.seconds(0L))
{code}
This Flink job reads from Kinesis stream.

Lately (I suspect after upgrading from 1.5.4 to 1.9.1) we get too many sessions, with gaps of several seconds (instead of 30 minutes).

We have no idea why it's happening and suspect a Flink bug or a state backend bug (we use RocksDB).

I haven't found any indication in the logs except for some read throughput warnings which were resolved by a backoff.

Attached is a table of derived sessions, and then the raw events

*Sessions*

!image-2020-04-01-19-43-07-288.png!

 

*Events*

*!image-2020-04-01-19-44-35-714.png!*

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)