[jira] [Created] (FLINK-20217) More fine-grained timer processing

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-20217) More fine-grained timer processing

Shang Yuanchun (Jira)
Nico Kruber created FLINK-20217:
-----------------------------------

             Summary: More fine-grained timer processing
                 Key: FLINK-20217
                 URL: https://issues.apache.org/jira/browse/FLINK-20217
             Project: Flink
          Issue Type: Improvement
          Components: API / DataStream
    Affects Versions: 1.11.2, 1.10.2, 1.12.0
            Reporter: Nico Kruber


Timers are currently processed in one big block under the checkpoint lock (under {{InternalTimerServiceImpl#advanceWatermark}}. This can be problematic in a number of scenarios while doing checkpointing which would lead to checkpoints timing out (and even unaligned checkpoints would not help).

If you have a huge number of timers to process when advancing the watermark and the task is also back-pressured, the situation may actually be worse since you would block on the checkpoint lock and also wait for buffers/credits from the receiver.

I propose to make this loop more fine-grained so that it is interruptible by checkpoints, but maybe there is also some other way to improve here.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)