Andrey Zagrebin created FLINK-13371:
---------------------------------------
Summary: Release partitions in JM if producer gets restarted
Key: FLINK-13371
URL: https://issues.apache.org/jira/browse/FLINK-13371
Project: Flink
Issue Type: Bug
Components: Runtime / Coordination, Runtime / Network
Affects Versions: 1.9.0
Reporter: Andrey Zagrebin
As discussed in FLINK-13245, the producer may never detect a consumption attempt if the consumer fails before the connection is established. This means we cannot fully rely on the shuffle service to release partitions on consumption when a consumer fails. When the producer then restarts, it leaks the partitions from the previous attempt. Previously, we had an explicit release call for this case in Execution.cancel/suspend. The JM therefore has to explicitly release all partitions produced by the previous task execution attempt when the producer restarts, including `released on consumption` partitions. For this change, we might need to track all partitions in PartitionTrackerImpl.
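
A rough sketch of the proposed behavior, to make the idea concrete. This is not the actual PartitionTrackerImpl API: the class AttemptPartitionTracker, its methods, and the string IDs are all hypothetical. It assumes the JM records every partition per producer execution attempt, regardless of release mode, and releases whatever is still tracked for the old attempt when the producer restarts.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch only; none of these names are Flink's real API.
public class PartitionReleaseSketch {

    /** Tracks all partitions per producer attempt, including release-on-consumption ones. */
    static final class AttemptPartitionTracker {
        private final Map<String, Set<String>> partitionsByAttempt = new HashMap<>();
        private final List<String> released = new ArrayList<>();

        /** Registers a partition produced by the given execution attempt. */
        void startTracking(String producerAttemptId, String partitionId) {
            partitionsByAttempt
                    .computeIfAbsent(producerAttemptId, k -> new LinkedHashSet<>())
                    .add(partitionId);
        }

        /** Forgets a partition that the shuffle service already released on consumption. */
        void stopTracking(String producerAttemptId, String partitionId) {
            Set<String> partitions = partitionsByAttempt.get(producerAttemptId);
            if (partitions != null) {
                partitions.remove(partitionId);
                if (partitions.isEmpty()) {
                    partitionsByAttempt.remove(producerAttemptId);
                }
            }
        }

        /** Called on producer restart: releases everything still tracked for the old attempt. */
        List<String> releaseAllForAttempt(String producerAttemptId) {
            Set<String> partitions = partitionsByAttempt.remove(producerAttemptId);
            if (partitions == null) {
                return Collections.emptyList();
            }
            List<String> toRelease = new ArrayList<>(partitions);
            // Stand-in for the explicit release call to the shuffle service.
            released.addAll(toRelease);
            return toRelease;
        }

        List<String> releasedPartitions() {
            return Collections.unmodifiableList(released);
        }

        boolean isTracking(String producerAttemptId) {
            return partitionsByAttempt.containsKey(producerAttemptId);
        }
    }

    public static void main(String[] args) {
        AttemptPartitionTracker tracker = new AttemptPartitionTracker();
        tracker.startTracking("attempt-1", "p1"); // release-on-consumption partition
        tracker.startTracking("attempt-1", "p2");

        // The consumer failed before connecting, so the producer saw no consumption
        // attempt and nothing was released. The producer now restarts:
        List<String> leakedOtherwise = tracker.releaseAllForAttempt("attempt-1");
        System.out.println("Released on restart: " + leakedOtherwise);
    }
}
```

The point of the sketch is that the release on restart does not depend on the producer having observed a consumer, which is the gap described above.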
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)