[jira] [Created] (FLINK-12736) ResourceManager may release TM with allocated slots

Shang Yuanchun (Jira)
Chesnay Schepler created FLINK-12736:
----------------------------------------

             Summary: ResourceManager may release TM with allocated slots
                 Key: FLINK-12736
                 URL: https://issues.apache.org/jira/browse/FLINK-12736
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Coordination
    Affects Versions: 1.9.0
            Reporter: Chesnay Schepler
             Fix For: 1.9.0


The {{ResourceManager}} looks out for TaskManagers that have not had any slots allocated on them for a while, as these could be released to save resources. If such a TM is found, the RM checks via an RPC call whether the TM still holds any partitions. If no partitions are held, the TM is released.

However, the RPC callback does not check whether the TM is actually _still_ idle. A slot could have been allocated on the TM in the meantime, in which case the RM would release a TM that is in use.
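
To illustrate the race, here is a minimal, self-contained sketch of one possible fix: re-checking idleness inside the callback before releasing. This is not Flink's actual code. {{IdleTaskManagerReleaser}}, its methods and the {{holdsPartitions}} future are hypothetical stand-ins for the RM's bookkeeping and the partition RPC, and the sketch assumes all calls run on a single thread, similar to an RPC main thread.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;
import java.util.concurrent.CompletableFuture;

// Hypothetical model of the RM's idle-TM handling; names are illustrative only.
// Assumes single-threaded use, so no synchronization is shown.
class IdleTaskManagerReleaser {
    // Number of allocated slots per registered TM id.
    private final Map<String, Integer> allocatedSlots = new HashMap<>();
    // Ids of TMs that have been released.
    private final Set<String> released = new HashSet<>();

    void register(String tmId) {
        allocatedSlots.put(tmId, 0);
    }

    void allocateSlot(String tmId) {
        allocatedSlots.merge(tmId, 1, Integer::sum);
    }

    // A TM is idle if it is registered and has no allocated slots.
    boolean isIdle(String tmId) {
        Integer slots = allocatedSlots.get(tmId);
        return slots != null && slots == 0;
    }

    boolean isReleased(String tmId) {
        return released.contains(tmId);
    }

    // holdsPartitions models the pending answer of the partition RPC.
    void releaseIfIdle(String tmId, CompletableFuture<Boolean> holdsPartitions) {
        if (!isIdle(tmId)) {
            return;
        }
        holdsPartitions.thenAccept(hasPartitions -> {
            // Re-check idleness here: a slot may have been allocated
            // while the RPC was in flight.
            if (!hasPartitions && isIdle(tmId)) {
                allocatedSlots.remove(tmId);
                released.add(tmId);
            }
        });
    }
}
```

In this sketch, if {{allocateSlot}} is called between {{releaseIfIdle}} and the completion of the future, the second {{isIdle}} check fails and the TM is kept. Without that check, the callback would release the TM based on the stale result of the first check.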





--
This message was sent by Atlassian JIRA
(v7.6.3#76005)