Chesnay Schepler created FLINK-20605:
----------------------------------------
Summary: DeclarativeSlotManager crashes if slot allocation notification is processed after taskexecutor shutdown
Key: FLINK-20605
URL:
https://issues.apache.org/jira/browse/FLINK-20605 Project: Flink
Issue Type: Improvement
Components: Runtime / Coordination
Affects Versions: 1.13.0
Reporter: Chesnay Schepler
Assignee: Chesnay Schepler
Fix For: 1.13.0
It appears to be possible that a notification from a task executor about a slot being allocated can be processed after that very task executor has unregistered itself from the resource manager.
As a result we run into an exception when trying to mark this slot as allocated, because it no longer exists and a precondition catches this case.
We could solve this by checking in {{DeclarativeResourceManager#allocateSlot}} whether the task executor we received the acknowledge from is still registered.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)