[jira] [Created] (FLINK-6120) Implement heartbeat logic between JobMaster and ResourceManager

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-6120) Implement heartbeat logic between JobMaster and ResourceManager

Shang Yuanchun (Jira)
zhijiang created FLINK-6120:
-------------------------------

             Summary: Implement heartbeat logic between JobMaster and ResourceManager
                 Key: FLINK-6120
                 URL: https://issues.apache.org/jira/browse/FLINK-6120
             Project: Flink
          Issue Type: Improvement
            Reporter: zhijiang
            Assignee: zhijiang


It is part of work for Flip-6.

The HeartbeatManager is mainly used for monitoring heartbeat target and reporting payloads.

For {{ResourceManager}} side, it would trigger monitoring the {{HeartbeatTarget}} when receive registration from {{JobMaster}}, and schedule a task to {{requestHeartbeat}} at interval time. If not receive heartbeat response within duration time, the {{HeartbeatListener}} will notify heartbeat timeout, then the {{ResourceManager}} should remove the internal registered {{JobMaster}}.

For {{JobMaster}} side, it would trigger monitoring the {{HeartbeatTarget}} when receive registration acknowledgement from {{ResourceManager}}. An it will also be notified heartbeat timeout if not receive heartbeat request from {{ResourceManager}} within duration time.

The current implementation will not interact payloads via heartbeat, and it can be added if needed future.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)