Chesnay Schepler created FLINK-5179:
---------------------------------------
Summary: MetricRegistry life-cycle issues with HA
Key: FLINK-5179
URL:
https://issues.apache.org/jira/browse/FLINK-5179 Project: Flink
Issue Type: Bug
Components: Metrics
Affects Versions: 1.1.3
Reporter: Chesnay Schepler
Assignee: Chesnay Schepler
Priority: Blocker
Fix For: 1.2.0, 1.1.4
The TaskManager's MetricRegistry is started when the TaskManager is created, and shutdown in the TaskManager's postStop method.
However, the registry is also shutdown within the TaskManager's disassociateFromJobManager method; however it is not restarted when the connection is re-established.
Effectively this means that a TaskManager that ever reconnected to a JobManager will not report any metrics, since the reporters are shutdown as well. Metrics will neither be sent to the WebInterface anymore.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)