Julio Biason created FLINK-9960:
-----------------------------------
Summary: JobManager can't failover to hdfs with multiple namenodes
Key: FLINK-9960
URL:
https://issues.apache.org/jira/browse/FLINK-9960 Project: Flink
Issue Type: Bug
Components: JobManager
Affects Versions: 1.4.0
Reporter: Julio Biason
When Hadoop/HDFS is running in HA mode, there may be more than one namenode running: One active and another standby.
If, for whatever reasons, the server configured in high-availability.storageDir is currently in standby mode (e.g., Hadoop failed over the other namenode), the JobManager will refuse to start because it can't contact hdfs – in this case, it should have a list of servers (up to 2) and, in case of a failure to connect, go to the next one.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)