wgcn created FLINK-12728:
----------------------------
Summary: taskmanager container can't launch on nodemanager machine because of kerberos
Key: FLINK-12728
URL: https://issues.apache.org/jira/browse/FLINK-12728
Project: Flink
Issue Type: Bug
Components: Deployment / YARN
Affects Versions: 1.7.2
Environment: linux
jdk8
hadoop 2.7.2
flink 1.7.2
Reporter: wgcn
Attachments: AM.log, NM.log
The job cannot restart when a Flink job has been running for a long time and a TaskManager then restarts. The AM log shows that the AM keeps requesting TaskManager containers indefinitely. The NodeManager log shows that the newly requested containers cannot download files from HDFS because of Kerberos. I configured the following keytab settings
security.kerberos.login.use-ticket-cache: false
security.kerberos.login.keytab: /data/sysdir/knit/user/.flink.keytab
security.kerberos.login.principal: flink/client-docker-201-53.hadoop.lq@HADOOP.LQ2.
on the flink-client machine, and the keytab file exists.
The AM and NodeManager logs are attached (AM.log, NM.log).
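As a quick sanity check of the keytab settings quoted above, here is a minimal Python sketch that parses flink-conf.yaml-style `key: value` lines and reports whether the keytab and principal are set and whether the keytab file is present. The helper names `parse_flink_conf` and `check_kerberos_keytab` are hypothetical and not part of Flink. The sketch only inspects configuration text; it does not contact a KDC or verify that the containers can actually authenticate to HDFS.

```python
import os

# Keys taken from the configuration quoted in this report.
REQUIRED_KEYS = (
    "security.kerberos.login.keytab",
    "security.kerberos.login.principal",
)


def parse_flink_conf(text):
    """Parse flat 'key: value' lines, skipping blank lines and '#' comments."""
    conf = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the first colon only, so values may contain colons.
        key, sep, value = line.partition(":")
        if sep:
            conf[key.strip()] = value.strip()
    return conf


def check_kerberos_keytab(conf, file_exists=os.path.isfile):
    """Return a list of problems found; an empty list means none were found.

    file_exists is injectable (assumption for illustration) so the check
    can be exercised without a real keytab on disk.
    """
    problems = []
    for key in REQUIRED_KEYS:
        if not conf.get(key):
            problems.append("missing " + key)
    keytab = conf.get("security.kerberos.login.keytab")
    if keytab and not file_exists(keytab):
        problems.append("keytab not found: " + keytab)
    return problems
```

Running `check_kerberos_keytab(parse_flink_conf(open(path).read()))` on the client machine, where `path` points at the flink-conf.yaml in use, would list any missing setting. An empty result only confirms that the client-side configuration is consistent, which matches what is described in this report.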
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)