Craig Foster created FLINK-6222:
-----------------------------------
Summary: YARN: setting environment variables in an easier fashion
Key: FLINK-6222
URL:
https://issues.apache.org/jira/browse/FLINK-6222 Project: Flink
Issue Type: Improvement
Components: Startup Shell Scripts
Affects Versions: 1.2.0
Environment: YARN, EMR
Reporter: Craig Foster
Right now we require end-users to set YARN_CONF_DIR or HADOOP_CONF_DIR and sometimes FLINK_CONF_DIR.
For example, in [1], it is stated:
“Please note that the Client requires the YARN_CONF_DIR or HADOOP_CONF_DIR environment variable to be set to read the YARN and HDFS configuration.”
In BigTop, we set this with /etc/flink/default and then a wrapper is created to source that. However, this is slightly cumbersome and we don't have a central place within the Flink project itself to source environment variables. config.sh could do this but it doesn't have information about FLINK_CONF_DIR. For YARN and Hadoop variables, I already have a solution that would add "env.yarn.confdir" and "env.hadoop.confdir" variables to the flink-conf.yaml file and then we just symlink /etc/lib/flink/conf/ and /etc/flink/conf.
But we could also add a flink-env.sh file to set these variables and decouple them from config.sh entirely.
I'd like to know the opinion/preference of others and what would be more amenable.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)