[jira] [Created] (FLINK-22509) ./bin/flink run -m yarn-cluster -d submission leads to IllegalStateException

Shang Yuanchun (Jira)
Robert Metzger created FLINK-22509:
--------------------------------------

             Summary: ./bin/flink run -m yarn-cluster -d submission leads to IllegalStateException
                 Key: FLINK-22509
                 URL: https://issues.apache.org/jira/browse/FLINK-22509
             Project: Flink
          Issue Type: Bug
          Components: Deployment / YARN
    Affects Versions: 1.13.0, 1.14.0
            Reporter: Robert Metzger


Submitting a job to a detached, per-job YARN cluster in Flink (for example: {{./bin/flink run -m yarn-cluster -d ./examples/streaming/TopSpeedWindowing.jar}}) leads to the following exception:

{code}
2021-04-28 11:39:00,786 INFO  org.apache.flink.yarn.YarnClusterDescriptor                  [] - Found Web Interface ip-172-31-27-232.eu-central-1.compute.internal:45689 of application 'application_1619607372651_0005'.
Job has been submitted with JobID 5543e81db9c2de78b646088891f23bfc
Exception in thread "Thread-4" java.lang.IllegalStateException: Trying to access closed classloader. Please check if you store classloaders directly or indirectly in static fields. If the stacktrace suggests that the leak occurs in a third party library and cannot be fixed immediately, you can disable this check with the configuration 'classloader.check-leaked-classloader'.
        at org.apache.flink.runtime.execution.librarycache.FlinkUserCodeClassLoaders$SafetyNetWrapperClassLoader.ensureInner(FlinkUserCodeClassLoaders.java:164)
        at org.apache.flink.runtime.execution.librarycache.FlinkUserCodeClassLoaders$SafetyNetWrapperClassLoader.getResource(FlinkUserCodeClassLoaders.java:183)
        at org.apache.hadoop.conf.Configuration.getResource(Configuration.java:2570)
        at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:2783)
        at org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:2758)
        at org.apache.hadoop.conf.Configuration.getProps(Configuration.java:2638)
        at org.apache.hadoop.conf.Configuration.get(Configuration.java:1100)
        at org.apache.hadoop.conf.Configuration.getTimeDuration(Configuration.java:1707)
        at org.apache.hadoop.conf.Configuration.getTimeDuration(Configuration.java:1688)
        at org.apache.hadoop.util.ShutdownHookManager.getShutdownTimeout(ShutdownHookManager.java:183)
        at org.apache.hadoop.util.ShutdownHookManager.shutdownExecutor(ShutdownHookManager.java:145)
        at org.apache.hadoop.util.ShutdownHookManager.access$300(ShutdownHookManager.java:65)
        at org.apache.hadoop.util.ShutdownHookManager$1.run(ShutdownHookManager.java:102)
{code}
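
Reading the trace bottom-up: Hadoop's {{ShutdownHookManager}} runs in a shutdown hook thread, loads its {{Configuration}} resources through a classloader, and that classloader is a Flink wrapper that has already been closed, so {{ensureInner}} throws. The following is only a minimal sketch of that pattern, not Flink's actual code. {{GuardedClassLoader}}, {{ClosedClassLoaderDemo}} and {{lookupAfterClose}} are hypothetical names made up for illustration, and the lookup is done inline rather than from a real shutdown hook:

```java
import java.net.URL;

public class ClosedClassLoaderDemo {

    /** Hypothetical stand-in for a wrapper classloader that rejects use after close(). */
    static final class GuardedClassLoader extends ClassLoader {
        private volatile ClassLoader inner;

        GuardedClassLoader(ClassLoader inner) {
            super(null);
            this.inner = inner;
        }

        /** Returns the wrapped loader, or fails if close() was already called. */
        private ClassLoader ensureInner() {
            ClassLoader current = inner;
            if (current == null) {
                throw new IllegalStateException("Trying to access closed classloader.");
            }
            return current;
        }

        @Override
        public URL getResource(String name) {
            return ensureInner().getResource(name);
        }

        /** Drops the reference to the wrapped loader; later lookups will throw. */
        void close() {
            inner = null;
        }
    }

    /** Returns the message of the exception raised by a late lookup, or null if none was raised. */
    static String lookupAfterClose() {
        GuardedClassLoader loader =
                new GuardedClassLoader(ClosedClassLoaderDemo.class.getClassLoader());
        // While the wrapper is open, a lookup is delegated (the result may simply be null).
        loader.getResource("core-site.xml");
        loader.close();
        try {
            // Stands in for the late lookup that the shutdown hook performs in the trace.
            loader.getResource("core-site.xml");
            return null;
        } catch (IllegalStateException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) {
        System.out.println(lookupAfterClose());
    }
}
```

In this sketch the second lookup fails only because it happens after {{close()}}. This mirrors the ordering suggested by the trace, where the shutdown hook runs after the user-code classloader has been closed.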

Despite the exception, the job is still running as expected.
Detached submission in application mode with {{./bin/flink run-application -t yarn-application -d}} works as expected. This is also the documented approach.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)