[jira] [Created] (FLINK-21289) Application mode on kubernetes deployment support run PackagedProgram.main with pipeline.classpaths

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-21289) Application mode on kubernetes deployment support run PackagedProgram.main with pipeline.classpaths

Shang Yuanchun (Jira)
Zhou Parker created FLINK-21289:
-----------------------------------

             Summary: Application mode on kubernetes deployment support run PackagedProgram.main with pipeline.classpaths
                 Key: FLINK-21289
                 URL: https://issues.apache.org/jira/browse/FLINK-21289
             Project: Flink
          Issue Type: Improvement
          Components: Client / Job Submission, Deployment / Kubernetes
    Affects Versions: 1.12.1, 1.11.2
         Environment: flink: 1.11

kubernetes: 1.15

 
            Reporter: Zhou Parker
         Attachments: 0001-IMP.patch

我尝试将flink作业以application mode方式提交到kubernetes上运行。但程序的依赖包并不完全存在于local:///opt/flink/usrlib/xxxx.jar中。导致找不到类。

在yarn上可以工作,是因为我们用 {color:#FF0000}-C [http://xxxx|http://xxxx/] {color}的方式,让依赖可以被URLClassloader加载。

但我发现,当实验提交到kubernetes时,-C只会在 configmap/flink-conf.yaml 中生成一个pipeline.classpaths 配置条目。我们的main函数可以执行,但是在加载外部依赖类的时候提示找不到类。

通过阅读源码,*我发现运行用户代码的类加载器实际并把 pipeline.classpaths 中的条目加入候选URL*,这导致了无法加载类的情况。从源码中,我也发现,可以通过将依赖包放在usrlib目录下(默认的userClassPath)可以解决问题。但我们的依赖可能是动态的,不合适一次性打到镜像里面。

我提议可以改进这个过程,将pipeline.classpaths也加入到对应的类加载器。这个改动很小,我自己经过测试,可以完美解决问题。

 

I'm trying to submit flink job to kubernetes cluster with application mode, but throw ClassNotFoundException when dependency class is not shipped in kind of local:///opt/flink/usrlib/xxxx.jar.

This works on yarn, since we use {color:#FF0000}-C http://xxxx{color} command line style that let dependency class  can be load by URLClassloader.

But i figure out that not works on kubernetes. When submit to kubernetes cluster, -C is only shipped as item "pipeline.classpaths" in configmap/flink-conf.yaml。

After read the source code, *i find out that the Classloader launching the "main" entry of user code without consider add pipeline.classpaths into candidates URLs*. from source code, i also learn that we can ship the dependency jar in the usrlib dir to solve the problem. But failed for me, we are not _preferred_ to ship dependencies in image at compile time, since they are known dynamically in runtime

I proposed improving the process, let the Classloader consider usrlib as well as pipeline.classpaths, this is a quite little change. I test the solution and it works quite well

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)