Dawid Wysakowicz created FLINK-14908:
----------------------------------------
Summary: Distributing CacheFiles through DFS does not work
Key: FLINK-14908
URL:
https://issues.apache.org/jira/browse/FLINK-14908 Project: Flink
Issue Type: Bug
Components: Client / Job Submission, Runtime / REST
Affects Versions: 1.9.1, 1.8.2
Reporter: Dawid Wysakowicz
User reported that distributing cache files through DFS does not work anymore:
https://stackoverflow.com/questions/58978476/flink-1-9-wont-run-program-when-i-use-distributed-cache-whyI think the problematic part is in {{RestClusterClient#submitJob}}:
{code}
for (Map.Entry<String, DistributedCache.DistributedCacheEntry> artifacts : jobGraph.getUserArtifacts().entrySet()) {
artifactFileNames.add(new JobSubmitRequestBody.DistributedCacheFile(artifacts.getKey(), new Path(artifacts.getValue().filePath).getName()));
filesToUpload.add(new FileUpload(Paths.get(artifacts.getValue().filePath), RestConstants.CONTENT_TYPE_BINARY));
}
{code}
The code does not check if a file is in DFS, but just assumes it is in local FS and tries to add it to the rest request, which fails. The code on the receiver side in {{JobSubmitHandler}} still can support files distributed via DFS, but need to get proper paths to files in DFS.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)