[jira] [Created] (FLINK-1115) Spurious Test Failures due to failing file output streams

Shang Yuanchun (Jira)
Stephan Ewen created FLINK-1115:
-----------------------------------

             Summary: Spurious Test Failures due to failing file output streams
                 Key: FLINK-1115
                 URL: https://issues.apache.org/jira/browse/FLINK-1115
             Project: Flink
          Issue Type: Bug
          Components: Distributed Runtime
    Affects Versions: 0.7-incubating
            Reporter: Stephan Ewen
            Assignee: Stephan Ewen
            Priority: Minor


Once in a while, a test fails with the stack trace below, failing to open a file output stream. I am not fully sure what causes that, but I suspect a race between creating/clearing the output directory and the file stream creation.

We could simply have one or two retries when the local file system's output stream creation fails with "FileNotFoundException".
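
A minimal sketch of what such a retry could look like, in plain Java and independent of Flink's actual classes. The class name RetryingFileOpen, the method openWithRetries, and the parameters maxRetries and pauseMillis are hypothetical names chosen for illustration. Re-creating the parent directory before the next attempt is an additional assumption, made so that a directory cleared by a concurrent task does not cause every retry to fail the same way.

```java
import java.io.File;
import java.io.FileNotFoundException;
import java.io.FileOutputStream;
import java.io.IOException;

// Hypothetical helper for illustration only; not part of Flink.
public class RetryingFileOpen {

    /**
     * Tries to open an output stream on the given file. If the attempt fails
     * with FileNotFoundException, it is repeated up to maxRetries more times.
     */
    public static FileOutputStream openWithRetries(File file, int maxRetries, long pauseMillis)
            throws IOException {
        if (maxRetries < 0) {
            throw new IllegalArgumentException("maxRetries must not be negative");
        }

        FileNotFoundException lastError = null;
        for (int attempt = 0; attempt <= maxRetries; attempt++) {
            try {
                return new FileOutputStream(file);
            } catch (FileNotFoundException e) {
                lastError = e;

                // Assumption: the parent directory may have been removed
                // concurrently, so try to re-create it before retrying.
                File parent = file.getParentFile();
                if (parent != null) {
                    parent.mkdirs();
                }

                // Pause briefly before the next attempt, if there is one.
                if (attempt < maxRetries) {
                    try {
                        Thread.sleep(pauseMillis);
                    } catch (InterruptedException ie) {
                        Thread.currentThread().interrupt();
                        throw new IOException("Interrupted while retrying to open " + file, ie);
                    }
                }
            }
        }
        // All attempts failed; report the last failure to the caller.
        throw lastError;
    }
}
```

With maxRetries set to 1 or 2, a transient failure of the kind described above would be retried, while a persistent failure would still surface as the original FileNotFoundException.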

-----

23:39:57.272 [DataSink(Output) (1/4)] ERROR org.apache.flink.runtime.operators.DataSinkTask  - Error in user code: /tmp/org.apache.flink.test.recordJobTests.TPCHQuery10ITCase-result.txt/1 (No such file or directory):  DataSink(Output) (1/4)
java.io.FileNotFoundException: /tmp/org.apache.flink.test.recordJobTests.TPCHQuery10ITCase-result.txt/1 (No such file or directory)
        at java.io.FileOutputStream.open(Native Method) ~[na:1.8.0_05]
        at java.io.FileOutputStream.<init>(FileOutputStream.java:206) ~[na:1.8.0_05]
        at java.io.FileOutputStream.<init>(FileOutputStream.java:156) ~[na:1.8.0_05]
        at org.apache.flink.core.fs.local.LocalDataOutputStream.<init>(LocalDataOutputStream.java:50) ~[flink-core-0.7-incubating-SNAPSHOT.jar:0.7-incubating-SNAPSHOT]
        at org.apache.flink.core.fs.local.LocalFileSystem.create(LocalFileSystem.java:247) ~[flink-core-0.7-incubating-SNAPSHOT.jar:0.7-incubating-SNAPSHOT]
        at org.apache.flink.core.fs.local.LocalFileSystem.create(LocalFileSystem.java:254) ~[flink-core-0.7-incubating-SNAPSHOT.jar:0.7-incubating-SNAPSHOT]
        at org.apache.flink.api.common.io.FileOutputFormat.open(FileOutputFormat.java:239) ~[flink-core-0.7-incubating-SNAPSHOT.jar:0.7-incubating-SNAPSHOT]
        at org.apache.flink.runtime.operators.DataSinkTask.invoke(DataSinkTask.java:174) ~[flink-runtime-0.7-incubating-SNAPSHOT.jar:0.7-incubating-SNAPSHOT]
        at org.apache.flink.runtime.execution.RuntimeEnvironment.run(RuntimeEnvironment.java:235) [flink-runtime-0.7-incubating-SNAPSHOT.jar:0.7-incubating-SNAPSHOT]
        at java.lang.Thread.run(Thread.java:745) [na:1.8.0_05]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)