[jira] [Created] (FLINK-16371) HadoopCompressionBulkWriter fails with 'java.io.NotSerializableException'

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-16371) HadoopCompressionBulkWriter fails with 'java.io.NotSerializableException'

Shang Yuanchun (Jira)
Sivaprasanna Sethuraman created FLINK-16371:
-----------------------------------------------

             Summary: HadoopCompressionBulkWriter fails with 'java.io.NotSerializableException'
                 Key: FLINK-16371
                 URL: https://issues.apache.org/jira/browse/FLINK-16371
             Project: Flink
          Issue Type: Bug
    Affects Versions: 1.10.0
            Reporter: Sivaprasanna Sethuraman


When using CompressWriterFactory with Hadoop compression codec, the execution fails with java.io.NotSerializableException. 

I guess this is probably to do with the the instance creation for Hadoop's CompressionCodec being done here at [CompressWriterFactory.java#L59|https://github.com/apache/flink/blob/master/flink-formats/flink-compress/src/main/java/org/apache/flink/formats/compress/CompressWriterFactory.java#L59] and thus it has to be sent over the wire causing the exception to be thrown.

So I did a quick test on my end by changing the way the CompressionCodec is created and ran it on a Hadoop cluster, and it has been working just fine. Will raise a PR in a day or so.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)