[jira] [Created] (FLINK-16443) Fix wrong fix for user-code CheckpointExceptions

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-16443) Fix wrong fix for user-code CheckpointExceptions

Shang Yuanchun (Jira)
Stephan Ewen created FLINK-16443:
------------------------------------

             Summary: Fix wrong fix for user-code CheckpointExceptions
                 Key: FLINK-16443
                 URL: https://issues.apache.org/jira/browse/FLINK-16443
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Checkpointing
            Reporter: Stephan Ewen
             Fix For: 1.11.0


The problem of having exceptions that are only in the user code classloader was fixed by proactively serializing them inside the {{CheckpointException}}. That means all consumers of  {{CheckpointException}} now need to be aware of that and unwrap the serializable exception.

I believe the right way to fix this would have been to use a SerializedException in the {{DeclineCheckpoint}} message instead, which would have localized the change to the actual problem: RPC transport.

I would suggest to revert https://github.com/apache/flink/pull/9742 and instead apply the above described change.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)