B Wyatt created FLINK-3974:
------------------------------
Summary: enableObjectReuse fails when an operator chains to multiple downstream operators
Key: FLINK-3974
URL: https://issues.apache.org/jira/browse/FLINK-3974
Project: Flink
Issue Type: Bug
Components: DataStream API
Affects Versions: 1.0.3
Reporter: B Wyatt
Given a topology that looks like this:
{code:java}
DataStream<A> input = ...
input
    .map(MapFunction<A,B>...)
    .addSink(...);
input
    .map(MapFunction<A,C>...)
    .addSink(...);
{code}
With {{enableObjectReuse()}} turned on, this topology throws an exception of the form {{java.lang.ClassCastException: B cannot be cast to A}}.
It looks like the input operator calls {{Output<StreamRecord<A>>.collect}}, which loops over the downstream operators and hands the same record to each one in turn.
However, the first map operation calls {{StreamRecord<>.replace}}, which mutates the value stored in that {{StreamRecord<>}}.
As a result, by the time the {{Output<StreamRecord<A>>.collect}} call passes the {{StreamRecord<A>}} to the second map operation, it is actually a {{StreamRecord<B>}}, and the two map operations behave as if they were chained in series instead of running in parallel on the same input.
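The suspected mechanism can be reproduced without Flink. The sketch below is a simplified model, not Flink's actual code: {{Record}}, {{Output}}, {{MapOperator}} and {{BroadcastingOutput}} are hypothetical stand-ins for the runtime classes, and the only assumptions are the ones described above (one record instance handed to every branch, and a map that writes its result back into that record).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;

// Simplified model of the reported behaviour. None of these classes are Flink's;
// Record, Output, MapOperator and BroadcastingOutput are hypothetical stand-ins.
class ObjectReuseSketch {

    static final class A { final int id; A(int id) { this.id = id; } }
    static final class B { final int id; B(int id) { this.id = id; } }
    static final class C { final int id; C(int id) { this.id = id; } }

    // Mutable value holder, standing in for StreamRecord.
    static final class Record<T> {
        private Object value;
        Record(T value) { this.value = value; }

        @SuppressWarnings("unchecked")
        T getValue() { return (T) value; }

        // Overwrites the value in place and returns the same instance, retyped.
        @SuppressWarnings("unchecked")
        <X> Record<X> replace(X newValue) {
            this.value = newValue;
            return (Record<X>) this;
        }
    }

    interface Output<T> { void collect(Record<T> record); }

    // A map step that reuses the incoming record for its result.
    static final class MapOperator<I, O> implements Output<I> {
        private final Function<I, O> fn;
        private final Output<O> downstream;

        MapOperator(Function<I, O> fn, Output<O> downstream) {
            this.fn = fn;
            this.downstream = downstream;
        }

        @Override
        public void collect(Record<I> record) {
            downstream.collect(record.replace(fn.apply(record.getValue())));
        }
    }

    // Hands the same record instance to every downstream branch, in order.
    static final class BroadcastingOutput<T> implements Output<T> {
        private final List<Output<T>> outputs;
        BroadcastingOutput(List<Output<T>> outputs) { this.outputs = outputs; }

        @Override
        public void collect(Record<T> record) {
            for (Output<T> out : outputs) {
                out.collect(record);
            }
        }
    }

    // Builds the two-branch topology and reports what happens to one record.
    static String run() {
        List<B> sinkB = new ArrayList<>();
        List<C> sinkC = new ArrayList<>();
        Output<A> first = new MapOperator<A, B>(a -> new B(a.id), r -> sinkB.add(r.getValue()));
        Output<A> second = new MapOperator<A, C>(a -> new C(a.id), r -> sinkC.add(r.getValue()));
        Output<A> broadcast = new BroadcastingOutput<>(Arrays.asList(first, second));
        try {
            broadcast.collect(new Record<>(new A(1)));
            return "ok";
        } catch (ClassCastException e) {
            // The second branch received a record that now holds a B, not an A.
            return "ClassCastException";
        }
    }

    public static void main(String[] args) {
        System.out.println(run());
    }
}
```

In this model the first branch completes normally, and the failure only surfaces when the second map function is applied to the already-replaced value, which matches the exception described above.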
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)