Jingsong Lee created FLINK-22202:
------------------------------------
Summary: Thread safety in ParquetColumnarRowInputFormat
Key: FLINK-22202
URL:
https://issues.apache.org/jira/browse/FLINK-22202 Project: Flink
Issue Type: Bug
Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
Reporter: Jingsong Lee
Assignee: Jingsong Lee
Fix For: 1.13.0
In a {{VectorizedColumnBatch}}, the dictionary will be lazied deserialized.
If there are multiple batches at the same time, there may be thread safety problems, because the deserialization of the dictionary depends on some internal structures.
We need set numBatchesToCirculate to 1 for ParquetColumnarRowInputFormat.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)