godfrey he created FLINK-16322:
----------------------------------
Summary: wrong result after filter push down in parquet table source
Key: FLINK-16322
URL: https://issues.apache.org/jira/browse/FLINK-16322
Project: Flink
Issue Type: Bug
Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
Reporter: godfrey he
Attachments: parquet-1-1.parquet
I get a wrong result when running the following query:
source schema:
first VARCHAR
id INT
score DOUBLE
last VARCHAR
data: (parquet file is in the attachment)
("Mike", 1, 12.3d, "Smith"),
("Bob", 2, 45.6d, "Taylor"),
("Sam", 3, 7.89d, "Miller"),
("Peter", 4, 0.12d, "Smith"),
("Liz", 5, 34.5d, "Williams"),
("Sally", 6, 6.78d, "Miller"),
("Alice", 7, 90.1d, "Smith"),
("Kelly", 8, 2.34d, "Williams")
query:
SELECT id, `first`, `last`, score FROM ParquetTable WHERE score < 3
The expected result size is 2 (the rows with id 4 and id 8 have score < 3); however, the actual result size is 0.
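
For reference, the expected rows can be computed independently of Flink. The sketch below is not the Flink parquet table source: it loads the eight rows listed above into an in-memory SQLite table (an assumption made purely for illustration) and runs the same query, with an ORDER BY added only to make the output deterministic. The function name expected_rows is hypothetical.

```python
import sqlite3

# The eight rows from the report: (first, id, score, last).
ROWS = [
    ("Mike", 1, 12.3, "Smith"),
    ("Bob", 2, 45.6, "Taylor"),
    ("Sam", 3, 7.89, "Miller"),
    ("Peter", 4, 0.12, "Smith"),
    ("Liz", 5, 34.5, "Williams"),
    ("Sally", 6, 6.78, "Miller"),
    ("Alice", 7, 90.1, "Smith"),
    ("Kelly", 8, 2.34, "Williams"),
]


def expected_rows(threshold=3):
    """Run the reported query against an in-memory SQLite copy of the data.

    SQLite stands in for a reference engine here; it is not Flink and does
    no parquet filter push down.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            'CREATE TABLE ParquetTable '
            '("first" TEXT, id INTEGER, score REAL, "last" TEXT)'
        )
        conn.executemany("INSERT INTO ParquetTable VALUES (?, ?, ?, ?)", ROWS)
        cur = conn.execute(
            "SELECT id, `first`, `last`, score FROM ParquetTable "
            "WHERE score < ? ORDER BY id",
            (threshold,),
        )
        return cur.fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for row in expected_rows():
        print(row)
```

Only Peter (score 0.12) and Kelly (score 2.34) satisfy score < 3, so a correct filter push down should return exactly these two rows.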
--
This message was sent by Atlassian Jira
(v8.3.4#803005)