[jira] [Created] (FLINK-2261) Remove reduce/aggregation from DataStream

Shang Yuanchun (Jira)
Gyula Fora created FLINK-2261:
---------------------------------

             Summary: Remove reduce/aggregation from DataStream
                 Key: FLINK-2261
                 URL: https://issues.apache.org/jira/browse/FLINK-2261
             Project: Flink
          Issue Type: Improvement
          Components: Java API, Scala API, Streaming
    Affects Versions: 0.10
            Reporter: Gyula Fora
            Assignee: Gyula Fora


Currently, reduce and aggregation methods are also available on non-grouped DataStreams, where each parallel instance of the operator produces its own local aggregate, so the result depends on the operator's parallelism.

This behaviour is neither intuitive nor useful, as it only produces sensible results if the user explicitly sets the parallelism to 1, which should not be encouraged.

I would like to remove these methods from the DataStream API and keep them only for GroupedDataStream and WindowedDataStream, where the aggregation is executed either per key or per window.
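
To illustrate the problem, here is a minimal plain-Java sketch. It does not use the Flink API; it only simulates the behaviour described above. The class LocalReduceSketch, its method names, and the round-robin distribution of elements across subtasks are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class LocalReduceSketch {

    // Simulates a sum on a non-grouped stream: every parallel subtask only
    // sees its own share of the input and keeps its own local sum.
    // Round-robin distribution is an assumption for this sketch.
    static List<Integer> sumPerSubtask(List<Integer> input, int parallelism) {
        int[] sums = new int[parallelism];
        for (int i = 0; i < input.size(); i++) {
            sums[i % parallelism] += input.get(i);
        }
        List<Integer> result = new ArrayList<>();
        for (int s : sums) {
            result.add(s);
        }
        return result;
    }

    // Simulates a sum on a grouped stream: one aggregate per key, which is
    // the same no matter how many subtasks the keys are spread over.
    static Map<String, Integer> sumPerKey(List<String> keys, List<Integer> values) {
        Map<String, Integer> sums = new TreeMap<>();
        for (int i = 0; i < keys.size(); i++) {
            sums.merge(keys.get(i), values.get(i), Integer::sum);
        }
        return sums;
    }

    public static void main(String[] args) {
        List<Integer> values = Arrays.asList(1, 2, 3, 4, 5, 6);
        List<String> keys = Arrays.asList("a", "a", "b", "a", "b", "b");

        System.out.println(sumPerSubtask(values, 1)); // [21]
        System.out.println(sumPerSubtask(values, 2)); // [9, 12]
        System.out.println(sumPerKey(keys, values));  // {a=7, b=14}
    }
}
```

With parallelism 1 the non-grouped sum is the global total, but with parallelism 2 the same input yields two unrelated partial sums. The per-key result is well defined regardless of parallelism.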



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)