[jira] [Created] (FLINK-1444) Add data properties for data sources

Shang Yuanchun (Jira)
Fabian Hueske created FLINK-1444:
------------------------------------

             Summary: Add data properties for data sources
                 Key: FLINK-1444
                 URL: https://issues.apache.org/jira/browse/FLINK-1444
             Project: Flink
          Issue Type: New Feature
          Components: Java API, JobManager, Optimizer
    Affects Versions: 0.9
            Reporter: Fabian Hueske
            Priority: Minor


This issue proposes adding support for attaching data properties to data sources. Unless noted otherwise, these properties are defined with respect to input splits.
Possible properties are:

- partitioning across splits: all elements with the same key (or key combination) are contained in a single split
- sorting / grouping within splits: elements are sorted or grouped on certain keys within each split
- key uniqueness: a certain key (or key combination) is unique across all elements of the data source. Unlike the two properties above, this one is not defined with respect to input splits.

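The optimizer can leverage this information to generate more efficient execution plans, for example by skipping a repartitioning or sort step when the source already provides the required property.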
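
To make the idea more concrete, here is a minimal, self-contained Java sketch of how declared source properties could be checked against the requirements of a grouping operator. It does not use any Flink API: the class SplitPropertiesSketch, the descriptor SourceProperties, the three helper methods, and the field names (userId, timestamp, eventId, country) are all hypothetical and exist only for illustration. The sort check assumes that a required order is satisfied only when it is a prefix of the declared sort order.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Illustrative sketch only; none of these names belong to a real Flink API.
class SplitPropertiesSketch {

    /** Hypothetical descriptor of the properties a data source declares. */
    static final class SourceProperties {
        // All elements with equal values on these fields are in one split.
        final Set<String> partitionKeys;
        // Elements within each split are sorted on these fields, in this order.
        final List<String> sortKeys;
        // This field combination is unique across the whole data source.
        final Set<String> uniqueKeys;

        SourceProperties(Set<String> partitionKeys,
                         List<String> sortKeys,
                         Set<String> uniqueKeys) {
            this.partitionKeys = partitionKeys;
            this.sortKeys = sortKeys;
            this.uniqueKeys = uniqueKeys;
        }
    }

    /**
     * A shuffle can be skipped if the source is partitioned on a non-empty
     * subset of the grouping keys: elements that agree on all grouping keys
     * also agree on the partition keys and therefore share a split.
     */
    static boolean needsShuffle(SourceProperties props, Set<String> groupingKeys) {
        boolean alreadyPartitioned = !props.partitionKeys.isEmpty()
                && groupingKeys.containsAll(props.partitionKeys);
        return !alreadyPartitioned;
    }

    /** A sort can be skipped if the required order is a prefix of the declared one. */
    static boolean needsSort(SourceProperties props, List<String> requiredOrder) {
        if (requiredOrder.size() > props.sortKeys.size()) {
            return true;
        }
        return !props.sortKeys.subList(0, requiredOrder.size()).equals(requiredOrder);
    }

    /** If the grouping keys contain a unique key, every group has exactly one element. */
    static boolean groupsAreSingletons(SourceProperties props, Set<String> groupingKeys) {
        return !props.uniqueKeys.isEmpty()
                && groupingKeys.containsAll(props.uniqueKeys);
    }

    public static void main(String[] args) {
        SourceProperties props = new SourceProperties(
                Collections.singleton("userId"),
                Arrays.asList("userId", "timestamp"),
                Collections.singleton("eventId"));

        Set<String> grouping = new HashSet<>(Arrays.asList("userId", "country"));
        System.out.println("needs shuffle: " + needsShuffle(props, grouping));
        System.out.println("needs sort on userId: "
                + needsSort(props, Collections.singletonList("userId")));
        System.out.println("singleton groups: " + groupsAreSingletons(props, grouping));
    }
}
```

In this sketch, a grouping on (userId, country) over a source partitioned on userId needs no shuffle, which is the kind of plan simplification the issue has in mind. How the properties would actually be declared on a data source is left open here.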



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)