[jira] [Created] (FLINK-3634) Fix documentation for DataSetUtils.zipWithUniqueId()

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-3634) Fix documentation for DataSetUtils.zipWithUniqueId()

Shang Yuanchun (Jira)
Greg Hogan created FLINK-3634:
---------------------------------

             Summary: Fix documentation for DataSetUtils.zipWithUniqueId()
                 Key: FLINK-3634
                 URL: https://issues.apache.org/jira/browse/FLINK-3634
             Project: Flink
          Issue Type: Improvement
          Components: Documentation
    Affects Versions: 1.1.0
            Reporter: Greg Hogan
            Assignee: Greg Hogan
            Priority: Minor
             Fix For: 1.1.0, 1.0.1


Under FLINK-2590 the assignment and testing of unique IDs was improved but the documentation looks to still reference the old implementation.

With {{parallelism=1}} there is no difference between zipWithUniqueID and zipWithIndex. With greater parallelism the results of zipWithUniqueID are dependent on the partitioning.

The documentation should demonstrate a possible result that is different from the incremental sequence of zipWithIndex while noting that results are dependent on the parallelism and partitioning.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)