[jira] [Created] (FLINK-2312) Random Splits

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-2312) Random Splits

Shang Yuanchun (Jira)
Maximilian Alber created FLINK-2312:
---------------------------------------

             Summary: Random Splits
                 Key: FLINK-2312
                 URL: https://issues.apache.org/jira/browse/FLINK-2312
             Project: Flink
          Issue Type: Wish
          Components: Machine Learning Library
            Reporter: Maximilian Alber
            Priority: Minor


In machine learning applications it is common to split data sets into f.e. training and testing set.

To the best of my knowledge there is at the moment no nice way in Flink to split a data set randomly into several partitions according to some ratio.

The wished semantic would be the same as of Sparks RDD randomSplit.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)