Theodore Vasiloudis created FLINK-1901:
------------------------------------------
Summary: Create sample operator for Dataset
Key: FLINK-1901
URL:
https://issues.apache.org/jira/browse/FLINK-1901 Project: Flink
Issue Type: Improvement
Components: Core
Reporter: Theodore Vasiloudis
In order to be able to implement Stochastic Gradient Descent and a number of other machine learning algorithms we need to have a way to take a random sample from a Dataset.
We need to be able to sample with or without replacement from the Dataset, choose the relative size of the sample, and set a seed for reproducibility.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)