Re: Discussing about the 'starter' issue

Posted by Robert Metzger on
URL: http://deprecated-apache-flink-mailing-list-archive.368.s1.nabble.com/Discussing-about-the-starter-issue-tp283p319.html

Hi Wilson,

sorry for the delayed response. We are always happy about new contributors
to our project.
One thing that I would like to have in our Java API is a DataSet.sample()
transformation for sampling data in a DataSet. If you want, have a look
into sampling techniques and implement it for Flink.

If you want to familiarize yourself with extending our "DataSet"
abstraction, you can also look into
https://issues.apache.org/jira/browse/FLINK-834. I think it would be a good
preparation for the sampling task.


Robert



On Mon, Jun 16, 2014 at 2:33 PM, Wilson Cao <[hidden email]> wrote:

> Hi,
>
> Hi everyone, I am a starter of this project, so working on some 'starter'
> issue would help me learn more quickly!
> So I am wondering what issues would be suitable for me?
>
> Yours,
> Wilson
>