Theodore Vasiloudis created FLINK-2202:
------------------------------------------
             Summary: Calling distinct() requires tuple input
                 Key: FLINK-2202
                 URL: 
https://issues.apache.org/jira/browse/FLINK-2202             Project: Flink
          Issue Type: Improvement
          Components: Core, Scala API
            Reporter: Theodore Vasiloudis
            Priority: Minor
Currently to call distinct on a DataSet the elements must be placed in a tuple.
This creates the need to write code like the following:
{code}
val doubleDS: DataSet[Double] = ...
val uniqueDS = doubleDS.map( el => Tuple1(el)).distinct().map(t => t._1)
{code}
which looks quite unnecessary. Ideally we would like to just have to write:
 
{code}
val uniqueDS = doubleDS.distinct()
{code}
which should be possible as long as there exists an implicit {{Ordering\[T\]}} for a {{DataSet\[T\]}}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)