Blog post about Parallel Joins in Flink - Mechanisms and Performance

classic Classic list List threaded Threaded
2 messages Options
Reply | Threaded
Open this post in threaded view
|

Blog post about Parallel Joins in Flink - Mechanisms and Performance

Stephan Ewen
Hello Squirrels!

Flink committer Fabian Hueske has written a very nice article about joins
in Apache Flink.

The article talks about joins in the APIs, the join algorithms, memory
management, and performance experiments on a small cluster.
A good read for everyone with SQL/ETL-style use cases and for those who are
interested to learn more about how Flink works.


http://flink.apache.org/news/2015/03/13/peeking-into-Apache-Flinks-Engine-Room.html


Have fun reading!
Reply | Threaded
Open this post in threaded view
|

Re: Blog post about Parallel Joins in Flink - Mechanisms and Performance

Henry Saputra
Great post.

Small comment, @Fabian I think the image is flipped.
The image for Repartition-Repartition is shown in the Broadcast-Forward section.

- Henry

On Tue, Mar 17, 2015 at 2:22 PM, Stephan Ewen <[hidden email]> wrote:

> Hello Squirrels!
>
> Flink committer Fabian Hueske has written a very nice article about joins
> in Apache Flink.
>
> The article talks about joins in the APIs, the join algorithms, memory
> management, and performance experiments on a small cluster.
> A good read for everyone with SQL/ETL-style use cases and for those who are
> interested to learn more about how Flink works.
>
>
> http://flink.apache.org/news/2015/03/13/peeking-into-Apache-Flinks-Engine-Room.html
>
>
> Have fun reading!