[ANNOUNCE] Weekly Community Update 2019/27

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[ANNOUNCE] Weekly Community Update 2019/27

Konstantin Knauf-3
Dear community,

this week's community digest with news on Flink 1.9.0 and Flink 1.8.1, our
Travis setup, Flink on PyPi, and a couple of new initiatives around the
DataStream API.

As always, please feel free to add additional updates and news to this
thread!

Flink Development
===============

* [releases] The feature freeze for *Flink 1.9.0 *has been announced on
Friday. The release branch will only be cut next Thursday though to allow
for some finishing strokes mainly on the planned features for the Table
API. [1]

* [releases] *Flink 1.8.1* has been released. The release blog post [2]
contains a list of all resolved issues.

* [development process] Over the last weeks there has been an ongoing
discussion on how to deal with the lack of *resources available for Flink's
CI (Travis)*. This week Chesnay has started a voting thread to move away
from ASF's Travis account to a Travis account sponsored by Ververica. This
got a lot of approvals, but the actual implementation is not trivial as we
can not link an external Travis account to the apache/flink repository. The
approach currently poc'ed by Chensay involves us fetching every PR to a
ververica-controlled repository and publishing the results back to
apache/flink. [3]

* [python] Flink 1.9.0 will contain a first version of a *Python Table API*.
Consequently, Jincheng started a discussion to publish it to* PyPi*. The
name will simply be "apache-flink". Currently, the main question is whether
the python package should bundle the binary distribution of Apache Flink
(Java/Scala) it depends on. It seems more feedback is needed to move this
forward. [4]

* [datastream-api] With FLIP-34 the Flink community introduced a the "stop"
command for graceful shutdown of a Flink Job. UDFs currently only have a
close() method, which is called for any type of shutdown. Now, Klou started
a discussion about adding an *optional interface*, which methods would only
be called during *gracelful shutdown* scenarios (but e.g. not in the case
of failure). [5,6]

* [datastream-api] Xingcan has started a VOTE on how to proceed with
the *split/select
API.* Voting Time ended one ago. More opinions wouldn't hurt to move this
forward. [7]

* [state management] After an offline sync between Vino and Kurt the
discussion on "*local keyed state" *and* "local aggregations"* covered last
week will now first focus on the proposal of "local keyed state" and
"localKeyBy()" as this was the core of Vino's original proposal.  [8]

* [logging] An old discussion thread [9] by Vino on separating the logs
between Flink Jobs within the same cluster was revived a bit this week in
the context of a new ticket [10]. A lot of the (interesting) discussion has
moved to this ticket for now.

[1]
http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/ANNOUNCE-Feature-freeze-for-Apache-Flink-1-9-0-release-td29751.html
[2] https://flink.apache.org/news/2019/07/02/release-1.8.1.html
[3]
http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-solve-unstable-build-capacity-problem-on-TravisCI-tp29881.html
[4]
http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Publish-the-PyFlink-into-PyPI-td30095.html
[5]
http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Graceful-Shutdown-Handling-by-UDFs-td30229.html
[6]
https://docs.google.com/document/d/1SXfhmeiJfWqi2ITYgCgAoSDUv5PNq1T8Zu01nR5Ebog/edit
[7]
http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/VOTE-How-to-Deal-with-Split-Select-in-DataStream-API-td30233.html
[8]
http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Support-Local-Aggregation-in-Flink-td29307.html
[9] https://issues.apache.org/jira/browse/FLINK-12953
[10]
http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/DISCUSS-Flink-framework-and-user-log-separation-td27441.html

Notable Bugs
===========

* [FLINK-13063] [1.8.1] [1.7.2] [1.6.4] In an operator chain containing an
AsyncOperator records can be lost or duplicated due to a bug in the
AsyncOperator. Until the mailbox model has been fully implemented, the fix
ensures that the AsyncOperator can only be the head of an operator. Fixed
in 1.7.3 and 1.8.2. [11]

* [FLINK-12889] [1.8.1] [1.7.2] In the case where a TM fails with an OOME
during checkpointing, its Tasks might not reach a proper failed state,
because the failure handling mechanism is affected by the OOME, too. Fixed
in 1.73 and 1.8.2. [12]

* [FLINK-13059] The CassandraSink might become deadlocked during shutdown,
when an exception happens during asynchronously sending requests to
Cassandra. A PR is available but needs a review. [13]

* [FLINK-12122] [1.6.4] [ 1.7.2] [ 1.8.1] Not a new bug, but a continuous
source of questions: Since FLIP-6 tasks are not spread out evenly among all
TaskManagers as before, which can result in very uneven loads if the number
of available TaskSlots exceeds the parallelism of the submitted job. Right
now it looks like a fix is targeted for 1.9.1 and 1.8.2 [14]

[11] https://issues.apache.org/jira/browse/FLINK-13063
[12] https://issues.apache.org/jira/browse/FLINK-12889
[13] https://issues.apache.org/jira/browse/FLINK-13059
[14] https://issues.apache.org/jira/browse/FLINK-12122

Events, Blog Posts, Misc
====================

* *Flink Forward Europ*e early-bird ends on the 15th of July. [15]
* Upcoming Meetups
    * On 18th of July *Christos Hadjinikolis* is speaking at the "Big Data
LDN Meetup" on "How real-time data processing is used for application in
customer experience?" [16]

[15] https://berlin-2019.flink-forward.org/
[16] https://www.meetup.com/big-data-ldn/events/262638878/

Cheers,

Konstantin (@snntrable)

--

Konstantin Knauf | Solutions Architect

+49 160 91394525


--

Ververica GmbH | Invalidenstrasse 115, 10115 Berlin, Germany

--

Ververica GmbH
Registered at Amtsgericht Charlottenburg: HRB 158244 B
Managing Directors: Dr. Kostas Tzoumas, Dr. Stephan Ewen