Hequn Cheng created FLINK-18761:
-----------------------------------
Summary: Support Python DataStream API (Stateless part)
Key: FLINK-18761
URL:
https://issues.apache.org/jira/browse/FLINK-18761 Project: Flink
Issue Type: New Feature
Components: API / DataStream, API / Python
Reporter: Hequn Cheng
This is the umbrella Jira for FLIP-130, which intends to support Python DataStream API for the stateless part.
FLIP wiki page: [
https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=158866298]
As we all know, Flink provides [three layered APIs|
https://flink.apache.org/flink-applications.html#layered-apis]: the ProcessFunctions, the DataStream API and the SQL & Table API. Each API offers a different trade-off between conciseness and expressiveness and targets different use cases.
Currently, the SQL & Table API has already been supported in PyFlink. The API provides relational operations as well as user-defined functions to provide convenience for users who are familiar with python and relational programming.
Meanwhile, the DataStream API and ProcessFunctions provide more generic APIs to implement stream processing applications. The ProcessFunctions expose time and state which are the fundamental building blocks for any kind of streaming application. To cover more use cases, we are planning to cover all these APIs in PyFlink.
In this FLIP, we propose to support the Python DataStream API for the stateless part. For more detail, please refer to the [FLIP wiki page|
https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=158866298]. As for the stateful part, it would come later after this FLIP.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)