Yingjie Cao created FLINK-22910:
-----------------------------------
Summary: ShuffleMaster enhancement for pluggable shuffle service framework
Key: FLINK-22910
URL:
https://issues.apache.org/jira/browse/FLINK-22910 Project: Flink
Issue Type: Sub-task
Components: Runtime / Coordination
Reporter: Yingjie Cao
Fix For: 1.14.0
The current _ShuffleMaster_ has an unclear lifecycle which is inconsistent with the _ShuffleEnvironment_ at the _TM_ side. Besides, it is hard to Implement some important capabilities for remote shuffle service. For example, 1) release external resources when a job finished; 2) Stop or start tracking some partitions depending on the status of the external service or system.
We drafted a document[1] which proposed some simple changes to solve these issues. The document is still not wholly completed yet. We will start a discussion once it is finished.
[1]
https://docs.google.com/document/d/1_cHoapNbx_fJ7ZNraSqw4ZK1hMRiWWJDITuSZrdMDDs/edit?usp=sharing--
This message was sent by Atlassian Jira
(v8.3.4#803005)