Hello,
I am currently learning Flink. I recently ran into a problem while doing disaster recovery testing with Flink. I looked for an answer on the official website and in blog posts but could not find one, so I am asking the community for help.
The current situation is: I have two servers, each with one slot. My application has two operators, each with a parallelism of 1, and I use slotSharingGroup to make them run in these two slots respectively.
My disaster recovery test is to shut down one of the servers. When that happens, is it possible that the two operators compete for the single slot on the remaining server? In addition, I want to add or remove servers dynamically while Flink is running (simulated power failures, etc.), but I think this must cause stream data loss. Is restarting the Flink job from a checkpoint the only way to ensure that no data is lost while the number of servers changes dynamically?
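To make the question concrete, this is roughly the kind of configuration I have in mind for the checkpoint-based restart. It is only a sketch: the values and the checkpoint path are placeholders, not my actual settings, and I believe the exact key names vary between Flink versions (for example, the checkpoint interval may have to be set in the job code with enableCheckpointing instead).

```yaml
# flink-conf.yaml (illustrative sketch, placeholder values)

# One slot per TaskManager, matching the two-server setup described above.
taskmanager.numberOfTaskSlots: 1

# Restart the job automatically after a TaskManager is lost.
restart-strategy: fixed-delay
restart-strategy.fixed-delay.attempts: 10
restart-strategy.fixed-delay.delay: 10 s

# Periodic checkpoints, so that a restart resumes from the last completed one.
execution.checkpointing.interval: 10 s
# Placeholder path; it would need to be storage reachable from every server.
state.checkpoints.dir: hdfs:///flink/checkpoints
```

My understanding is that with only one slot left after a server goes down, a restart like this could not schedule both operators until a second slot becomes available again, which is part of what I am asking about.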
Best
Zhong