Ayrat Hudaygulov created FLINK-19039:
----------------------------------------
Summary: Parallel Flink Kafka Consumers compete with each other
Key: FLINK-19039
URL:
https://issues.apache.org/jira/browse/FLINK-19039 Project: Flink
Issue Type: Improvement
Components: Connectors / Kafka
Affects Versions: 1.11.1
Reporter: Ayrat Hudaygulov
If I'll run multiple Flink instances with same consumer group id they will not re-balance partitions with each other, but rather each instance take all partitions, effectively not working in parallel at all, and multiplying amount of messages processed.
This is because FlinkKafkaConsumer has its own re-balancing mechanism for current parallelism level and then just calls:
`consumerTmp.assign(newPartitionAssignments){color:#cc7832};{color}`
[
https://github.com/apache/flink/blob/59714b9d6addb1dbf2171cab937a0e3fec52f2b1/flink-connectors/flink-connector-kafka-0.10/src/main/java/org/apache/flink/streaming/connectors/kafka/internal/KafkaConsumerThread.java#L422]
I suppose there has to be a way to fallback to default kafka mechanism of re-balancing to respect consumer group id, but it's not presented in Flink at all.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)