qi quan created FLINK-11948:
-------------------------------
Summary: When kafka sink parallelism<kafka partition num,kafka data distribution unbalance
Key: FLINK-11948
URL:
https://issues.apache.org/jira/browse/FLINK-11948 Project: Flink
Issue Type: Bug
Components: Connectors / Kafka
Affects Versions: 1.7.2, 1.6.4, 1.6.3
Reporter: qi quan
The default FlinkFixedPartitioner return int[] partitions by subtaskid % partitions.length.When kafka sink parallelism<kafka partition num。It only the first few kafka partitions will write data.
I think it needs to be improved here.
{code:java}
@Override
public int partition(T record, byte[] key, byte[] value, String targetTopic, int[] partitions) {
Preconditions.checkArgument(
partitions != null && partitions.length > 0,
"Partitions of the target topic is empty.");
return partitions[parallelInstanceId % partitions.length];
}
{code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)