KAFKA-10551: Add topic id support to produce request and response #15968

OmniaGM · 2024-05-15T15:53:10Z

Add support topicId in ProduceRequest/ProduceResponse. Topic name and Topic Id will become ignorable following the footstep of FetchRequest/FetchResponse
ReplicaManager still look for HostedPartition using TopicPartition and doesn't check topic id. This is an [OPEN QUESTION] if we should address this in this pr or wait for KAFKA-16212 as this will update ReplicaManager::getPartition to use TopicIdParittion once we update the cache. Other option is that we compare provided topicId with Partition topic id and return UNKNOW_TOPIC_ID or UNKNOW_TOPIC_PARTITION if we can't find partition with matched topic id.

Committer Checklist (excluded from commit message)

Verify design and implementation
Verify test coverage and CI build status
Verify documentation (including upgrade notes)

…equest

OmniaGM · 2024-05-16T13:01:33Z

Few of the failed tests are related to this change and am working on fixing them

OmniaGM · 2024-05-20T12:00:14Z

Few of the failed tests are related to this change and am working on fixing them

I believe that failed tests now are unrelated

jolshan · 2024-05-20T22:29:30Z

Topic name and Topic Id will become optional following the footstep of FetchRequest/FetchResponse

My understanding is that all requests going forward will use ID and not name similar to fetch request. I believe that is what is in the PR, but the comment suggests otherwise.

jolshan · 2024-05-20T22:41:11Z

clients/src/main/java/org/apache/kafka/clients/producer/internals/Sender.java

@@ -610,7 +611,9 @@ private void handleProduceResponse(ClientResponse response, Map<TopicPartition,
 // This will be set by completeBatch.
 Map<TopicPartition, Metadata.LeaderIdAndEpoch> partitionsWithUpdatedLeaderInfo = new HashMap<>();
 produceResponse.data().responses().forEach(r -> r.partitionResponses().forEach(p -> {
- TopicPartition tp = new TopicPartition(r.name(), p.index());
+ // Version 12 drop topic name and add support to topic id. However, metadata can be used to map topic id to topic name.
+ String topicName = (r.name() == null || r.name().isEmpty()) ? metadata.topicNames().get(r.topicId()) : r.name();


What do we do if metadata has refreshed and is no longer in the metadata?
For fetch it is a bit different since we have the session logic, and can handle missing topics.

I would recommend writing through a few cases where the server and client have/don't have the topic ID to reason about the upgrade case/downgrade case/deletions/reassignments.

If topic has been recreated and topic id is out of date, the client will get UNKNOWN_TOPIC_ID and on the retry the topic id will be updated

If topic has been reassigned to another broker then the client will get NOT_LEADER_OR_FOLLOWER and then the client can retry with the right broker.

Am not sure what upgrade case/downgrade you refer too here Do you mean the client and broker IBP combination? If yes then some of these are covered in ProduceRequestTest and RequestResponseTest

I added two test cases to cover the first two and the producer seem to self recover on retry.

Yes. For the fetch request for example, there is code to make sure that all topics have IDs before we can send the fetch request. This is a bit less of an issue now, but if we have a cluster that is running on a MV < 2.8, topics will not have IDs. So when we decide which version of produce we want to send, we want to be aware of this.

Not only that, but even if the broker supports topic IDs on all topics, we also may have a case where we need to do a rolling upgrade to get the code that supports handling the latest API version. This may be less complicated for Produce since it is a client only API and doesn't rely on MV/IBP, so the apiVersions exchange between the client and the broker may be enough to ensure api compatibility.

We just want to confirm these upgrade paths are compatible since produce is the hot path and we don't want any (or at least not extended) downtime in the middle of an upgrade.

clients/src/main/java/org/apache/kafka/clients/producer/internals/Sender.java

core/src/main/scala/kafka/server/KafkaApis.scala

jolshan · 2024-05-20T23:10:47Z

core/src/main/scala/kafka/server/ReplicaManager.scala

@@ -1361,10 +1366,10 @@ class ReplicaManager(val config: KafkaConfig,
 */
 private def appendToLocalLog(internalTopicsAllowed: Boolean,
 origin: AppendOrigin,
- entriesPerPartition: Map[TopicPartition, MemoryRecords],
+ entriesPerPartition: Map[TopicIdPartition, MemoryRecords],


is there a reason to pass this data structure here if we are not using the ID to check the append at the log level?

two reasons here

I didn't want to keep convert between TopicIdPartitions to TopicPartition

KAFKA-16212 will eventually use TopicIdPartitions to getPartitionOrException

Ok -- once we start using these across the log layer it makes sense.

jolshan · 2024-05-20T23:31:21Z

I would recommend taking a look at where we are passing the topic ID through and the checks we do. If we think it is useful to ensure we are writing to the right topic, we should do it, but if it is just adding complexity, we may want to consider changing.

OmniaGM · 2024-05-22T09:42:02Z

Topic name and Topic Id will become optional following the footstep of FetchRequest/FetchResponse

My understanding is that all requests going forward will use ID and not name similar to fetch request. I believe that is what is in the PR, but the comment suggests otherwise.

I meant that in Json files both will be marked ignorable

OmniaGM added 4 commits May 8, 2024 14:46

KAFKA-10551: Add topic id support to produce request and response

d3acdf0

KAFKA-10551: fix compatibility with 2.6 IBP

ddeac1b

KAFKA-10551: refactor

a92abe6

Merge remote-tracking branch 'apache/trunk' into KAFKA-10551-produceR…

afede12

…equest

OmniaGM marked this pull request as draft May 15, 2024 15:53

fix test

a8f0c91

OmniaGM force-pushed the KAFKA-10551-produceRequest branch from 1805c4f to a8f0c91 Compare May 15, 2024 22:05

OmniaGM marked this pull request as ready for review May 15, 2024 22:58

OmniaGM changed the title ~~Kafka-10551: Add topic id support to produce request and response~~ KAFKA-10551: Add topic id support to produce request and response May 15, 2024

fix integration tests

8c3602b

OmniaGM force-pushed the KAFKA-10551-produceRequest branch from 63a6032 to 8c3602b Compare May 16, 2024 15:06

jolshan reviewed May 20, 2024

View reviewed changes

clients/src/main/java/org/apache/kafka/clients/producer/internals/Sender.java Outdated Show resolved Hide resolved

jolshan reviewed May 20, 2024

View reviewed changes

core/src/main/scala/kafka/server/KafkaApis.scala Outdated Show resolved Hide resolved

jolshan reviewed May 20, 2024

View reviewed changes

OmniaGM added 2 commits May 22, 2024 15:03

Merge branch 'trunk' into KAFKA-10551-produceRequest

cba0a2d

address part of the feedback

27ed97b

OmniaGM force-pushed the KAFKA-10551-produceRequest branch from 8daeb62 to 1abc2ac Compare May 28, 2024 15:43

Add testing for while recreate the topic and reassignment

35dba4b

OmniaGM force-pushed the KAFKA-10551-produceRequest branch from 1abc2ac to 35dba4b Compare May 28, 2024 16:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KAFKA-10551: Add topic id support to produce request and response #15968

KAFKA-10551: Add topic id support to produce request and response #15968

OmniaGM commented May 15, 2024 •

edited

OmniaGM commented May 16, 2024

OmniaGM commented May 20, 2024

jolshan commented May 20, 2024 •

edited

jolshan May 20, 2024

OmniaGM May 28, 2024 •

edited

jolshan May 31, 2024 •

edited

jolshan May 20, 2024

OmniaGM May 22, 2024

jolshan May 22, 2024

jolshan commented May 20, 2024

OmniaGM commented May 22, 2024

KAFKA-10551: Add topic id support to produce request and response #15968

Are you sure you want to change the base?

KAFKA-10551: Add topic id support to produce request and response #15968

Conversation

OmniaGM commented May 15, 2024 • edited

Committer Checklist (excluded from commit message)

OmniaGM commented May 16, 2024

OmniaGM commented May 20, 2024

jolshan commented May 20, 2024 • edited

jolshan May 20, 2024

Choose a reason for hiding this comment

OmniaGM May 28, 2024 • edited

Choose a reason for hiding this comment

jolshan May 31, 2024 • edited

Choose a reason for hiding this comment

jolshan May 20, 2024

Choose a reason for hiding this comment

OmniaGM May 22, 2024

Choose a reason for hiding this comment

jolshan May 22, 2024

Choose a reason for hiding this comment

jolshan commented May 20, 2024

OmniaGM commented May 22, 2024

OmniaGM commented May 15, 2024 •

edited

jolshan commented May 20, 2024 •

edited

OmniaGM May 28, 2024 •

edited

jolshan May 31, 2024 •

edited