Apache Beam support for Pulsar #18453
Replies: 23 comments
-
this should be a request in the beam repo. |
Beta Was this translation helpful? Give feedback.
-
okay, looks like there is a story for that (https://issues.apache.org/jira/browse/BEAM-8218), but anyway it would be a push for pulsar. Maybe working together on this is an option. |
Beta Was this translation helpful? Give feedback.
-
@svenhornberg there is a discussion in BEAM mailing list. We will collaborate with beam community on adding a pulsar connector in BEAM repo. |
Beta Was this translation helpful? Give feedback.
-
Any statement if you would try to implement it ? 3 months have passed |
Beta Was this translation helpful? Give feedback.
-
@svenhornberg The beam community was trying to drive the development for this connector. Since they are the experts of beam. I didn't closely follow the progress there. Happy to follow up with them and get back here. |
Beta Was this translation helpful? Give feedback.
-
@sijie thank you for clarifying |
Beta Was this translation helpful? Give feedback.
-
Any updates on this matter? Would be interested in a Beam Connector too :) |
Beta Was this translation helpful? Give feedback.
-
@rfuerst87 - the beam community is doing the work. You can track the progress here - https://issues.apache.org/jira/browse/BEAM-8218 I will follow up there as well. |
Beta Was this translation helpful? Give feedback.
-
@sijie Thanks for the update. Thought you might have some more insights from the mailing list. Will follow the topic in JIRA. |
Beta Was this translation helpful? Give feedback.
-
I've taken over the ticket. I'm familiar with Beam, less so with Pulsar (I want to learn Pulsar, creating the IO). I'll be starting locally with Pulsar in standalone mode, as soon as I have a prototype I'll keep updates on this thread. |
Beta Was this translation helpful? Give feedback.
-
@alexvanboxel awesome! much appreciated your help! You can also check how Pulsar was integrated with Spark and Flink as a reference. The pulsar-flink one is the one is contributed to upstream Flink as FLIP-72 and https://github.com/streamnative/pulsar-spark @yjshen can help answer any questions about such integrations. |
Beta Was this translation helpful? Give feedback.
-
Great, I appreciate the help. I can happily report that after a few hours I already got some basic Google Cloud Pubsub -> Apache Beam -> Pulsar pipeline working (on the local runner). Don't get too excited to, still a lot of work to be done. I'll have a look at the integrations. |
Beta Was this translation helpful? Give feedback.
-
Cool, :) |
Beta Was this translation helpful? Give feedback.
-
Any update? |
Beta Was this translation helpful? Give feedback.
-
when looking into beams i/o connector matrix pulsar is not listed yet: |
Beta Was this translation helpful? Give feedback.
-
but there is a sdk package |
Beta Was this translation helpful? Give feedback.
-
to get an idea of the current state, there are some open issues in beam for pulsar: |
Beta Was this translation helpful? Give feedback.
-
in v2.39 of beam a relatively basic pulsar io was introduced: |
Beta Was this translation helpful? Give feedback.
-
I see that a Pulsar connector was made available as of BEAM 2.38.0 release but I don't see Pulsar as an official connector on below page. Is the Pulsar IO connector official or not? If official then can someone please update the page since it gives idea that a Pulsar IO connector is not available. |
Beta Was this translation helpful? Give feedback.
-
just opened an issue in Beam repository to maintain a good overview on this: |
Beta Was this translation helpful? Give feedback.
-
opened an issue to add current state of Pulsar support to Beams doc: |
Beta Was this translation helpful? Give feedback.
-
the progress for pulsar on becoming a first class citizen in beam, seems to have stucked :-( There are not only some
Have integrated them into the overview: apache/beam#31078 |
Beta Was this translation helpful? Give feedback.
-
if you are interested in Beam support for Pulsar plz upvote this discussion topic to gain visibility |
Beta Was this translation helpful? Give feedback.
-
I would like to use Apache Beam and Pulsar as my source, for writing my batch or streaming jobs.
Please consider adding a PulsarIO class to the sdk like Kafka (https://beam.apache.org/releases/javadoc/2.4.0/org/apache/beam/sdk/io/kafka/KafkaIO.html)
I found the pulsar-flink and pulsar-spark folders, but i really like the Apache Beam API.
Maybe you could add a pulsar-beam example or subfolder
Beta Was this translation helpful? Give feedback.
All reactions