[Bug]: ReadFromKafka not forwarding in streaming mode version on portable runners #25114

Open
1 of 15 tasks
jihad-akl opened this issue Jan 22, 2023 · 26 comments

Comments

@jihad-akl

jihad-akl commented Jan 22, 2023

What happened?

ReadFromKafka is not forwarding messages to the next step in streaming mode.
Using apache-beam 2.44.0.

import json

import apache_beam as beam
from apache_beam.io.kafka import ReadFromKafka
from apache_beam.options.pipeline_options import PipelineOptions

topic = ["multi-video-stream"]

beam_options = PipelineOptions(streaming=True)
pipeline = beam.Pipeline(options=beam_options)

messages = (
    pipeline
    | 'Read from Kafka' >> ReadFromKafka(
        consumer_config=json.load(open("config/consumer_config_beam.json")),
        topics=topic)
    | 'Print messages' >> beam.Map(lambda message: print("received!")))

Hello, with the code above, the pipeline gets stuck on ReadFromKafka.
Adding max_num_records only waits for that specific number of records, forwards them to the next step, and then the pipeline ends.
(I am using the DirectRunner because I need to run the code locally.)
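
For illustration, a minimal sketch of the bounded behaviour described above, assuming the same config file and topic as the snippet; max_num_records is an existing ReadFromKafka parameter that turns the read into a bounded one:

import json

import apache_beam as beam
from apache_beam.io.kafka import ReadFromKafka
from apache_beam.options.pipeline_options import PipelineOptions

# Sketch only: with max_num_records the read becomes bounded, so the pipeline
# waits for that many records, forwards them, and then finishes (the behaviour
# described above) instead of streaming indefinitely.
with beam.Pipeline(options=PipelineOptions(streaming=True)) as p:
    (p
     | 'Read from Kafka' >> ReadFromKafka(
         consumer_config=json.load(open("config/consumer_config_beam.json")),
         topics=["multi-video-stream"],
         max_num_records=10)
     | 'Print messages' >> beam.Map(lambda message: print("received!", message)))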

Issue Priority

Priority: 1 (data loss / total loss of function)

Issue Components

  • Component: Python SDK
  • Component: Java SDK
  • Component: Go SDK
  • Component: Typescript SDK
  • Component: IO connector
  • Component: Beam examples
  • Component: Beam playground
  • Component: Beam katas
  • Component: Website
  • Component: Spark Runner
  • Component: Flink Runner
  • Component: Samza Runner
  • Component: Twister2 Runner
  • Component: Hazelcast Jet Runner
  • Component: Google Cloud Dataflow Runner
@jihad-akl jihad-akl changed the title [Bug]: [Bug]: ReadFromKafka not forwarding in streaming mode version 2.44.0 Jan 22, 2023
@Abacn
Contributor

Abacn commented Jan 22, 2023

A streaming pipeline by definition will not end; besides, the Python direct runner is not meant for production and does not have full support for streaming. This is most likely working as intended.

@Abacn Abacn added P2 and removed P1 labels Jan 22, 2023
@jihad-akl
Author

jihad-akl commented Jan 23, 2023

True, so how can I use an Apache Beam pipeline in streaming mode if it only gathers data and does not send it to the next step?
The "received!" print does not trigger every time I receive a message from Kafka locally. Isn't that an important bug? Does the direct runner need some fixes?

@tvalentyn
Contributor

#24528 tracks various issues related to the streaming direct runner. I am not sure whether it is able to run a simple KafkaIO pipeline. Are you able to use a portable Flink Runner, by chance?

@jihad-akl
Author

I am trying to implement it, but so far I am facing the same issue: I can see my pipeline in the Apache Flink UI at localhost:8081, but nothing happens. I am debugging it to see whether I made any mistake.

@jihad-akl
Author

So after researching and testing, I found that the Flink Runner does not help, because Apache Flink gathers a lot of data before releasing it. My use case is that every message I receive from Kafka must be forwarded to the next step in the pipeline (locally).
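
As a hedged aside (not confirmed as a fix anywhere in this thread): on the portable Flink runner, streaming output can be held back until a checkpoint completes, so a short checkpointing interval is sometimes set on the runner options to reduce that buffering. A sketch:

from apache_beam.options.pipeline_options import PipelineOptions

# Hedged sketch only: --checkpointing_interval is a Flink runner option (in
# milliseconds). Whether it unblocks this particular ReadFromKafka case is not
# confirmed in this thread.
beam_options = PipelineOptions([
    "--runner=FlinkRunner",
    "--flink_version=1.15",
    "--flink_master=localhost:8081",
    "--environment_type=LOOPBACK",
    "--streaming",
    "--checkpointing_interval=1000",
])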

@Abacn
Contributor

Abacn commented Jan 24, 2023

FYI, if the Flink runner has the same issue, it may be hitting #22809; the issue on the Python side may still persist.
CC: @johnjcasey
I will also take a look.

@jihad-akl
Author

To reproduce:
consumer_config.json:
{
  "bootstrap.servers": "127.0.0.1:9092"
}
main.py:
import json

import apache_beam as beam
from apache_beam.io.kafka import ReadFromKafka
from apache_beam.options.pipeline_options import PipelineOptions

topic = ["multi-video-stream"]

beam_options = PipelineOptions(["--runner=FlinkRunner", "--flink_version=1.15", "--flink_master=localhost:8081",
                                "--environment_type=LOOPBACK", "--streaming"])
with beam.Pipeline(options=beam_options) as p:
    messages = p | 'Read from kafka' >> ReadFromKafka(consumer_config=json.load(open("consumer_config.json")),
                                                      topics=topic)
    messages | 'Print Messages' >> beam.Map(print)
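
For reference, ReadFromKafka with the default deserializers emits (key, value) tuples of raw bytes, so a small decoding step (the helper name below is only illustrative) is often placed before printing, inside the same with-block:

    def decode_message(kv):
        # Illustrative helper: with the default deserializers, key and value are bytes.
        key, value = kv
        return (key.decode("utf-8") if key is not None else None,
                value.decode("utf-8"))

    messages | 'Decode' >> beam.Map(decode_message) | 'Print decoded' >> beam.Map(print)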

@jihad-akl
Author

jihad-akl commented Jan 25, 2023

I am using this producer_config.json:
{
  "bootstrap.servers": "localhost:9092",
  "enable.idempotence": true,
  "retries": 100,
  "max.in.flight.requests.per.connection": 5,
  "compression.type": "snappy",
  "linger.ms": 5,
  "batch.num.messages": 1,
  "queue.buffering.max.ms": 0,
  "queue.buffering.max.messages": 10
}
and this Kafka docker-compose file:
https://github.com/conduktor/kafka-stack-docker-compose/blob/master/zk-single-kafka-single.yml
for the producer code:

import json
import time

from confluent_kafka import Producer


def delivery_report(err, msg):
    # Minimal delivery callback (the original definition was not included in this comment).
    if err is not None:
        print("Delivery failed:", err)


producer = Producer(json.load(open("producer_config.json")))
frame_no = 0
while True:
    frame_bytes = "hello" + str(frame_no)
    producer.produce(
        topic="multi-video-stream",
        value=frame_bytes,
        on_delivery=delivery_report,
        timestamp=frame_no,
        headers={
            "test": str.encode("test")
        }
    )
    frame_no += 1
    # producer.poll(1)
    producer.flush()

    time.sleep(0.1)

@jihad-akl
Author

jihad-akl commented Jan 25, 2023

Please note that if I use Flink runner 1.14 I get:

ERROR:apache_beam.utils.subprocess_server:Starting job service with ['java', '-jar', '/root/.apache_beam/cache/jars/beam-runners-flink-1.14-job-server-2.44.0.jar', '--flink-master', 'http://localhost:8081', '--artifacts-dir', '/tmp/beam-temp1of29sbe/artifactsz8je29uo', '--job-port', '33755', '--artifact-port', '0', '--expansion-port', '0']
ERROR:apache_beam.utils.subprocess_server:Error bringing up service
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/apache_beam/utils/subprocess_server.py", line 88, in start
    raise RuntimeError(
RuntimeError: Service failed to start up with error 1
Traceback (most recent call last):
  File "main.py", line 38, in <module>
    with beam.Pipeline(options=beam_options) as p:
  File "/usr/local/lib/python3.10/dist-packages/apache_beam/pipeline.py", line 600, in __exit__
    self.result = self.run()
  File "/usr/local/lib/python3.10/dist-packages/apache_beam/pipeline.py", line 577, in run
    return self.runner.run_pipeline(self, self._options)
  File "/usr/local/lib/python3.10/dist-packages/apache_beam/runners/portability/flink_runner.py", line 45, in run_pipeline
    return super().run_pipeline(pipeline, options)
  File "/usr/local/lib/python3.10/dist-packages/apache_beam/runners/portability/portable_runner.py", line 439, in run_pipeline
    job_service_handle = self.create_job_service(options)
  File "/usr/local/lib/python3.10/dist-packages/apache_beam/runners/portability/portable_runner.py", line 318, in create_job_service
    return self.create_job_service_handle(server.start(), options)
  File "/usr/local/lib/python3.10/dist-packages/apache_beam/runners/portability/job_server.py", line 81, in start
    self._endpoint = self._job_server.start()
  File "/usr/local/lib/python3.10/dist-packages/apache_beam/runners/portability/job_server.py", line 110, in start
    return self._server.start()
  File "/usr/local/lib/python3.10/dist-packages/apache_beam/utils/subprocess_server.py", line 88, in start
    raise RuntimeError(
RuntimeError: Service failed to start up with error

@Abacn
Contributor

Abacn commented Jan 27, 2023

Per #22809, the cause is likely #20979. It is due to a feature lacking in the Python portable runner. The Dataflow runner is not affected. What I am not sure about is why the unbounded reader is also not working.
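
For anyone experimenting with the SDF gap mentioned above: Beam has a use_deprecated_read experiment that asks a runner to use the legacy (non-SDF) read translation. This thread does not confirm that it helps the cross-language ReadFromKafka on portable runners, so treat the sketch below purely as something to try:

from apache_beam.options.pipeline_options import PipelineOptions

# Hedged sketch: request the legacy (non-SDF) read path via an experiment flag.
# Not confirmed in this thread to work for the cross-language Kafka read.
beam_options = PipelineOptions([
    "--runner=FlinkRunner",
    "--flink_master=localhost:8081",
    "--environment_type=LOOPBACK",
    "--streaming",
    "--experiments=use_deprecated_read",
])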

@vjixy

vjixy commented Jan 29, 2023

So there are bugs in the portable runners? :(

@Abacn Abacn changed the title [Bug]: ReadFromKafka not forwarding in streaming mode version 2.44.0 [Bug]: ReadFromKafka not forwarding in streaming mode version on portable runners Feb 3, 2023
@Abacn
Contributor

Abacn commented Feb 3, 2023

Yes, or a missing feature.

@alexmreis

The implementation of Kafka in the Python SDK + Portable Runner is unfortunately rather broken for streaming use cases. I don't understand why there isn't a native python implementation based on https://github.com/confluentinc/confluent-kafka-python that doesn't have to deal with the portability layer. It would be much more reliable, even if maybe less capable of parallel compute.

Our company has abandoned Beam and Dataflow for this very reason. The last bug I opened, in August 2022 (#22809), was closed today but still depends on two other issues, one of which (#25114) remains unsolved half a year later. The Python SDK is clearly not a priority for the core team. Maybe they're too busy focusing on GCP-specific products like PubSub to put in the effort to make open-source tools, like Kafka, work properly in Beam's Python SDK. There isn't even a single unit test in the test suite for an unbounded Kafka stream being windowed and keyed.

As someone who really believes in Beam as a great portable standard for data engineering, it's sad to see the lack of interest from the core team in anything that is not making Google money (although we would still be paying for Dataflow if it worked).

@Abacn
Contributor

Abacn commented Feb 4, 2023

Hi @alexmreis, sorry if there is any misunderstanding. #22809 was closed because the issue on the KafkaIO side was fixed by #24205 (its comment closes #22809: #24205 (comment)). That said, the Dataflow Runner use case should be fixed in the upcoming Beam v2.45.0.

The remaining issues on the portable runners (Flink, streaming direct runner) are not limited to the Kafka source; they affect all "splittable DoFn" streaming sources. This functionality is not yet supported by the portable runners (#20979). I also got bitten by this issue quite often (when validating the fix of #24205; see the comments I left on #22809). The gap between the Dataflow and local runners is definitely an important thing to improve, and it has a direct impact on developers.

Besides, having no unit tests in the Python Kafka IO is intended. Within the cross-language framework, the code running the Kafka read is Java's KafkaIO, and the unit tests are exercised there. We have Cross-Language Validates Runner (XVR) tests for each xlang IO and each SDK, run on a schedule, and I recently added a Python KafkaIO performance test as well. That said, KafkaIO in both Java and Python is a priority for our team.

@hadikoub

hadikoub commented Mar 2, 2023

Was this issue addressed in the new version 2.45?

@Abacn
Contributor

Abacn commented Mar 2, 2023

Was this issue addressed in the new version 2.45?

Not yet. This is a feature gap in the portable runner and may need substantial effort. I am trying to work on it currently, though.

@jihad-akl
Author

Any update for this issue in version 2.47?

@jihad-akl
Author

Any update for this issue in version 2.48?

@jihad-akl
Author

Was this issue addressed in the new version 2.45?

Not yet. This is a feature gap in the portable runner and may need substantial effort. I am trying to work on it currently, though.

Any update?

@Abacn
Contributor

Abacn commented Jun 2, 2023

Not able to get into this.

@jihad-akl
Author

Any update for this issue in version 2.49?

@jihad-akl
Author

Not able to get into this.

Any idea where the problem is, so I can try to make a workaround?

@jihad-akl
Author

Any news for version 2.50?

@jihad-akl
Author

Almost a year, and I still have not gotten a clear response on whether this bug will be fixed; seven versions from 2.44 to 2.51 and the bug remains.
Will this bug be fixed?

@MrYanMYN

For anybody stumbling upon this issue: a year later, this bug is still present.

@liferoad
Collaborator

There is no plan to fix this for the Python DirectRunner. We are moving to the Prism runner (#29650). The goal is to make it the default local runner for all SDKs, so users can do local testing and development with it. This work is currently ongoing.

@kennknowles FYI. For Beam on Flink.
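
For context, a sketch of selecting the Prism runner mentioned above from Python, assuming a Beam version that ships it; its support for this streaming Kafka case is not stated in the comment:

from apache_beam.options.pipeline_options import PipelineOptions

# Hedged sketch: pick the Prism runner (#29650) as the local runner. Availability
# and streaming Kafka support depend on the Beam version in use.
beam_options = PipelineOptions(["--runner=PrismRunner", "--streaming"])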
