-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Portable runners should be able to issue checkpoints to Splittable DoFn #20979
Comments
I have managed to pull off the workaround of adding First, you'll need to start the Java expansion service on your own. If you are deploying the expansion service on docker, you can simply pull the flink job service image (
Explanation on each of the flags:
|
Since this is new functionality I think that P2 is the right level. This is still an important priority for portable runners to function properly with SDF. |
CC @chamikaramj since tagged with xlang |
This is coming back as we are pushing forward for the SDF implementations for various sources (kafka, generate sequence, etc) as well as en route to runner v2 of Dataflow runner. I have read the context of BEAM-11998 and would like to work on this. |
Another problem related to this issue is that running PeriodicSequence on Flink runner, the pipeline first runs for ~1 minute but then will fail with error
code is pretty simple:
|
Guys, maybe it's a complete stupid question, but AFAIUnderstand it the workaround works only in case there is single worker/machine doing the job, but what in case e.g. Dataproc or where the load is theoretically spread and u cannot control everywhere how the expansion service is spun up? |
The expansion happens before the job is launched. It isn't per-worker. |
Thanks!
Well, scratch the above... I re-invented the wheel by figuring it's not enough and u definitely need that extra service running manually, externally. 🤷 |
hi, is this issue resolved. Im also facing same issue as of now.Need a fix |
To execute unbounded Splittable DoFn over fnapi in streaming mode properly, portable runners should issue split(ProcessBundleSplitRequest with fraction_of_remainder > 0) or simply checkpoint(ProcessBundleSplitRequest with fraction_of_remainder == 0) to SDK regularly to make current bundle finished processing instead of running forever.
Imported from Jira BEAM-11998. Original Jira may contain additional context.
Reported by: boyuanz.
The text was updated successfully, but these errors were encountered: