Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug]: Prism runner "received message larger than max" error #29147

Closed
1 of 16 tasks
miracvbasaran opened this issue Oct 26, 2023 · 3 comments · Fixed by #29178
Closed
1 of 16 tasks

[Bug]: Prism runner "received message larger than max" error #29147

miracvbasaran opened this issue Oct 26, 2023 · 3 comments · Fixed by #29178
Assignees

Comments

@miracvbasaran
Copy link
Contributor

What happened?

While updating Privacy on Beam's dependency on Apache Beam Go to version v2.51.0, I am receiving the following error:

2023/10/26 15:12:58 stopping worker job-001[go-job-1-1698325959973818582]_go
    distinct_id_test.go:318: DistinctPrivacyID didn't add any Laplace noise: job failed to prepare
        	caused by:
        rpc error: code = ResourceExhausted desc = grpc: received message larger than max (4236507 vs. 4194304)
        error may be due to Apache Beam Go's migration from the direct runner to the prism runner. While the failure(s) should be fixed, you can continue to use the direct runner with this TestMain override: `func TestMain(m *testing.M) { ptest.MainWithDefault(m, "direct") }`
--- FAIL: TestDistinctPrivacyIDAddsNoise (19.35s)

on this test. Curiously, it works fine for all the other tests.

Is this a known issue? Are there any actions I can take to resolve this?

Issue Priority

Priority: 2 (default / most bugs should be filed as P2)

Issue Components

  • Component: Python SDK
  • Component: Java SDK
  • Component: Go SDK
  • Component: Typescript SDK
  • Component: IO connector
  • Component: Beam YAML
  • Component: Beam examples
  • Component: Beam playground
  • Component: Beam katas
  • Component: Website
  • Component: Spark Runner
  • Component: Flink Runner
  • Component: Samza Runner
  • Component: Twister2 Runner
  • Component: Hazelcast Jet Runner
  • Component: Google Cloud Dataflow Runner
@lostluck
Copy link
Contributor

Ah good catch. I thought I increased message size to maximum everywhere.

Do note that prism isn't yet optimized for performance. It will presently be slower than the direct runner. In the short term though, as the error says, override to use the direct runner.

@lostluck
Copy link
Contributor

lostluck commented Oct 27, 2023

OH! Job failed to prepare means it's failing on pipeline submission.

Your pipeline is > 4MB serialized.

(Still can be fixed however.)

@miracvbasaran
Copy link
Contributor Author

Thanks for the fix!

@github-actions github-actions bot added this to the 2.52.0 Release milestone Oct 30, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants