Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Duet AI Transform service SDK Harness prompts #30499

Merged

Conversation

olehborysevych
Copy link
Collaborator

Transform service lookup prompts


Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:

  • Mention the appropriate issue in your description (for example: addresses #123), if applicable. This will automatically add a link to the pull request in the issue. If you would like the issue to automatically close on merging the pull request, comment fixes #<ISSUE NUMBER> instead.
  • Update CHANGES.md with noteworthy changes.
  • If this contribution is large, please file an Apache Individual Contributor License Agreement.

See the Contributor Guide for more tips on how to make review process smoother.

To check the build health, please visit https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md

GitHub Actions Tests Status (on master branch)

Build python source distribution and wheels
Python tests
Java tests
Go tests

See CI.md for more information about GitHub Actions CI or the workflows README to see a list of phrases to trigger workflows.

Copy link
Contributor

@dariabezkorovaina dariabezkorovaina left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@olehborysevych looks good, I improved the wording and fixed minor issues with formatting.

Comment on lines 17 to 18
Currently the following transforms are supported:

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Currently the following transforms are supported:
Currently, the Transform service can upgrade the following transforms:

Comment on lines 19 to 23
* BigQuery read transform: `beam:transform:org.apache.beam:bigquery_read:v1`
* BigQuery write transform: `beam:transform:org.apache.beam:bigquery_write:v1`
* Kafka read transform: `beam:transform:org.apache.beam:kafka_read_with_metadata:v2`
* Kafka write transform: `beam:transform:org.apache.beam:kafka_write:v2`

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
* BigQuery read transform: `beam:transform:org.apache.beam:bigquery_read:v1`
* BigQuery write transform: `beam:transform:org.apache.beam:bigquery_write:v1`
* Kafka read transform: `beam:transform:org.apache.beam:kafka_read_with_metadata:v2`
* Kafka write transform: `beam:transform:org.apache.beam:kafka_write:v2`
* BigQuery read: `beam:transform:org.apache.beam:bigquery_read:v1`
* BigQuery write: `beam:transform:org.apache.beam:bigquery_write:v1`
* Kafka read: `beam:transform:org.apache.beam:kafka_read_with_metadata:v2`
* Kafka write: `beam:transform:org.apache.beam:kafka_write:v2`

* Kafka read transform: `beam:transform:org.apache.beam:kafka_read_with_metadata:v2`
* Kafka write transform: `beam:transform:org.apache.beam:kafka_write:v2`

Transform service implement Beam expansion API. This means you can use the Transform service to construct and execute multi-language pipelines. For example, you can build a Python pipeline that uses Java `KafkaIO` transform and execute in without installing Java locally.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Transform service implement Beam expansion API. This means you can use the Transform service to construct and execute multi-language pipelines. For example, you can build a Python pipeline that uses Java `KafkaIO` transform and execute in without installing Java locally.
The Transform service implements the Beam expansion API, enabling multi-language pipelines to leverage it for expanding supported transforms. This feature allows you to create and run multi-language pipelines without additional language runtimes. For instance, you can build a Python pipeline that utilizes a Java `KafkaIO` transform without the need to install Java locally.


Transform service implement Beam expansion API. This means you can use the Transform service to construct and execute multi-language pipelines. For example, you can build a Python pipeline that uses Java `KafkaIO` transform and execute in without installing Java locally.

Transform service can be started automatically by Apache Beam SDK or manually by users:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Transform service can be started automatically by Apache Beam SDK or manually by users:
In some cases, Apache Beam SDKs can start the Transform service automatically, provided that Docker is available locally. You can also start the Transform service manually by running the following command:

Comment on lines 32 to 57
Beam transform service includes a number of transforms implemented in the Apache Beam Java and Python SDKs:

* Java transforms: Google Cloud I/O connectors, the Kafka I/O connector, and the JDBC I/O connector
* Python transforms: all portable transforms implemented within the Apache Beam Python SDK, such as RunInference and DataFrame transforms.






















Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Beam transform service includes a number of transforms implemented in the Apache Beam Java and Python SDKs:
* Java transforms: Google Cloud I/O connectors, the Kafka I/O connector, and the JDBC I/O connector
* Python transforms: all portable transforms implemented within the Apache Beam Python SDK, such as RunInference and DataFrame transforms.
The Beam Transform service includes several portable transforms implemented in the Apache Beam Java and Python SDKs:
* Java transforms: Google Cloud I/O connectors, the Kafka I/O connector, and the JDBC I/O connector.
* Python transforms: all portable transforms implemented within the Apache Beam Python SDK, such as RunInference and DataFrame transforms.

Co-authored-by: Daria Bezkorovaina <[email protected]>
@olehborysevych olehborysevych changed the title [DRAFT] transform service prompt added Duet AI Transform service SDK Harness prompts Mar 11, 2024
@olehborysevych olehborysevych marked this pull request as ready for review March 11, 2024 07:56
@olehborysevych
Copy link
Collaborator Author

Hey @damccorm this one is also ready for review

Copy link
Contributor

Assigning reviewers. If you would like to opt out of this review, comment assign to next reviewer:

R: @AnandInguva added as fallback since no labels match configuration

Available commands:

  • stop reviewer notifications - opt out of the automated review tooling
  • remind me after tests pass - tag the comment author after tests pass
  • waiting on author - shift the attention set back to the author (any comment or push by the author will return the attention set to the reviewers)

The PR bot will only process comments in the main thread (not review comments).

Copy link
Contributor

@damccorm damccorm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!

@damccorm damccorm merged commit 31b8a9e into apache:master Mar 11, 2024
4 checks passed
hjtran pushed a commit to hjtran/beam that referenced this pull request Apr 4, 2024
* transform service prompt added

* SDK harness added

* fix whitespace

* Apply suggestions from code review

Co-authored-by: Daria Bezkorovaina <[email protected]>

---------

Co-authored-by: Daria Bezkorovaina <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants