Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: reuse the same bigquery table for multiple model versions for Arize BQ Sink #531

Merged
merged 1 commit into from
Feb 19, 2024

Conversation

khorshuheng
Copy link
Collaborator

@khorshuheng khorshuheng commented Feb 5, 2024

Description

The current sink create a new bigquery table per model version. This makes it harder to implement Arize ground truth ingestion, because the ground truth provided by the users are typically model version agnostic.

Modifications

  • A single table will be used per model id, rather than model version
  • A new column, model_version, is added.
  • session id and row id are used in favor of prediction id as the concept of prediction id in Arize differs from Merlin

Tests

Checklist

  • Added PR label
  • Added unit test, integration, and/or e2e tests
  • Tested locally
  • Updated documentation
  • Update Swagger spec if the PR introduce API changes
  • Regenerated Golang and Python client if the PR introduces API changes

Release Notes


@ghost
Copy link

ghost commented Feb 5, 2024

👇 Click on the image for a new way to code review

Review these changes using an interactive CodeSee Map

Legend

CodeSee Map legend

@khorshuheng khorshuheng added the enhancement New feature or request label Feb 5, 2024
@khorshuheng khorshuheng changed the title feat: reuse the same bigquery table for multiple model versions for A… feat: reuse the same bigquery table for multiple model versions for Arize BQ Sink Feb 5, 2024
Copy link
Contributor

@tiopramayudi tiopramayudi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM ! Thanks @khorshuheng

@khorshuheng khorshuheng merged commit 9df00a5 into main Feb 19, 2024
32 checks passed
@khorshuheng khorshuheng deleted the general-bq-sink branch February 19, 2024 06:45
leonlnj pushed a commit that referenced this pull request Feb 20, 2024
…rize BQ Sink (#531)

<!--  Thanks for sending a pull request!  Here are some tips for you:

1. Run unit tests and ensure that they are passing
2. If your change introduces any API changes, make sure to update the
e2e tests
3. Make sure documentation is updated for your PR!

-->
# Description
<!-- Briefly describe the motivation for the change. Please include
illustrations where appropriate. -->
The current sink create a new bigquery table per model version. This
makes it harder to implement Arize ground truth ingestion, because the
ground truth provided by the users are typically model version agnostic.

# Modifications
<!-- Summarize the key code changes. -->
- A single table will be used per model id, rather than model version
- A new column, model_version, is added.
- session id and row id are used in favor of prediction id as the
concept of prediction id in Arize differs from Merlin

# Tests
<!-- Besides the existing / updated automated tests, what specific
scenarios should be tested? Consider the backward compatibility of the
changes, whether corner cases are covered, etc. Please describe the
tests and check the ones that have been completed. Eg:
- [x] Deploying new and existing standard models
- [ ] Deploying PyFunc models
-->

# Checklist
- [ ] Added PR label
- [ ] Added unit test, integration, and/or e2e tests
- [ ] Tested locally
- [ ] Updated documentation
- [ ] Update Swagger spec if the PR introduce API changes
- [ ] Regenerated Golang and Python client if the PR introduces API
changes

# Release Notes
<!--
Does this PR introduce a user-facing change?
If no, just write "NONE" in the release-note block below.
If yes, a release note is required. Enter your extended release note in
the block below.
If the PR requires additional action from users switching to the new
release, include the string "action required".

For more information about release notes, see kubernetes' guide here:
http://git.k8s.io/community/contributors/guide/release-notes.md
-->

```release-note

```
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants