Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(pairwise evals): add option to randomize order of experiments; print URL for pairwise experiment #672

Merged
merged 5 commits into from
May 8, 2024

Conversation

samnoyes
Copy link
Contributor

@samnoyes samnoyes commented May 8, 2024

No description provided.

@samnoyes samnoyes requested a review from hinthornw May 8, 2024 16:32
@@ -586,6 +590,13 @@ def evaluate_comparative(
raise ValueError("max_concurrency must be a positive integer.")
client = client or langsmith.Client()

if randomize_order:
Copy link
Collaborator

@hinthornw hinthornw May 8, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think I'd do this on a run level to average out bias within an experiment (as it stands, this will still have a consistent bias to 1 experiment, just random which one)

@samnoyes samnoyes changed the title feat(pairwise evals): add option to randomize order of experiments feat(pairwise evals): add option to randomize order of experiments; print URL for pairwise experiment May 8, 2024
dataset_id = comparative_experiment.reference_dataset_id
base_url = project_url.split("/projects/p/")[0]
comparison_url = (
f"{base_url}/datasets/{dataset_id}/compare?"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice - at some point may be nice to factor this uot

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does this work if there's > 2 experiments? Is it more future proof if we just added a list of all the IDs?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

good point, will do that

@samnoyes samnoyes merged commit 1f465d8 into main May 8, 2024
4 of 6 checks passed
@samnoyes samnoyes deleted the sam/add-randomize-to-pairwise branch May 8, 2024 20:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants