feat(pairwise evals): add option to randomize order of experiments; print URL for pairwise experiment #672
Conversation
@@ -586,6 +590,13 @@ def evaluate_comparative(
        raise ValueError("max_concurrency must be a positive integer.")
    client = client or langsmith.Client()

    if randomize_order:
I think I'd do this on a run level to average out bias within an experiment (as it stands, this will still have a consistent bias toward one experiment, it's just random which one).
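For illustration, a minimal sketch of what run-level randomization could look like, assuming a hypothetical pairing helper (the names `runs_a`/`runs_b` and the pairing shape are assumptions, not this PR's actual code):

```python
import random

# Hypothetical sketch: flip a coin per run/example rather than once per
# experiment, so neither experiment consistently lands in position 1.
def iter_randomized_pairs(runs_a, runs_b, randomize_order: bool = True):
    for run_a, run_b in zip(runs_a, runs_b):
        pair = [run_a, run_b]
        if randomize_order:
            random.shuffle(pair)  # independent shuffle for each pair
        yield tuple(pair)
```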
    dataset_id = comparative_experiment.reference_dataset_id
    base_url = project_url.split("/projects/p/")[0]
    comparison_url = (
        f"{base_url}/datasets/{dataset_id}/compare?"
Nice - at some point it may be nice to factor this out
Does this work if there are > 2 experiments? Would it be more future-proof if we just added a list of all the IDs?
good point, will do that
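A hedged sketch of what the list-based version might look like; the `selectedSessions` query parameter and the helper name are assumptions here, not something confirmed by this diff:

```python
# Hypothetical helper: accept any number of experiment IDs instead of
# hard-coding two. The "selectedSessions" parameter name is an assumption.
def build_comparison_url(base_url: str, dataset_id: str, experiment_ids: list[str]) -> str:
    joined = ",".join(str(eid) for eid in experiment_ids)
    return f"{base_url}/datasets/{dataset_id}/compare?selectedSessions={joined}"

# Usage (illustrative): build_comparison_url(base_url, dataset_id,
# [exp.id for exp in experiments]) works the same for 2 or more experiments.
```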