Speedup Eval - Instrument With Otel #302

Merged 4 commits on Oct 16, 2024


jlewi
Owner

@jlewi jlewi commented Oct 15, 2024

Speedup Evaluation

Per #301, our experiments were really slow. We were observing times of approximately 30s to 60s to process individual examples. With the new dataset of 424 examples, a run at 60s per example would take ~7 hours (424 × 60s ≈ 7.1 hours), which significantly impedes iteration.

This PR instruments evaluation with OTEL so that we can see the bottlenecks in the code. Below are some graphs.

The graph below shows the P95 latency of processing evaluation examples. We can see it's about 30s.
experiment_traces_p95_latency

Below is a heatmap showing the duration of waitForBlockLog.
experiment_traces_heatmap_wait_for_block_log

We can see that it's about 20s-30s, so it accounts for a large portion of the total duration.

A big source of the latency is that the Analyzer uses a rate-limiting queue for reprocessing the logs. The max delay is 30s, so we are probably reprocessing the logs at 30s intervals. To speed this up, we make the delay configurable so we can use a shorter delay during experiments.

Here is an updated heatmap of the duration of wait for block log.
Screen Shot 2024-10-15 at 4 34 40 PM

The run with a maxDelaySeconds of 1s started at 16:25. We can see this drops the latency significantly, from about 30s to ~6s.


netlify bot commented Oct 15, 2024

Deploy Preview for foyle canceled.

Name Link
🔨 Latest commit b1b6247
🔍 Latest deploy log https://app.netlify.com/sites/foyle/deploys/670f11c1cc43e40008b633d3

@jlewi jlewi enabled auto-merge (squash) October 16, 2024 01:07
@jlewi jlewi merged commit 6f3a967 into main Oct 16, 2024
5 checks passed
@jlewi jlewi deleted the jlewi/evalotel branch October 16, 2024 01:12
@jlewi jlewi mentioned this pull request Oct 16, 2024