Benchmarking update - phase 1 #339

dbarbuzzi · 2024-06-26T20:13:11Z

This PR updates the benchmarking performed in remote-push and nightly runs according to the first set of deliverables from our recent meeting:

Only the benchmark_serving.json config is run
- This is accomplished with a new list, nm_benchmark_base_config_list.txt, other lists are untouched
The benchmark_serving.json has various reductions:
- Model list reduced to facebook/opt-350m and meta-llama/Meta-Llama-3-8B-Instruct
- nr-qps list reduced to 300,1
- Metric tracking reduced to mean TPOT and mean TTFT (other metrics still recorded/logged per usual)

There is also a small fix related to server startup (changing from localhost to 127.0.0.1 because localhost on the machines is mapped to the IPv6 ::1 which something in the server stack doesn’t seem to like).

In a commit prior to opening the PR with all functional changes, the full benchmark job took <30 min:
https://github.com/neuralmagic/nm-vllm/actions/runs/9669361155/job/26709082658

andy-neuma

thanks

dbarbuzzi and others added 12 commits June 25, 2024 16:11

fix benchmark_serving.json formatting

1520664

Reduce model list

c92a388

Reduce nr/qps pairs

b4a32ff

Reduce runs to benchmark_serving.json

e5155ac

Initial metric reduction

004a87e

Update server host

2101045

Debug HOSTS file

388fd55

Remove debug step

1b166f0

Update debug logging

496b061

Remove debug line

9ee1750

Merge branch 'main' into benchmarking-update

9f35fb6

Merge branch 'main' into benchmarking-update

5d868be

andy-neuma approved these changes Jun 28, 2024

View reviewed changes

derekk-nm approved these changes Jun 28, 2024

View reviewed changes

dbarbuzzi merged commit 569c905 into main Jun 28, 2024
28 checks passed

dbarbuzzi deleted the benchmarking-update branch June 28, 2024 20:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmarking update - phase 1 #339

Benchmarking update - phase 1 #339

dbarbuzzi commented Jun 26, 2024

andy-neuma left a comment

Benchmarking update - phase 1 #339

Benchmarking update - phase 1 #339

Conversation

dbarbuzzi commented Jun 26, 2024

andy-neuma left a comment

Choose a reason for hiding this comment