Release v0.2.32
What's Changed
- Fix for single turn dataset by @toslunar in #2509
- replace os.getenv with os.path.expanduser because the first one doesn… by @khalil-Hennara in #2515
- Fix arena by @merrymercy in #2522
- Update Dockerfile by @dubaoquan404 in #2524
- add Llama2ChangAdapter by @lcw99 in #2510
- Add ExllamaV2 Inference Framework Support. by @leonxia1018 in #2455
- Improve docs by @merrymercy in #2534
- Fix warnings for new gradio versions by @merrymercy in #2538
- Improve chat templates by @merrymercy in #2539
- Add Zephyr 7B Alpha by @lewtun in #2535
- Improve Support for Mistral-Instruct by @Steve-Tech in #2547
- correct max_tokens by context_length instead of raise exception by @liunux4odoo in #2544
- Revert "Improve Support for Mistral-Instruct" by @merrymercy in #2552
- Fix Mistral template by @normster in #2529
- Add additional Informations from the vllm worker by @SebastianBodza in #2550
- Make FastChat work with LMSYS-Chat-1M Code by @CodingWithTim in #2551
- Create
tags
attribute to fixMarkupError
in rich CLI by @Steve-Tech in #2553 - move BaseModelWorker outside serve.model_worker to make it independent by @liunux4odoo in #2531
- Misc style and bug fixes by @merrymercy in #2559
- Fix README.md by @infwinston in #2561
- release v0.2.31 by @merrymercy in #2563
- resolves #2542 modify dockerfile to upgrade cuda to 12.2.0 and pydantic 1.10.13 by @alexdelapaz in #2565
- Add airoboros_v3 chat template (llama-2 format) by @jondurbin in #2564
- Add Xwin-LM V0.1, V0.2 support by @REIGN12 in #2566
- Fixed model_worker generate_gate may blocked main thread (#2540) by @lvxuan263 in #2562
- feat: add claude-v2 by @congchan in #2571
- Update vigogne template by @bofenghuang in #2580
- Fix issue #2568: --device mps led to TypeError: forward() got an unexpected keyword argument 'padding_mask'. by @Phil-U-U in #2579
- Add Mistral-7B-OpenOrca conversation_temmplate by @waynespa in #2585
- docs: bit misspell comments model adapter default template name conversation by @guspan-tanadi in #2594
- Update Mistral template by @Gk-rohan in #2581
- Update README.md (vicuna-v1.3 -> vicuna-1.5) by @infwinston in #2592
- Update README.md to highlight chatbot arena by @infwinston in #2596
- Add Lemur model by @ugolotti in #2584
- add trust_remote_code=True in BaseModelAdapter by @edisonwd in #2583
- Openai interface add use beam search and best of 2 by @leiwen83 in #2442
- Update qwen and add pygmalion by @Trangle in #2607
- feat: Support model AquilaChat2 by @fangyinc in #2616
- Added settings vllm by @SebastianBodza in #2599
- [Logprobs] Support logprobs=1 by @comaniac in #2612
New Contributors
- @toslunar made their first contribution in #2509
- @khalil-Hennara made their first contribution in #2515
- @dubaoquan404 made their first contribution in #2524
- @leonxia1018 made their first contribution in #2455
- @lewtun made their first contribution in #2535
- @normster made their first contribution in #2529
- @SebastianBodza made their first contribution in #2550
- @alexdelapaz made their first contribution in #2565
- @REIGN12 made their first contribution in #2566
- @lvxuan263 made their first contribution in #2562
- @Phil-U-U made their first contribution in #2579
- @waynespa made their first contribution in #2585
- @guspan-tanadi made their first contribution in #2594
- @Gk-rohan made their first contribution in #2581
- @ugolotti made their first contribution in #2584
- @edisonwd made their first contribution in #2583
- @fangyinc made their first contribution in #2616
- @comaniac made their first contribution in #2612
Full Changelog: v0.2.30...v0.2.32