LeaderWorkerSet v0.3.0
Features:
- RollingUpdate with MaxSurge support
- Subgroup support for disaggregated serving
- Example of multi-node serving of Llama 70B on GPUs with vLLM
- Add a new start policy API
- Inject the leader address environment variable into every container
- spec.rolloutStrategy is now an optional field
Acknowledgments
Thanks to the contributors to this release, in alphabetical order:
@ahg-g @Edwinhr716 @googs1025 @gujingit @jjk-g @kerthcet @liurupeng @nayihz