LeaderWorkerSet v0.2.0
Features:
- Support RollingUpdate with MaxUnavailable
- Allow Prometheus to scrape metrics exposed by controller-runtime
- Fix TPU env var assignment when leader pod doesn't request TPU
- User guide to deploy multi-host inference with Saxml
- Increase QPS limit for pod scheduling
- Set up E2E tests and improve test coverage
Acknowledgments
Thanks to our contributors in this release, in alphabetical order:
@ahg-g @Bslabe123 @Edwinhr716 @googs1025 @kannon92 @kerthcet @liurupeng @nayihz @Zeel-Patel