Initial release of Model Server

@lupesko lupesko released this 04 Dec 05:31
· 956 commits to master since this release
1de80b5

Key capabilities of Model Server for Apache MXNet v0.1.5:

  • Tooling to package and export all model artifacts into a single “model archive” file that encapsulates everything required for serving an MXNet model.
  • Automated setup of a serving stack, including HTTP inference endpoints and an MXNet-based engine, all automatically configured for the specific models being hosted.
  • Pre-configured Docker images, set up with NGINX, MXNet and MMS, for scalable model serving.
  • Ability to customize every step in the inference execution pipeline, from model initialization through pre-processing and inference to post-processing of the model’s output.
  • Real-time operational metrics to monitor the inference service and endpoints, covering key metrics such as latencies, resource utilization and errors.
  • OpenAPI-enabled service that is easy to integrate with and can auto-generate client code for popular stacks such as Java, JavaScript, C# and more.
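
To make the customizable pipeline more concrete, here is a minimal sketch of the four stages named above (model initialization, pre-processing, inference, post-processing). It is an illustration only and does not use the MMS API: the class names `InferencePipeline` and `ScaledPipeline`, the `handle` method, and the `load_dummy_model` helper are all hypothetical, and the "model" is a plain Python function standing in for an MXNet model.

```python
from typing import Callable, Dict, List

# A "model" here is just a function from inputs to outputs (assumption for illustration).
Model = Callable[[List[float]], List[float]]


class InferencePipeline:
    """Hypothetical four-stage pipeline; NOT the actual MMS service class."""

    def __init__(self, load_model: Callable[[], Model]) -> None:
        # Stage 1, model initialization: runs once when the service starts.
        self.model = load_model()

    def preprocess(self, raw: str) -> List[float]:
        # Stage 2, pre-processing: parse a comma-separated request body.
        return [float(x) for x in raw.split(",")]

    def inference(self, inputs: List[float]) -> List[float]:
        # Stage 3, inference: run the loaded model on the prepared inputs.
        return self.model(inputs)

    def postprocess(self, outputs: List[float]) -> Dict[str, int]:
        # Stage 4, post-processing: report the index of the largest output.
        return {"prediction": outputs.index(max(outputs))}

    def handle(self, raw: str) -> Dict[str, int]:
        # Chain the per-request stages in order.
        return self.postprocess(self.inference(self.preprocess(raw)))


def load_dummy_model() -> Model:
    # Stand-in model that doubles every input value.
    return lambda xs: [2.0 * x for x in xs]


class ScaledPipeline(InferencePipeline):
    """Customizes a single stage by overriding it; the others are inherited."""

    def preprocess(self, raw: str) -> List[float]:
        # Scale 0-255 values into the 0-1 range before inference.
        return [x / 255.0 for x in super().preprocess(raw)]
```

With this structure, customizing one step means overriding one method: `ScaledPipeline(load_dummy_model).handle("1,5,3")` reuses the inherited inference and post-processing stages and changes only how the request is parsed.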