Release Initial release of Model Server · awslabs/multi-model-server

Key capabilities of Model Server for Apache MXNet v0.1.5:

Tooling to package and export all model artifacts into a single “model archive” file that encapsulates everything required for serving an MXNet model.
Automated setup of a serving stack, including HTTP inference endpoints, MXNet-based engine, all automatically configured for the specific models being hosted.
Pre-configured Docker images, setup with NGINX, MXNet and MMS, for scalable model serving.
Ability to customize every step in the inference execution pipeline, from model initialization, through pre-processing and inference, and up to post-processing the model’s output.
Real time operational metrics to monitor the inference service and endpoints, covering key metrics such as latencies, resource utilization and errors.
OpenAPI-enabled service, that is easy to integrate with, and that can auto-generate client code for popular stacks such as Java, JavaScript, C# and more.

Provide feedback