inference-server

Pluggable Python HTTP web service (WSGI) for real-time AI/ML model inference compatible with Amazon SageMaker.

Implemented as a werkzeug WSGI application which can be served using a Gunicorn webserver, for example.

Contents