Machine Learning Software-as-a-Service providers with a lot of tenants need to keep these tenants’ prediction endpoints separate. Here are several ways to do that, balancing expense and robustness. We show a way to do flexible access control to the Sagemaker inferencing endpoints with an advanced use of Identity and Access management.

See the article “Implementing SaaS Tenant Isolation Using Amazon SageMaker Endpoints and IAM

Also, see the talk “Multitenant inference architecture with SageMaker endpoints”.