Invention Application
- Patent Title: AUTO-SCALING HOSTED MACHINE LEARNING MODELS FOR PRODUCTION INFERENCE
-
Application No.: US15822061Application Date: 2017-11-24
-
Publication No.: US20190164080A1Publication Date: 2019-05-30
- Inventor: Stefano STEFANI , Steven Andrew LOEPPKY , Thomas Albert FAULHABER, JR. , Craig WILEY , Edo LIBERTY
- Applicant: Amazon Technologies, Inc.
- Main IPC: G06N99/00
- IPC: G06N99/00 ; G06F9/455

Abstract:
Techniques for auto-scaling hosted machine learning models for production inference are described. A machine learning model can be deployed in a hosted environment such that the infrastructure supporting the machine learning model scales dynamically with demand so that performance is not impacted. The model can be auto-scaled using reactive techniques or predictive techniques.
Public/Granted literature
- US11126927B2 Auto-scaling hosted machine learning models for production inference Public/Granted day:2021-09-21
Information query