Invention Grant
- Patent Title: Auto-scaling hosted machine learning models for production inference
-
Application No.: US15822061Application Date: 2017-11-24
-
Publication No.: US11126927B2Publication Date: 2021-09-21
- Inventor: Stefano Stefani , Steven Andrew Loeppky , Thomas Albert Faulhaber, Jr. , Craig Wiley , Edo Liberty
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: NDWE LLP
- Main IPC: G06N3/02
- IPC: G06N3/02 ; G06F9/455 ; G06N20/00 ; G06F9/50 ; G06Q10/10 ; H04L12/24 ; H04L12/26 ; H04L29/12

Abstract:
Techniques for auto-scaling hosted machine learning models for production inference are described. A machine learning model can be deployed in a hosted environment such that the infrastructure supporting the machine learning model scales dynamically with demand so that performance is not impacted. The model can be auto-scaled using reactive techniques or predictive techniques.
Public/Granted literature
- US20190164080A1 AUTO-SCALING HOSTED MACHINE LEARNING MODELS FOR PRODUCTION INFERENCE Public/Granted day:2019-05-30
Information query