Auto scaling a distributed predictive analytics system with machine learning

    公开(公告)号:US11481598B2

    公开(公告)日:2022-10-25

    申请号:US15822439

    申请日:2017-11-27

    IPC分类号: G06N3/04 G06N3/063 G06N3/08

    摘要: A computer-implemented method for creating an auto-scaled predictive analytics model includes determining, via a processor, whether a queue size of a service master queue is greater than zero. Responsive to determining that the queue size is greater than zero, the processor fetches a count of requests in a plurality of requests in the service master queue and a type for each of the requests. The processor derives a value for time required for each of the requests and retrieves a number of available processing nodes based on the time required for each of the requests. The processor then auto-scales a processing node number responsive to determining that a total execution time for all of the requests in the plurality of requests exceeds a predetermined time value and outputs an auto-scaled predictive analytics model based on the processing node number and queue size.

    AUTO SCALING A DISTRIBUTED PREDICTIVE ANALYTICS SYSTEM WITH MACHINE LEARNING

    公开(公告)号:US20190164033A1

    公开(公告)日:2019-05-30

    申请号:US15822439

    申请日:2017-11-27

    IPC分类号: G06N3/04 G06N3/08 G06N3/063

    摘要: A computer-implemented method for creating an auto-scaled predictive analytics model includes determining, via a processor, whether a queue size of a service master queue is greater than zero. Responsive to determining that the queue size is greater than zero, the processor fetches a count of requests in a plurality of requests in the service master queue, and a type for each of the requests. The processor derives a value for time required for each of the requests, and retrieves a number of available processing nodes based on the time required for each of the requests. The processor then auto-scales a processing node number responsive to determining that a total execution time for all of the requests in the plurality of requests exceeds a predetermined time value, and outputs an auto-scaled predictive analytics model based on the processing node number and queue size.