LOW-LATENCY SYSTEMS TO TRIGGER REMEDIAL ACTIONS IN DATA CENTERS BASED ON TELEMETRY DATA

    公开(公告)号:US20210232472A1

    公开(公告)日:2021-07-29

    申请号:US16773390

    申请日:2020-01-27

    Abstract: Systems and methods described herein reduce latency between the time at which telemetry data is collected in data center and the time at which a remedial action is triggered to address an event that can be predicted based on the telemetry data. Telemetry data is collected in a data center and used to create training data for a machine-learning model configured to predict events in the data center based on patterns in the telemetry data. The machine-learning model is stored at an edge appliance in the data center. Incoming telemetry data can be converted into an input instance that is input into the machine learning model. The machine-learning model generates an output score for the input instance. The output score provides information that indicates whether a remedial action should be taken in the data center to achieve a desired outcome. If a remedial action should be taken, the edge device sends a signal to trigger the remedial action within the data center.

    GENERATION OF EXECUTABLE FILES CORRESPONDING TO NEURAL NETWORK MODELS

    公开(公告)号:US20200242189A1

    公开(公告)日:2020-07-30

    申请号:US16260331

    申请日:2019-01-29

    Abstract: In an example, a neural network program corresponding to a neural network model is received. The neural network program includes matrices, vectors, and matrix-vector multiplication (MVM) operations. A computation graph corresponding to the neural network model is generated. The computation graph includes a plurality of nodes, each node representing a MVM operation, a matrix, or a vector. Further, a class model corresponding to the neural network model is populated with a data structure pointing to the computation graph. The computation graph is traversed based on the class model. Based on the traversal, the plurality of MVM operations are assigned to MVM units of a neural network accelerator. Each MVM unit can perform a MVM operation. Based on assignment of the plurality of MVM operations, an executable file is generated for execution by the neural network accelerator.

    Sparse matrix vector multiplication with a matrix vector multiplication unit

    公开(公告)号:US10726096B2

    公开(公告)日:2020-07-28

    申请号:US16159578

    申请日:2018-10-12

    Abstract: Systems and methods are provided for sparse matrix vector multiplication with a matrix vector multiplication unit. The method includes partitioning a sparse matrix of entries into a plurality of sub-matrices; mapping each of the sub-matrices to one of a plurality of respective matrix vector multiplication engines; partitioning an input vector into a plurality of sub-vectors; computing, via each matrix vector multiplication engine, a plurality of intermediate result vectors each resulting from a multiplication of one of the sub-matrices and one of the sub-vectors; for each set of rows of the sparse matrix, adding elementwise the intermediate result vectors to produce a plurality of result sub-vectors; and concatenating the result sub-vectors to form a result vector.

Patent Agency Ranking