SUPER-TILING IN NEURAL NETWORK PROCESSING TO ENABLING ANALYTICS AT LOWER MEMORY SPEED

    公开(公告)号:US20200272892A1

    公开(公告)日:2020-08-27

    申请号:US16797871

    申请日:2020-02-21

    摘要: Techniques including receiving a first set of values for processing by a machine learning (ML) network, storing a first portion of the first set of values in an on-chip memory, processing the first portion of the first set of values in a first layer of the ML network to generate a second portion of a second set of values, overwriting the stored first portion with the generated second portion, processing the second portion in a second layer of the ML network to generate a third portion of a third set of values, storing the third portion, repeating the steps of storing the first portion, processing the first portion, overwriting the stored first portion, processing the second portion, and storing the third portion for a fourth portion of the first set of values until all portions of the first set of values are processed to generate the third set of values.

    RECONFIGURABLE EXECUTION OF MACHINE LEARNING NETWORKS

    公开(公告)号:US20230064481A1

    公开(公告)日:2023-03-02

    申请号:US17463341

    申请日:2021-08-31

    IPC分类号: G06N20/00 G06K9/62

    摘要: An electronic device, comprising one or more processors, wherein the one or more processors are configured to execute instructions causing the one or more processors to: receive a machine learning (ML) model and execution information associated with the ML model, wherein the execution information including first execution data indicating how to execute the ML model optimized based on a first performance criterion, and second execution data execution data indicating how to execute the ML model optimized based on a second performance criteria, the second performance criterion different from the first performance criteria; execute the ML model based on the first execution data; determine to execute the ML model based on the second execution data; and execute the ML model based on the second execution data.