METHODS AND APPARATUS TO FACILITATE TILE-BASED GPU MACHINE LEARNING ACCELERATION

    公开(公告)号:US20210240524A1

    公开(公告)日:2021-08-05

    申请号:US16779275

    申请日:2020-01-31

    Abstract: The present disclosure relates to methods and apparatus for machine learning processing. For example, disclosed techniques facilitate tile-based GPU machine learning acceleration. Aspects of the present disclosure can determine a tile size based on a memory size of a first memory and a job input size associated with executing a computational job. In some examples, the computational job may be one of a quantity of computational jobs configured to execute a machine learning primitive. Aspects of the present disclosure can also load, based on the tile size, input data associated with a batch of computational jobs from a second memory to the first memory. Further, aspects of the present disclosure can generate batch output data by executing the batch of computational jobs using the input data loaded to the first memory. Additionally, aspects of the present disclosure can store the generated batch output data to the second memory.

Patent Agency Ranking