DEAD SURFACE INVALIDATION
    1.
    发明公开

    公开(公告)号:US20230206384A1

    公开(公告)日:2023-06-29

    申请号:US17563950

    申请日:2021-12-28

    CPC classification number: G06T1/60 G06F12/0891 G06T1/20 G06F2212/455

    Abstract: Systems, apparatuses, and methods for performing dead surface invalidation are disclosed. An application sends draw call commands to a graphics processing unit (GPU) via a driver, with the draw call commands rendering to surfaces. After it is determined that a given surface will no longer be accessed by subsequent draw calls, the application sends a surface invalidation command for the given surface to a command processor of the GPU. After the command processor receives the surface invalidation command, the command processor waits for a shader engine to send a draw call completion message for a last draw call to access the given surface. Once the command processor receives the draw call completion message, the command processor sends a surface invalidation command to a cache to invalidate cache lines for the given surface to free up space in the cache for other data.

    REDUCING VOLTAGE DROOP BY LIMITING ASSIGNMENT OF WORK BLOCKS TO COMPUTE CIRCUITS

    公开(公告)号:US20240320034A1

    公开(公告)日:2024-09-26

    申请号:US18189995

    申请日:2023-03-24

    CPC classification number: G06F9/4881 G06F9/4893 G06F9/5038

    Abstract: An apparatus and method for efficiently managing voltage droop among replicated compute circuits of an integrated circuit. In various implementations, an integrated circuit includes multiple, replicated compute circuits, each including circuitry to process tasks grouped into a work block. When a scheduling window has begun, the scheduler determines a value for a threshold number of idle compute circuits that can be simultaneously activated based on one or more of a number of active compute circuits, an operating clock frequency, a measured operating temperature, a number of pending work blocks, and an application identifier. If the scheduler determines that there is a count of idle compute circuits that is equal to or greater than the threshold number of idle compute circuits, then the scheduler limits the number of idle compute circuits that can be activated at one time to the threshold number.

    Mitigation Of Undershoot And Overshoot On A Power Rail

    公开(公告)号:US20250004516A1

    公开(公告)日:2025-01-02

    申请号:US18346070

    申请日:2023-06-30

    Abstract: An apparatus and method for efficiently managing voltage droop among replicated compute circuits of an integrated circuit. In various implementations, an integrated circuit includes multiple, replicated compute circuits, each with the circuitry of multiple lanes of execution. Control circuitry of the integrated circuit identifies, early in execution pipelines, groups of instructions to be executed by a corresponding compute circuit, and generates a total power consumption estimate for the groups. The control circuitry maintains N previous total power consumption estimates, and stores the N power consumption estimates in staging circuitry referred to as an “instruction history pipeline.” If any differences between total power consumption estimates of different stages of the instruction history pipeline exceeds a corresponding threshold, then the control circuitry reduces, late in the execution pipeline, the rate of instruction execution of computation lanes of a corresponding compute circuit.

    Dead surface invalidation
    4.
    发明授权

    公开(公告)号:US12033239B2

    公开(公告)日:2024-07-09

    申请号:US17563950

    申请日:2021-12-28

    CPC classification number: G06T1/60 G06F12/0891 G06T1/20 G06F2212/455

    Abstract: Systems, apparatuses, and methods for performing dead surface invalidation are disclosed. An application sends draw call commands to a graphics processing unit (GPU) via a driver, with the draw call commands rendering to surfaces. After it is determined that a given surface will no longer be accessed by subsequent draw calls, the application sends a surface invalidation command for the given surface to a command processor of the GPU. After the command processor receives the surface invalidation command, the command processor waits for a shader engine to send a draw call completion message for a last draw call to access the given surface. Once the command processor receives the draw call completion message, the command processor sends a surface invalidation command to a cache to invalidate cache lines for the given surface to free up space in the cache for other data.

    DYNAMIC VECTOR LANE BROADCASTING
    5.
    发明公开

    公开(公告)号:US20240085970A1

    公开(公告)日:2024-03-14

    申请号:US17932155

    申请日:2022-09-14

    CPC classification number: G06F1/3237 G06F9/4843

    Abstract: An apparatus and method for efficient power management of multiple integrated circuits. In various implementations, a computing system includes first partition and a second partition. The second partition includes video pre-processing circuitry that identifies regions of a video frame to be presented on a screen or monitor that don't change or regions that can have one or more of resolution and color accuracy be below a threshold. The first partition includes a parallel data processor with one or more compute units, each with multiple lanes of execution. Based on the identified regions, the first partition generates an execution mask indicating which lanes of the compute units are inactive. The parallel data processor copies result data from the active lanes to outputs of the inactive lanes.

Patent Agency Ranking