Multi-accelerator compute dispatch
    232.
    发明授权

    公开(公告)号:US11790590B2

    公开(公告)日:2023-10-17

    申请号:US17218421

    申请日:2021-03-31

    CPC classification number: G06T15/005 G06F9/545 G06T15/80

    Abstract: Techniques for executing computing work by a plurality of chiplets are provided. The techniques include assigning workgroups of a kernel dispatch packet to the chiplets; by each chiplet, executing the workgroups assigned to that chiplet; for each chiplet, upon completion of all workgroups assigned to that chiplet for the kernel dispatch packet, notifying the other chiplets of such completion; and upon completion of all workgroups of the kernel dispatch packet, notifying a client of such completion and proceeding to a subsequent kernel dispatch packet.

    Padded vectorization with compile time known masks

    公开(公告)号:US11789734B2

    公开(公告)日:2023-10-17

    申请号:US16537460

    申请日:2019-08-09

    Abstract: A computing system includes a processing unit and a memory storing instructions that, when executed by the processor, cause the processor to receive program source code in a compiler, identify in the program source code a set of operations for vectorizing, where each operation in the set of operations specifies a set of one or more operands, in response to identifying the set of operations, vectorize the set of operations by, based on the number of operations in the set of operations and a total number of lanes in a first vector register, generating a mask indicating a first unmasked lane and a first masked lane in the first vector register, based on the mask, generating a set of one or more instructions for loading into the first unmasked lane a first operand of a first operation of the set of operations, and loading the first operand into the first masked lane.

    Lid Carveouts for Processor Lighting
    234.
    发明公开

    公开(公告)号:US20230324967A1

    公开(公告)日:2023-10-12

    申请号:US17704862

    申请日:2022-03-25

    CPC classification number: G06F1/206

    Abstract: Package lids with carveouts configured to expose lights directly connected to an internal component of a processor are described. Lid carveouts are configured to precisely align and mechanically secure a cooling device to the package lid by receiving protrusions of the cooling device via a press fit connection, while maintaining visibility of lights directly connected to processor internal components when the cooling device is connected. Lid carveouts are further configured to expose one or more connectors disposed on a processor surface that supports its internal component. When contacted by corresponding connectors of an auxiliary device, such as a light not integrated into a processor package or a cooling device, the lid carveouts enable direct connections between the package’s internal components and the auxiliary device.

    Efficient and low latency memory access scheduling

    公开(公告)号:US11782640B2

    公开(公告)日:2023-10-10

    申请号:US17218703

    申请日:2021-03-31

    CPC classification number: G06F3/0659 G06F3/0611 G06F3/0653 G06F3/0679

    Abstract: A memory controller includes a command queue that receives and stores decoded memory commands and information related thereto including information indicating a type, a priority, an age, and a region of a memory system for a corresponding decoded memory command, and an arbiter coupled to the command queue and picks selected decoded memory commands among the decoded memory commands from the command queue for dispatch to the memory system by comparing the priority and the age for decoded memory commands having a first type. The arbiter detects when the command queue receives a decoded memory command of a second type opposite to said first type that accesses a first memory region of the memory system, and in response elevates at least one of the priority and the age of a decoded command of the first type that accesses the first memory region already stored in the command queue.

Patent Agency Ranking