Data processing method and apparatus, and computing device

    公开(公告)号:US12124886B2

    公开(公告)日:2024-10-22

    申请号:US17232195

    申请日:2021-04-16

    IPC分类号: G06F9/50 G06F13/28

    摘要: This disclosure provides a data processing method, including: receiving, by a first computing device, a first packet sent by a second computing device, where the first computing device is configured to assist the second computing device in performing service processing, the first computing device is a computing device in a heterogeneous resource pool, the first computing device communicates with the second computing device through a network, the heterogeneous resource pool includes at least one first computing device, and the first packet includes an instruction used to request the first computing device to process to-be-processed data; processing, by the first computing device, the to-be-processed data based on the instruction; and sending, by the first computing device, a second packet to the second computing device, where the second packet includes a processing result of the to-be-processed data.

    Error detection and recovery when streaming data

    公开(公告)号:US12111721B2

    公开(公告)日:2024-10-08

    申请号:US18490675

    申请日:2023-10-19

    申请人: Apple Inc.

    IPC分类号: G06F11/07 G06F13/28

    CPC分类号: G06F11/0793 G06F13/28

    摘要: Systems, apparatuses, and methods for error detection and recovery when streaming data are described. A system includes one or more companion direct memory access (DMA) subsystems for transferring data. When an error is detected for a component of the companion DMA subsystem(s), the operations performed by the other components need to gracefully adapt to this error so that operations face only a minimal disruption. For example, while one or more consumers are still consuming a first frame, a companion router receives an indication of an error for a second frame, causing the companion router to send a router frame abort message to a route manager. In response, the route manager waits until the consumer(s) are consuming the second frame before sending them a frame abort message. The consumer(s) flush their pipeline and transition to an idle state waiting for a third frame after receiving the frame abort message.

    MATRIX COMPRESSION ACCELERATOR SYSTEM AND METHOD

    公开(公告)号:US20240333304A1

    公开(公告)日:2024-10-03

    申请号:US18738203

    申请日:2024-06-10

    摘要: A matrix compression/decompression accelerator (MCA) system/method that coordinates lossless data compression (LDC) and lossless data decompression (LDD) transfers between an external data memory (EDM) and a local data memory (LDM) is disclosed. The system implements LDC using a 2D-to-1D transformation of 2D uncompressed data blocks (2DU) within LDM to generate 1D uncompressed data blocks (1DU). The 1DU is then compressed to generate a 1D compressed superblock (CSB) in LDM. This LDM CSB may then be written to EDM with a reduced number of EDM bus cycles. The system implements LDD using decompression of CSB data retrieved from EDM to generate a 1D decompressed data block (1DD) in LDM. A 1D-to-2D transformation is then applied to the LDM 1DD to generate a 2D decompressed data block (2DD) in LDM. This 2DD may then be operated on by a matrix compute engine (MCE) using a variety of function operators.

    INPUT AND OUTPUT SPATIAL CROPPING OPERATIONS IN NEURAL PROCESSOR CIRCUITS

    公开(公告)号:US20240330217A1

    公开(公告)日:2024-10-03

    申请号:US18616772

    申请日:2024-03-26

    申请人: Apple Inc.

    IPC分类号: G06F13/28

    CPC分类号: G06F13/28 G06F2213/2806

    摘要: An SoC circuit includes a neural processor circuit coupled to a CPU. The neural processor circuit includes neural engines, a data processor DMA circuit, a system memory, and a data processor circuit. The CPU is configured to execute a compiler, which is in turn configured to determine to perform a mode of spatial cropping and the associated crop offset. The neural processor circuit is configured to support arbitrary cropping in the x and y dimensions. The compiler is configured to generate task descriptor(s), the task descriptor(s) distributed to components of the neural processor circuit. The data processor DMA circuit is configured to fetch and format data corresponding to the crop from a source to the buffer. The buffer is configured to realign the data according to the crop origin for broadcast to the neural engines. The neural engines is configured to perform a computation operation which uses the cropped data.

    DIRECT MEMORY ACCESS SYSTEM WITH READ REASSEMBLY CIRCUIT

    公开(公告)号:US20240330216A1

    公开(公告)日:2024-10-03

    申请号:US18193129

    申请日:2023-03-30

    申请人: Xilinx, Inc.

    IPC分类号: G06F13/28

    CPC分类号: G06F13/28 G06F2213/28

    摘要: A direct memory access (DMA) system includes a plurality of read circuits and a switch coupled to a plurality of data port controllers configured to communicate with one or more data processing systems. The DMA system includes a read scheduler circuit coupled to the plurality of read circuits and the switch. The read scheduler circuit is configured to receive read requests from the plurality of read circuits, request allocation of entries of a data memory for the read requests, and submit the read requests to the one more data processing systems via the switch. The DMA system includes a read reassembly circuit coupled to the plurality of read circuits, the switch, and the read scheduler circuit. The read reassembly circuit is configured to reorder read completion data received from the switch for the read requests and provide read completion data, as reordered, to the plurality of read circuits.