-
公开(公告)号:US20230053537A1
公开(公告)日:2023-02-23
申请号:US17819879
申请日:2022-08-15
Applicant: Xilinx, Inc.
Inventor: Baris Ozgul , David Clarke , Peter McColgan , Stephan Munz , Dylan Stuart , Pedro Miguel Parola Duarte , Juan J. Noguera Serra
Abstract: Using multiple overlays with a data processing array includes loading an application in a data processing array. The data processing array includes a plurality of compute tiles each having a processor. The application specifies kernels executable by the processors and implements stream channels that convey data to the plurality of compute tiles. During runtime of the application, a plurality of overlays are sequentially implemented in the data processing array. Each overlay implements a different mode of data movement in the data processing array via the stream channels. For each overlay implemented, a workload is performed by moving data to the plurality of compute tiles based on the respective mode of data movement.
-
公开(公告)号:US11567881B1
公开(公告)日:2023-01-31
申请号:US15944602
申请日:2018-04-03
Applicant: Xilinx, Inc.
Inventor: Goran H. K. Bilski , David Clarke , Baris Ozgul , Jan Langer , Juan J. Noguera Serra
Abstract: A device may include an array of data processing engines (DPEs) on a die and an event broadcast network. Each of the DPEs includes a core, a memory module, event logic in at least one of the core or the memory module, and an event broadcast circuitry coupled to the event logic. The event logic is capable of detecting an occurrence of one or more events in the core or the memory module. The event broadcast circuitry is capable of receiving an indication of a detected event detected by the event logic. The event broadcast network includes interconnections between the event broadcast circuitry of the DPEs. Detected events can trigger or initiate various responses, such as debugging, tracing, and profiling.
-
公开(公告)号:US11296707B1
公开(公告)日:2022-04-05
申请号:US17196574
申请日:2021-03-09
Applicant: Xilinx, Inc.
Inventor: Javier Cabezas Rodriguez , Juan J. Noguera Serra , David Clarke , Sneha Bhalchandra Date , Tim Tuan , Peter McColgan , Jan Langer , Baris Ozgul
IPC: H03K19/1776 , H03K19/17704 , H03K19/17768 , H03K19/17758 , H03K19/17796
Abstract: An integrated circuit can include a data processing engine (DPE) array having a plurality of tiles. The plurality of tiles can include a plurality of DPE tiles, wherein each DPE tile includes a stream switch, a core configured to perform operations, and a memory module. The plurality of tiles can include a plurality of memory tiles, wherein each memory tile includes a stream switch, a direct memory access (DMA) engine, and a random-access memory. The DMA engine of each memory tile may be configured to access the random-access memory within the same memory tile and the random-access memory of at least one other memory tile. Selected ones of the plurality of DPE tiles may be configured to access selected ones of the plurality of memory tiles via the stream switches.
-
公开(公告)号:US11443091B1
公开(公告)日:2022-09-13
申请号:US16945006
申请日:2020-07-31
Applicant: Xilinx, Inc.
Inventor: Peter McColgan , Baris Ozgul , David Clarke , Tim Tuan , Juan J. Noguera Serra , Goran H. K. Bilski , Jan Langer , Sneha Bhalchandra Date , Stephan Munz , Jose Marques
IPC: G06F30/343 , G06F9/30 , G06F30/398 , G06F30/33
Abstract: An integrated circuit includes a plurality of data processing engines (DPEs) DPEs. Each DPE may include a core configured to perform computations. A first DPE of the plurality of DPEs includes a first core coupled to an input cascade connection of the first core. The input cascade connection is directly coupled to a plurality of source cores of the plurality of DPEs. The input cascade connection includes a plurality of inputs, wherein each of the plurality of inputs is connected to a cascade output of a different one of the plurality of source cores. The input cascade connection is programmable to enable a selected one of the plurality of inputs.
-
公开(公告)号:US11372803B2
公开(公告)日:2022-06-28
申请号:US15944408
申请日:2018-04-03
Applicant: Xilinx, Inc.
Inventor: Goran H. K. Bilski , Juan J. Noguera Serra , Baris Ozgul , Jan Langer , David Clarke , Sneha Bhalchandra Date
IPC: G06F15/80 , G06F13/40 , G06F15/173 , G06F13/16
Abstract: An example data processing engine (DPE) for a DPE array in an integrated circuit (IC) includes: a core; a memory including a data memory and a program memory, the program memory coupled to the core, the data memory coupled to the core and including at least one connection to a respective at least one additional core external to the DPE; support circuitry including hardware synchronization circuitry and direct memory access (DMA) circuitry each coupled to the data memory; streaming interconnect coupled to the DMA circuitry and the core; and memory-mapped interconnect coupled to the core, the memory, and the support circuitry.
-
公开(公告)号:US11113223B1
公开(公告)日:2021-09-07
申请号:US15944490
申请日:2018-04-03
Applicant: Xilinx, Inc.
Inventor: Peter McColgan , Goran H K Bilski , Juan J. Noguera Serra , Jan Langer , Baris Ozgul , David Clarke
Abstract: Examples herein describe techniques for communicating between data processing engines in an array of data processing engines. In one embodiment, the array is a 2D array where each of the DPEs includes one or more cores. In addition to the cores, the data processing engines can include streaming interconnects which transmit streaming data using two different modes: circuit switching and packet switching. Circuit switching establishes reserved point-to-point communication paths between endpoints in the interconnect which routes data in a deterministic manner. Packet switching, in contrast, transmits streaming data that includes headers for routing data within the interconnect in a non-deterministic manner. In one embodiment, the streaming interconnects can have one or more ports configured to perform circuit switching and one or more ports configured to perform packet switching.
-
公开(公告)号:US12067406B2
公开(公告)日:2024-08-20
申请号:US17819879
申请日:2022-08-15
Applicant: Xilinx, Inc.
Inventor: Baris Ozgul , David Clarke , Peter McColgan , Stephan Münz , Dylan Stuart , Pedro Miguel Parola Duarte , Juan J. Noguera Serra
CPC classification number: G06F9/44505 , G06F9/5083 , G06F13/1673 , G06F13/28 , G06F17/16 , G06N3/063
Abstract: Using multiple overlays with a data processing array includes loading an application in a data processing array. The data processing array includes a plurality of compute tiles each having a processor. The application specifies kernels executable by the processors and implements stream channels that convey data to the plurality of compute tiles. During runtime of the application, a plurality of overlays are sequentially implemented in the data processing array. Each overlay implements a different mode of data movement in the data processing array via the stream channels. For each overlay implemented, a workload is performed by moving data to the plurality of compute tiles based on the respective mode of data movement.
-
公开(公告)号:US11972132B2
公开(公告)日:2024-04-30
申请号:US18145810
申请日:2022-12-22
Applicant: Xilinx, Inc.
Inventor: Juan J. Noguera Serra , Goran H K Bilski , Jan Langer , Baris Ozgul , Richard L. Walke , Ralph D. Wittig , Kornelis A. Vissers , Tim Tuan , David Clarke
IPC: G06F3/06 , G06F13/16 , G06F15/173 , G06F15/78
CPC classification number: G06F3/0647 , G06F3/061 , G06F3/0683 , G06F13/1663 , G06F15/17331 , G06F15/7807
Abstract: A device includes a data processing engine array having a plurality of data processing engines organized in a grid having a plurality of rows and a plurality of columns. Each data processing engine includes a core, a memory module including a memory and a direct memory access engine. Each data processing engine includes a stream switch connected to the core, the direct memory access engine, and the stream switch of one or more adjacent data processing engines. Each memory module includes a first memory interface directly coupled to the core in the same data processing engine and one or more second memory interfaces directly coupled to the core of each of the one or more adjacent data processing engines.
-
公开(公告)号:US11730325B2
公开(公告)日:2023-08-22
申请号:US17468346
申请日:2021-09-07
Applicant: XILINX, INC.
Inventor: Peter McColgan , Goran Hk Bilski , Juan J. Noguera Serra , Jan Langer , Baris Ozgul , David Clarke
CPC classification number: A47K11/02 , E04H1/1216 , E04H15/38 , G06F13/4022 , Y02A50/30
Abstract: Examples herein describe techniques for communicating between data processing engines in an array of data processing engines. In one embodiment, the array is a 2D array where each of the DPEs includes one or more cores. In addition to the cores, the data processing engines can include streaming interconnects which transmit streaming data using two different modes: circuit switching and packet switching. Circuit switching establishes reserved point-to-point communication paths between endpoints in the interconnect which routes data in a deterministic manner. Packet switching, in contrast, transmits streaming data that includes headers for routing data within the interconnect in a non-deterministic manner. In one embodiment, the streaming interconnects can have one or more ports configured to perform circuit switching and one or more ports configured to perform packet switching.
-
公开(公告)号:US11520717B1
公开(公告)日:2022-12-06
申请号:US17196669
申请日:2021-03-09
Applicant: Xilinx, Inc.
Inventor: David Clarke , Peter McColgan , Zachary Dickman , Jose Marques , Juan J. Noguera Serra , Tim Tuan , Baris Ozgul , Jan Langer
Abstract: An integrated circuit having a data processing engine (DPE) array can include a plurality of memory tiles. A first memory tile can include a first direct memory access (DMA) engine, a first random-access memory (RAM) connected to the first DMA engine, and a first stream switch coupled to the first DMA engine. The first DMA engine may be coupled to a second RAM disposed in a second memory tile. The first stream switch may be coupled to a second stream switch disposed in the second memory tile.
-
-
-
-
-
-
-
-
-