Patent search ap:("Xilinx Page Inc.") AND inv:"Jaideep Dastidar"

21.

发明授权
Cache coherent acceleration function virtualization with hierarchical partition hardware circuity in accelerator 有权

公开(公告)号：US11983575B2

公开(公告)日：2024-05-14

申请号：US17903084

申请日：2022-09-06

Applicant: XILINX, INC.

Inventor： Millind Mittal , Jaideep Dastidar

IPC: G06F9/50 , G06F9/38 , G06F9/455 , G06F12/0815

CPC classification number: G06F9/5061 , G06F9/3877 , G06F9/45545 , G06F9/45558 , G06F9/5044 , G06F12/0815 , G06F2009/4557

Abstract: The embodiments herein describe a virtualization framework for cache coherent accelerators where the framework incorporates a layered approach for accelerators in their interactions between a cache coherent protocol layer and the functions performed by the accelerator. In one embodiment, the virtualization framework includes a first layer containing the different instances of accelerator functions (AFs), a second layer containing accelerator function engines (AFE) in each of the AFs, and a third layer containing accelerator function threads (AFTs) in each of the AFEs. Partitioning the hardware circuitry using multiple layers in the virtualization framework allows the accelerator to be quickly re-provisioned in response to requests made by guest operation systems or virtual machines executing in a host. Further, using the layers to partition the hardware permits the host to re-provision sub-portions of the accelerator while the remaining portions of the accelerator continue to operate as normal.

22.

发明授权
Disaggregated switch control path with direct-attached dispatch 有权

公开(公告)号：US11386031B2

公开(公告)日：2022-07-12

申请号：US16894446

申请日：2020-06-05

Applicant: XILINX, INC.

Inventor： Millind Mittal , Jaideep Dastidar

IPC: G06F13/40 , G06F13/42

Abstract: Embodiments herein describe techniques for separating data transmitted between I/O functions in an integrated component and a host into separate data paths. In one embodiment, data packets are transmitted using a direct data path that bypasses a switch in the integrated component. In contrast, configuration packets (e.g., hot-swap, hot-add, hot-remove data, some types of descriptors, etc.) are transmitted to the switch which then forwards the configuration packets to their destination. The direct path for the data packets does not rely on switch connectivity (and its accompanying latency) to transport bandwidth sensitive traffic between the host and the I/O functions, and instead avoids (e.g., bypasses) the bandwidth, resource, store/forward, and latency properties of the switch. Meanwhile, the software compatibility attributes, such as hot plug attributes (which are not latency or bandwidth sensitive), continue to be supported by using the switch to provide a configuration data path.

23.

发明授权
Fine-grained multi-tenant cache management 有权

公开(公告)号：US11372769B1

公开(公告)日：2022-06-28

申请号：US16555138

申请日：2019-08-29

Applicant: XILINX, INC.

Inventor： Millind Mittal , Jaideep Dastidar

IPC: G06F12/0891 , G06F12/0815 , G06F9/50 , G06F3/06

Abstract: The embodiments herein describe a multi-tenant cache that implements fine-grained allocation of the entries within the cache. Each entry in the cache can be allocated to a particular tenant—i.e., fine-grained allocation—rather than having to assign all the entries in a way to a particular tenant. If the tenant does not currently need those entries (which can be tracked using counters), the entries can be invalidated (i.e., deallocated) and assigned to another tenant. Thus, fine-grained allocation provides a flexible allocation of entries in a hardware cache that permits an administrator to reserve any number of entries for a particular tenant, but also permit other tenants to use this bandwidth when the reserved entries are not currently needed by the tenant.

24.

发明授权
Delegated snoop protocol 有权

公开(公告)号：US11093394B1

公开(公告)日：2021-08-17

申请号：US16560435

申请日：2019-09-04

Applicant: XILINX, INC.

Inventor： Millind Mittal , Jaideep Dastidar

IPC: G06F12/0815

Abstract: An example Cache-Coherent Non-Uniform Memory Access (CC-NUMA) system includes: one or more fabric switches; a home agent coupled to the one or more fabric switches; first and second response agents coupled to the fabric switches; wherein the home agent is configured to send a delegated snoop message to the first response agent, the delegated snoop message instructing the first response agent to snoop the second response agent; wherein the first response agent is configured to snoop the second response agent in response to the delegated snoop message; and wherein the first and second response agents are configured to perform a cache-to-cache transfer during the snoop.

25.

发明授权
Machine learning model updates to ML accelerators 有权

公开(公告)号：US10817462B1

公开(公告)日：2020-10-27

申请号：US16396540

申请日：2019-04-26

Applicant: Xilinx, Inc.

Inventor： Jaideep Dastidar , Millind Mittal

IPC: G06F15/78 , H04L12/66 , G06N20/00 , G06F13/42 , G06F3/06 , G06F9/54

Abstract: Examples herein describe a peripheral I/O device with a hybrid gateway that permits the device to have both I/O and coherent domains. As a result, the compute resources in the coherent domain of the peripheral I/O device can communicate with the host in a similar manner as CPU-to-CPU communication in the host. The dual domains in the peripheral I/O device can be leveraged for machine learning (ML) applications. While an I/O device can be used as an ML accelerator, these accelerators previously only used an I/O domain. In the embodiments herein, compute resources can be split between the I/O domain and the coherent domain where a ML engine is in the I/O domain and a ML model is in the coherent domain. An advantage of doing so is that the ML model can be coherently updated using a reference ML model stored in the host.

26.

发明授权
Spatial distribution in a 3D data processing unit 有权

公开(公告)号：US11709790B2

公开(公告)日：2023-07-25

申请号：US17184456

申请日：2021-02-24

Applicant: XILINX, INC.

Inventor： Jaideep Dastidar

IPC: G06F13/42 , G06F21/60

CPC classification number: G06F13/4208 , G06F21/602

Abstract: The embodiments herein describe a 3D SmartNIC that spatially distributes compute, storage, or network functions in three dimensions using a plurality of layers. That is, unlike current SmartNIC that can perform acceleration functions in a 2D, a 3D Smart can distribute these functions across multiple stacked layers, where each layer can communicate directly or indirectly with the other layers.

27.

发明授权
Machine learning model updates to ML accelerators 有权

公开(公告)号：US11586578B1

公开(公告)日：2023-02-21

申请号：US17080642

申请日：2020-10-26

Applicant: XILINX, INC.

Inventor： Jaideep Dastidar

IPC: G06F15/78 , G06N20/00 , G06F9/54 , G06F13/42 , H04L12/66 , G06F3/06

Abstract: Examples herein describe a peripheral I/O device with a hybrid gateway that permits the device to have both I/O and coherent domains. As a result, the compute resources in the coherent domain of the peripheral I/O device can communicate with the host in a similar manner as CPU-to-CPU communication in the host. The dual domains in the peripheral I/O device can be leveraged for machine learning (ML) applications. While an I/O device can be used as an ML accelerator, these accelerators previously only used an I/O domain. In the embodiments herein, compute resources can be split between the I/O domain and the coherent domain where a ML engine is in the I/O domain and a ML model is in the coherent domain. An advantage of doing so is that the ML model can be coherently updated using a reference ML model stored in the host.

28.

发明授权
Hardware coherent computational expansion memory 有权

公开(公告)号：US11556344B2

公开(公告)日：2023-01-17

申请号：US17035484

申请日：2020-09-28

Applicant: XILINX, INC.

Inventor： Millind Mittal , Jaideep Dastidar

IPC: G06F9/38 , G06F9/30 , G06F9/54 , G06F12/0891

Abstract: Embodiments herein describe transferring ownership of data (e.g., cachelines or blocks of data comprising multiple cachelines) from a host to hardware in an I/O device. In one embodiment, the host and I/O device (e.g., an accelerator) are part of a cache-coherent system where ownership of data can be transferred from a home agent (HA) in the host to a local HA in the I/O device—e.g., a computational slave agent (CSA). That way, a function on the I/O device (e.g., an accelerator function) can request data from the local HA without these requests having to be sent to the host HA. Further, the accelerator function can indicate whether the local HA tracks the data on a cacheline-basis or by a data block (e.g., multiple cachelines). This provides flexibility that can reduce overhead from tracking the data, depending on the function's desired use of the data.

29.

发明授权
Logical transport over a fixed PCIE physical transport network 有权

公开(公告)号：US11477049B2

公开(公告)日：2022-10-18

申请号：US16053384

申请日：2018-08-02

Applicant: Xilinx, Inc.

Inventor： Millind Mittal , Kiran S. Puranik , Jaideep Dastidar

IPC: H04L12/54 , H04L69/22 , H04L12/70

Abstract: A method and a system for transparently overlaying a logical transport network over an existing physical transport network is disclosed. The system designates a virtual channel located in a first transaction layer of a network conforming to a first network protocol. The system assembles a transaction layer packet in a second logical transaction layer of a second network protocol that is also recognizable by the first transaction layer. The system transfers the transaction layer packet from the second transaction layer to the virtual channel. The system transmits the transaction layer packet over the first transaction layer using the designated virtual channel over the network.

30.

发明授权
Adaptive integrated programmable device platform 有权

公开(公告)号：US11063594B1

公开(公告)日：2021-07-13

申请号：US16872009

申请日：2020-05-11

Applicant: Xilinx, Inc.

Inventor： Sagheer Ahmad , Jaideep Dastidar , Brian C. Gaide , Juan J. Noguera Serra , Ian A. Swarbrick

IPC: H03K19/17728 , H03K19/17736 , H03K19/17704

Abstract: An integrated circuit (IC) includes a first interface configured for operation with a plurality of tenants implemented concurrently in the integrated circuit, wherein the plurality of tenants communicate with a host data processing system using the first interface. The IC includes a second interface configured for operation with the plurality of tenants, wherein the plurality of tenants communicate with one or more network nodes via a network using the second interface. The IC can include a programmable logic circuitry configured for operation with the plurality of tenants, wherein the programmable logic circuitry implements one or more hardware accelerated functions for the plurality of tenants and routes data between the first interface and the second interface. The first interface, the second interface, and the programmable logic circuitry are configured to provide isolation among the plurality of tenants.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification