专利检索 ap:"Sameer Kumar" 第 1 页

1.

发明授权
Message passing with queues and channels 失效
标题翻译：消息传递与队列和通道

公开(公告)号：US08543722B2

公开(公告)日：2013-09-24

申请号：US12750363

申请日：2010-03-30

申请人： Gabor J. Dozsa , Philip Heidelberger , Sameer Kumar , Joseph D. Ratterman , Burkhard Steinmacher-Burow , Robert W. Wisniewski

发明人： Gabor J. Dozsa , Philip Heidelberger , Sameer Kumar , Joseph D. Ratterman , Burkhard Steinmacher-Burow , Robert W. Wisniewski

IPC分类号： G06F15/16

CPC分类号： G06F9/546

摘要： In an embodiment, a send thread receives an identifier that identifies a destination node and a pointer to data. The send thread creates a first send request in response to the receipt of the identifier and the data pointer. The send thread selects a selected channel from among a plurality of channels. The selected channel comprises a selected hand-off queue and an identification of a selected message unit. Each of the channels identifies a different message unit. The selected hand-off queue is randomly accessible. If the selected hand-off queue contains an available entry, the send thread adds the first send request to the selected hand-off queue. If the selected hand-off queue does not contain an available entry, the send thread removes a second send request from the selected hand-off queue and sends the second send request to the selected message unit.

摘要翻译： 在一个实施例中，发送线程接收标识目的地节点的标识符和指向数据的指针。响应于接收到标识符和数据指针，发送线程创建第一个发送请求。发送线程从多个信道中选择所选择的信道。所选择的信道包括所选择的切换队列和所选择的消息单元的标识。每个通道标识不同的消息单元。所选择的切换队列是随机访问的。如果所选择的切换队列包含可用条目，则发送线程将第一个发送请求添加到所选择的切换队列。如果所选择的切换队列不包含可用条目，则发送线程从所选择的切换队列中移除第二发送请求，并将第二发送请求发送到所选择的消息单元。

2.

发明申请
MULTI-PETASCALE HIGHLY EFFICIENT PARALLEL SUPERCOMPUTER 有权
标题翻译：多层高效平行超级计算机

公开(公告)号：US20110219208A1

公开(公告)日：2011-09-08

申请号：US13004007

申请日：2011-01-10

申请人： Sameh Asaad , Ralph E. Bellofatto , Michael A. Blocksome , Matthias A. Blumrich , Peter Boyle , Jose R. Brunheroto , Dong Chen , Chen-Yong Cher , George L. Chiu , Norman Christ , Paul W. Coteus , Kristan D. Davis , Gabor J. Dozsa , Alexandre E. Eichenberger , Noel A. Eisley , Matthew R. Ellavsky , Kahn C. Evans , Bruce M. Fleischer , Thomas W. Fox , Alan Gara , Mark E. Giampapa , Thomas M. Gooding , Michael K. Gschwind , John A. Gunnels , Shawn A. Hall , Rudolf A. Haring , Philip Heidelberger , Todd A. Inglett , Brant L. Knudson , Gerard V. Kopcsay , Sameer Kumar , Amith R. Mamidala , James A. Marcella , Mark G. Megerian , Douglas R. Miller , Samuel J. Miller , Adam J. Muff , Michael B. Mundy , John K. O'Brien , Kathryn M. O'Brien , Martin Ohmacht , Jeffrey J. Parker , Ruth J. Poole , Joseph D. Ratterman , Valentina Salapura , David L. Satterfield , Robert M. Senger , Brian Smith , Burkhard Steinmacher-Burow , William M. Stockdell , Craig B. Stunkel , Krishnan Sugavanam , Yutaka Sugawara , Todd E. Takken , Barry M. Trager , James L. Van Oosten , Charles D. Wait , Robert E. Walkup , Alfred T. Watson , Robert W. Wisniewski , Peng Wu

发明人： Sameh Asaad , Ralph E. Bellofatto , Michael A. Blocksome , Matthias A. Blumrich , Peter Boyle , Jose R. Brunheroto , Dong Chen , Chen-Yong Cher , George L. Chiu , Norman Christ , Paul W. Coteus , Kristan D. Davis , Gabor J. Dozsa , Alexandre E. Eichenberger , Noel A. Eisley , Matthew R. Ellavsky , Kahn C. Evans , Bruce M. Fleischer , Thomas W. Fox , Alan Gara , Mark E. Giampapa , Thomas M. Gooding , Michael K. Gschwind , John A. Gunnels , Shawn A. Hall , Rudolf A. Haring , Philip Heidelberger , Todd A. Inglett , Brant L. Knudson , Gerard V. Kopcsay , Sameer Kumar , Amith R. Mamidala , James A. Marcella , Mark G. Megerian , Douglas R. Miller , Samuel J. Miller , Adam J. Muff , Michael B. Mundy , John K. O'Brien , Kathryn M. O'Brien , Martin Ohmacht , Jeffrey J. Parker , Ruth J. Poole , Joseph D. Ratterman , Valentina Salapura , David L. Satterfield , Robert M. Senger , Brian Smith , Burkhard Steinmacher-Burow , William M. Stockdell , Craig B. Stunkel , Krishnan Sugavanam , Yutaka Sugawara , Todd E. Takken , Barry M. Trager , James L. Van Oosten , Charles D. Wait , Robert E. Walkup , Alfred T. Watson , Robert W. Wisniewski , Peng Wu

IPC分类号： G06F15/76 , G06F9/06

CPC分类号： G06F13/287 , G06F9/06 , G06F9/3004 , G06F9/30047 , G06F9/3885 , G06F12/0811 , G06F12/0831 , G06F12/0862 , G06F12/0864 , G06F12/1027 , G06F15/17381 , G06F15/17387 , G06F15/76 , G06F15/8069 , G06F2212/1016 , G06F2212/602 , G06F2212/6022 , G06F2212/6024 , G06F2212/6032 , Y02D10/13 , Y02D10/14

摘要： A Multi-Petascale Highly Efficient Parallel Supercomputer of 100 petaOPS-scale computing, at decreased cost, power and footprint, and that allows for a maximum packaging density of processing nodes from an interconnect point of view. The Supercomputer exploits technological advances in VLSI that enables a computing model where many processors can be integrated into a single Application Specific Integrated Circuit (ASIC). Each ASIC computing node comprises a system-on-chip ASIC utilizing four or more processors integrated into one die, with each having full access to all system resources and enabling adaptive partitioning of the processors to functions such as compute or messaging I/O on an application by application basis, and preferably, enable adaptive partitioning of functions in accordance with various algorithmic phases within an application, or if I/O or other processors are underutilized, then can participate in computation or communication nodes are interconnected by a five dimensional torus network with DMA that optimally maximize the throughput of packet communications between nodes and minimize latency.

摘要翻译： 具有100 petaOPS规模计算的多Petascale高效并行超级计算机，其成本，功耗和占地面积都在降低，并且允许从互连角度来看处理节点的最大封装密度。超级计算机利用了VLSI的技术进步，实现了许多处理器可以集成到单个专用集成电路（ASIC）中的计算模型。每个ASIC计算节点包括利用集成到一个管芯中的四个或更多个处理器的片上系统ASIC，每个处理器具有对所有系统资源的完全访问，并且使得处理器能够对诸如计算或消息传递I / O 并且优选地，根据应用内的各种算法阶段实现功能的自适应分割，或者如果I / O或其他处理器未被充分利用，则可以参与计算或通信节点通过五维环面网络互连使用DMA来最大限度地最大化节点之间的分组通信的吞吐量并最小化等待时间。

3.

发明申请
STORE-OPERATE-COHERENCE-ON-VALUE 有权
标题翻译：存储操作相关值

公开(公告)号：US20110179229A1

公开(公告)日：2011-07-21

申请号：US12986652

申请日：2011-01-07

申请人： Dong Chen , Philip Heidelberger , Sameer Kumar , Martin Ohmacht , Burkhard Steinmacher-Burow

发明人： Dong Chen , Philip Heidelberger , Sameer Kumar , Martin Ohmacht , Burkhard Steinmacher-Burow

IPC分类号： G06F12/08

CPC分类号： G06F12/0815 , G06F9/30043 , G06F9/30072 , G06F9/30087 , G06F9/3834 , Y02D10/13

摘要： A system, method and computer program product for performing various store-operate instructions in a parallel computing environment that includes a plurality of processors and at least one cache memory device. A queue in the system receives, from a processor, a store-operate instruction that specifies under which condition a cache coherence operation is to be invoked. A hardware unit in the system runs the received store-operate instruction. The hardware unit evaluates whether a result of the running the received store-operate instruction satisfies the condition. The hardware unit invokes a cache coherence operation on a cache memory address associated with the received store-operate instruction if the result satisfies the condition. Otherwise, the hardware unit does not invoke the cache coherence operation on the cache memory device.

摘要翻译： 一种用于在包括多个处理器和至少一个高速缓冲存储器设备的并行计算环境中执行各种存储操作指令的系统，方法和计算机程序产品。系统中的队列从处理器接收存储操作指令，该指令指定在哪个条件下调用高速缓存一致性操作。系统中的硬件单元运行接收到的存储操作指令。硬件单元评估运行接收到的存储操作指令的结果是否满足条件。如果结果满足条件，则硬件单元调用与接收到的存储操作指令相关联的高速缓存存储器地址的高速缓存一致性操作。否则，硬件单元不会调用高速缓存存储器设备上的高速缓存一致性操作。

4.

发明授权
Method for route optimization with dual mobile IPv4 node in IPv6-only network 有权
标题翻译：仅在IPv6网络中使用双移动IPv4节点进行路由优化的方法

公开(公告)号：US07899055B2

公开(公告)日：2011-03-01

申请号：US11645671

申请日：2006-12-27

申请人： Kishore Mundra , Lakshmi Praba Gurusamy , Sameer Kumar , Ranjitsinh Udaysinh Wable

发明人： Kishore Mundra , Lakshmi Praba Gurusamy , Sameer Kumar , Ranjitsinh Udaysinh Wable

IPC分类号： H04L12/28 , H04L12/56

CPC分类号： H04W8/082 , H04W80/04 , H04W80/045

摘要： A method for route optimization with a dual mobile IPv4 node in an IPv6-only network is provided. The method includes the operations of: receiving a visited IPv6 address from a router when the dual mobile node is connected to the IPv6-only network; updating a home agent with the IPv6 address; deregistering a binding update with a correspondent node via the home agent; updating the correspondent node with an IPv6 address; checking the reachability of packets directly to the correspondent node using its IPv6 address; the mobile node starting sending, to the CN, data packets tunneled in an IPv6 packet once the reachability is verified; and the correspondent node sending tunneled data packets directly to an IPv6 address of the mobile node.

摘要翻译： 提供了一种仅在IPv6网络中使用双移动IPv4节点进行路由优化的方法。该方法包括以下操作：当双移动节点连接到仅IPv6网络时，从路由器接收到访问的IPv6地址; 使用IPv6地址更新归属代理; 通过归属代理注销与通信节点的绑定更新; 用IPv6地址更新对端节点; 使用其IPv6地址将数据包的可达性直接检查到对端节点; 移动节点开始向CN发送一旦IPv6包中隧道的数据包，一旦可达性得到验证，并且对端节点将隧道化数据分组直接发送到移动节点的IPv6地址。

5.

发明申请
Method of Fabrication of On-Chip Heat Pipes and Ancillary Heat Transfer Components 有权
标题翻译：片上热管和辅助传热部件的制造方法

公开(公告)号：US20090085197A1

公开(公告)日：2009-04-02

申请号：US11863477

申请日：2007-09-28

申请人： Sameer Kumar Ajmera , Phillip D. Matz , Stephan Grunow , Satyavolu Srinivas Papa Rao

发明人： Sameer Kumar Ajmera , Phillip D. Matz , Stephan Grunow , Satyavolu Srinivas Papa Rao

IPC分类号： H01L23/34

CPC分类号： H01L23/427 , H01L2924/0002 , H01L2924/00

摘要： The density of components in integrated circuits (ICs) is increasing with time. The density of heat generated by the components is similarly increasing. Maintaining the temperature of the components at reliable operating levels requires increased thermal transfer rates from the components to the IC package exterior. Dielectric materials used in interconnect regions have lower thermal conductivity than silicon dioxide. This invention comprises a heat pipe located in the interconnect region of an IC to transfer heat generated by components in the IC substrate to metal plugs located on the top surface of the IC, where the heat is easily conducted to the exterior of the IC package. Refinements such as a wicking liner or reticulated inner surface will increase the thermal transfer efficiency of the heat pipe. Strengthening elements in the interior of the heat pipe will provide robustness to mechanical stress during IC manufacture.

摘要翻译： 集成电路（IC）中组件的密度随时间而增加。组件产生的热密度同样增加。将组件的温度保持在可靠的操作水平，需要增加从组件到IC封装外部的热传递速率。互连区域中使用的介电材料的热导率低于二氧化硅。本发明包括位于IC的互连区域中的热管，用于将IC基板中的部件产生的热量转移到位于IC顶表面上的金属插头，其中热量易于传导到IC封装的外部。诸如芯吸衬垫或网状内表面的改进将增加热管的热传递效率。热管内部加强元件将为IC制造过程中的机械应力提供坚固耐用性。

6.

发明申请
MESSAGE PASSING WITH A LIMITED NUMBER OF DMA BYTE COUNTERS 失效
标题翻译：消息传递与有限数量的DMA字节计数器

公开(公告)号：US20090007141A1

公开(公告)日：2009-01-01

申请号：US11768813

申请日：2007-06-26

申请人： Michael Blocksome , Dong Chen , Mark E. Giampapa , Philip Heidelberger , Sameer Kumar , Jeffrey J. Parker

发明人： Michael Blocksome , Dong Chen , Mark E. Giampapa , Philip Heidelberger , Sameer Kumar , Jeffrey J. Parker

IPC分类号： G06F9/44

CPC分类号： G06F15/17356 , G06F9/546

摘要： A method for passing messages in a parallel computer system constructed as a plurality of compute nodes interconnected as a network where each compute node includes a DMA engine but includes only a limited number of byte counters for tracking a number of bytes that are sent or received by the DMA engine, where the byte counters may be used in shared counter or exclusive counter modes of operation. The method includes using rendezvous protocol, a source compute node deterministically sending a request to send (RTS) message with a single RTS descriptor using an exclusive injection counter to track both the RTS message and message data to be sent in association with the RTS message, to a destination compute node such that the RTS descriptor indicates to the destination compute node that the message data will be adaptively routed to the destination node. Using one DMA FIFO at the source compute node, the RTS descriptors are maintained for rendezvous messages destined for the destination compute node to ensure proper message data ordering thereat. Using a reception counter at a DMA engine, the destination compute node tracks reception of the RTS and associated message data and sends a clear to send (CTS) message to the source node in a rendezvous protocol form of a remote get to accept the RTS message and message data and processing the remote get (CTS) by the source compute node DMA engine to provide the message data to be sent.

摘要翻译： 一种在并行计算机系统中传送消息的方法，该并行计算机系统被构造为作为网络互连的多个计算节点，其中每个计算节点包括DMA引擎，但是仅包括有限数量的字节计数器，用于跟踪由 DMA引擎，其中可以在共享计数器或专用计数器操作模式中使用字节计数器。该方法包括使用会合协议，源计算节点使用专用注入计数器确定性地发送具有单个RTS描述符的请求（RTS）消息以跟踪要与RTS消息相关联地发送的RTS消息和消息数据，到目的地计算节点，使得RTS描述符向目标计算节点指示消息数据将自适应地路由到目的地节点。在源计算节点使用一个DMA FIFO，将为发往目的地计算节点的会合消息保留RTS描述符，以确保正确的消息数据顺序。在DMA引擎上使用接收计数器，目的地计算节点跟踪RTS和相关联的消息数据的接收，并以远程获取的会合协议形式向源节点发送明确发送（CTS）消息以接受RTS消息和消息数据，并由源计算节点DMA引擎处理远程获取（CTS）以提供要发送的消息数据。

7.

发明申请
ASYNCRONOUS BROADCAST FOR ORDERED DELIVERY BETWEEN COMPUTE NODES IN A PARALLEL COMPUTING SYSTEM WHERE PACKET HEADER SPACE IS LIMITED 失效
标题翻译：有限公司在并行计算系统中的计算机之间的订单交付，其中包装头空间有限公司

公开(公告)号：US20090003344A1

公开(公告)日：2009-01-01

申请号：US11768619

申请日：2007-06-26

申请人： Sameer Kumar

发明人： Sameer Kumar

IPC分类号： H04L12/56

CPC分类号： H04L12/1854

摘要： Disclosed is a mechanism on receiving processors in a parallel computing system for providing order to data packets received from a broadcast call and to distinguish data packets received at nodes from several incoming asynchronous broadcast messages where header space is limited. In the present invention, processors at lower leafs of a tree do not need to obtain a broadcast message by directly accessing the data in a root processor's buffer. Instead, each subsequent intermediate node's rank id information is squeezed into the software header of packet headers. In turn, the entire broadcast message is not transferred from the root processor to each processor in a communicator but instead is replicated on several intermediate nodes which then replicated the message to nodes in lower leafs. Hence, the intermediate compute nodes become “virtual root compute nodes” for the purpose of replicating the broadcast message to lower levels of a tree.

摘要翻译： 公开了一种用于在并行计算系统中接收处理器的机制，用于向从广播呼叫接收的数据分组提供命令，并且将节点处接收到的数据分组与限制报头空间的若干输入异步广播消息进行区分。在本发明中，树的较低叶处理器不需要通过直接访问根处理器的缓冲器中的数据而获得广播消息。相反，每个后续的中间节点的等级id信息被挤压到分组头部的软件头部中。反过来，整个广播消息不是从根处理器传送到通信器中的每个处理器，而是复制在几个中间节点上，然后将消息复制到较低叶中的节点。因此，为了将广播消息复制到树的较低级别，中间计算节点变为“虚拟根计算节点”。

8.

发明授权
Integration of pore sealing liner into dual-damascene methods and devices 有权
标题翻译：将密封衬垫整合到双镶嵌方法和装置中

公开(公告)号：US07338893B2

公开(公告)日：2008-03-04

申请号：US11286877

申请日：2005-11-23

申请人： Edward Raymond Engbrecht , Satyavolu Srinivas Papa Rao , Sameer Kumar Ajmera , Stephan Grunow

发明人： Edward Raymond Engbrecht , Satyavolu Srinivas Papa Rao , Sameer Kumar Ajmera , Stephan Grunow

IPC分类号： H01L21/4763

CPC分类号： H01L21/76831 , H01L21/76844

摘要： A device employs damascene layers with a pore sealing liner and includes a semiconductor body. A metal interconnect layer comprising a metal interconnect is formed over the semiconductor body. A dielectric layer is formed over the metal interconnect layer. A conductive trench feature and a conductive via feature are formed in the dielectric layer. A pore sealing liner is formed only along sidewall of the conductive via feature and along sidewalls and bottom surfaces of the conductive trench feature. The pore sealing liner is not substantially present along a bottom surface of the conductive via feature.

摘要翻译： 装置采用具有孔密封衬垫的镶嵌层，并且包括半导体本体。包括金属互连的金属互连层形成在半导体本体上。介电层形成在金属互连层上。导电沟槽特征和导电通孔特征形成在电介质层中。孔密封衬垫仅沿着导电通孔特征的侧壁并且沿着导电沟槽特征的侧壁和底表面形成。孔密封衬垫基本上不存在于导电通孔特征的底表面上。

9.

发明申请
Method of route optimization with dual mobile node in IPv4-only network 审中-公开
标题翻译：仅在IPv4网络中使用双移动节点的路由优化方法

公开(公告)号：US20070258424A1

公开(公告)日：2007-11-08

申请号：US11585912

申请日：2006-10-25

申请人： Ranjitsinh Wable , Lakshmi Gurusamy , Sameer Kumar , Kishore Mundra , Syam Madanapalli

发明人： Ranjitsinh Wable , Lakshmi Gurusamy , Sameer Kumar , Kishore Mundra , Syam Madanapalli

IPC分类号： H04Q7/24

CPC分类号： H04W8/082 , H04L45/52 , H04L2212/00 , H04W40/34 , H04W80/04 , H04W80/045

摘要： A method of achieving route optimization (RO) when a dual capable mobile Internet protocol version 6 (MIPv6) mobile node is connected with an IPv4-only network allows RO of packets to traverse a shorter route than the default one through the home agent (HA) using bidirectional tunneling, and leads to better bandwidth utilization. The method of RO with a dual MIPV6 node in an IPV4-only network includes updating the HA with an IPv4 address of the MN and deregistering a binding update (BU) with a corresponding node (CN) via the HA; informing the CN about its IPv4 address and receiving the CN's IPv4 address in reply; checking reachability of the CN in its IPv4 address using an IPv6-in-IPv4 tunnel; and sending and receiving Ipv6 data packets to/from the CN using a v4 tunnel.

摘要翻译： 当双重能力的移动互联网协议版本6（MIPv6）移动节点与仅IPv4网络连接时，实现路由优化（RO）的方法允许分组的RO穿过比默认路由优先于通过归属代理（HA ）使用双向隧道，并带来更好的带宽利用率。在仅IPV4网络中具有双MIPV6节点的RO的方法包括用MN的IPv4地址更新HA并经由HA通过对应节点（CN）去注销绑定更新（BU）; 向CN通知其IPv4地址并接收CN的IPv4地址; 使用IPv6-in-IPv4隧道检查CN在其IPv4地址中的可达性; 并使用v4隧道向/从CN发送和接收Ipv6数据包。

10.

发明申请
Method for route optimization with dual mobile IPV4 node in IPV6-only network 有权
标题翻译：在IPV6网络中使用双移动IPV4节点进行路由优化的方法

公开(公告)号：US20070160065A1

公开(公告)日：2007-07-12

申请号：US11645671

申请日：2006-12-27

申请人： Kishore Mundra , Lakshmi Gurusamy , Sameer Kumar , Wable Udaysinh

发明人： Kishore Mundra , Lakshmi Gurusamy , Sameer Kumar , Wable Udaysinh

IPC分类号： H04L12/56

CPC分类号： H04W8/082 , H04W80/04 , H04W80/045

摘要： A method for route optimization with a dual mobile IPv4 node in an IPv6-only network is provided. The method includes the operations of: receiving a visited IPv6 address from a router when the dual mobile node is connected to the IPv6-only network; updating a home agent with the IPv6 address; deregistering a binding update with a correspondent node via the home agent; updating the correspondent node with an IPv6 address; checking the reachability of packets directly to the correspondent node using its IPv6 address; the mobile node starting sending, to the CN, data packets tunneled in an IPv6 packet once the reachability is verified; and the correspondent node sending tunneled data packets directly to an IPv6 address of the mobile node.

摘要翻译： 提供了一种仅在IPv6网络中使用双移动IPv4节点进行路由优化的方法。该方法包括以下操作：当双移动节点连接到仅IPv6网络时，从路由器接收到访问的IPv6地址; 使用IPv6地址更新归属代理; 通过归属代理注销与通信节点的绑定更新; 用IPv6地址更新对端节点; 使用其IPv6地址将数据包的可达性直接检查到对端节点; 移动节点开始向CN发送一旦IPv6包中隧道的数据包，一旦可达性得到验证，并且对端节点将隧道化数据分组直接发送到移动节点的IPv6地址。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类