-
公开(公告)号:US20230385373A1
公开(公告)日:2023-11-30
申请号:US18053524
申请日:2022-11-08
发明人: ZHAOHUI CHEN , XUANLE REN , YANHENG LU , JIANSONG ZHANG
CPC分类号: G06F17/156 , G06F7/727 , G06F7/722 , G06F7/76
摘要: The present application discloses a calculator and a method thereof. The calculator is configured to accelerate the number-theoretic transformation of a 2N-dimensional polynomial. The calculator includes a first coefficient memory, a second coefficient memory, a twiddle factor memory, a plurality of processing units and a data flow controller. In the odd-number rounds of coefficient computation operations, the processing units perform first calculation procedures to read coefficients from the first coefficient memory for modulo calculation, and perform first writing procedures to write output coefficients to the second coefficient memory. In even-number rounds of coefficient computation operations, the processing units performs second calculation procedures to read coefficients from the second coefficient memory for modulo calculations, and perform second writing procedures to write output coefficients to the first coefficient memory.
-
公开(公告)号:US20230367741A1
公开(公告)日:2023-11-16
申请号:US17937973
申请日:2022-10-04
发明人: HAORAN LI , FEI SUN , YUAN GAO , GUYUE HUANG , RUIGUANG ZHONG , CHEN ZHANG
CPC分类号: G06F15/82 , G06F9/3013
摘要: The present application discloses a GPU and a method of the same. The GPU includes: a plurality of streaming multiprocessor (SMs), each including: a plurality of streaming processors (SPs), each including a register, wherein each SP has a predetermined upper bound of warp number, and the register has a predetermined upper bound of register capacity; and a global dispatcher, including: a register occupancy status table, for recording the warp number and an occupancy status of the register of each SP of each SM; a TB (TB) dispatch module, for dispatching the TB to a first SM of the SMs according to a warp type classification table and the register occupancy status table; and a warp dispatch module, for dispatching a plurality of warps to the plurality of SPs of the first SM according to the warp type classification table and the register occupancy status table.
-
公开(公告)号:US20230367630A1
公开(公告)日:2023-11-16
申请号:US18055205
申请日:2022-11-14
CPC分类号: G06F9/4875 , G06F9/52 , G06F1/20
摘要: A stream multiprocessor, a GPU, and related methods are provided. The stream multiprocessor executes thread blocks. Each thread block includes warps. The stream multiprocessor includes stream processors and a local dispatcher. Each stream processor executes one or more warps. The local dispatcher includes a warp state table, a warp resource detection unit and a warp launching unit. The warp state table records dispatching states and processing states of warps of the thread blocks. The warp resource detection unit selects all the first warps of a first thread block and at least one second warp of a second thread block according to hardware resources available to the stream multiprocessor and hardware resources required for thread blocks. The warp launching unit dispatches the first warps to idle stream processors and at least one second warp to at least one idle stream processor.
-
公开(公告)号:US11689586B2
公开(公告)日:2023-06-27
申请号:US17737701
申请日:2022-05-05
发明人: Xiangjun Li
IPC分类号: H04L65/403 , H04L65/1069 , H04L65/1104
CPC分类号: H04L65/1069 , H04L65/1104 , H04L65/403
摘要: Communication method, apparatus, electronic device and storage medium are provided. The communication method is applied to a video server having an IP network connection with a telephone network node, and includes: receiving a first video call including an identification of a first dialing device from the telephone network node based on the IP network connection; determining a first video channel with the first dialing device and second video channels with a first IP terminal device and a second IP terminal device, wherein the first dialing device and the telephone network node are located in a telephone network, and the first IP terminal device, the second IP terminal device and the video server are located in an IP network; and performing channel combination on the first video channel and the second video channels to establish a multi-party video call between the first dialing device and multiple IP terminal devices.
-
公开(公告)号:US20230185749A1
公开(公告)日:2023-06-15
申请号:US17824814
申请日:2022-05-25
发明人: Liang HAN , ChengYuan WU , Guoyu ZHU , Rong ZHONG , Yang JIAO , Ye LU , Wei WU , Yunxiao ZOU , Li YIN
IPC分类号: G06F13/40
CPC分类号: G06F13/4022 , G06F13/4063 , G06F2213/0026
摘要: A system includes a high-bandwidth inter-chip network (ICN) that allows communication between neural network processing units (NPUs) in the system. For example, the ICN allows an NPU to communicate with other NPUs on the same compute node (server) and also with NPUs on other compute nodes (servers). Communication can be at the direct memory access (DMA) command level and at the finer-grained load/store instruction level. The ICN system and the programming model allows NPUs in the system to communicate without using a traditional network (e.g., Ethernet) that uses a relatively narrow and slow Peripheral Component Interconnect Express (PCIe) bus.
-
76.
公开(公告)号:US20220417324A1
公开(公告)日:2022-12-29
申请号:US17835911
申请日:2022-06-08
发明人: Wei HAN , Shuangcheng LI , Hongzhong ZHENG , Yawen ZHANG , Heng LIU , Dimin NIU
IPC分类号: H04L67/1097 , G06F16/901
摘要: Various embodiments of the present disclosure relate to a computer-implemented method, a system, and a storage medium, where a graph stored in a computing system is logically divided into subgraphs, the subgraphs are stored on different interconnected (or coupled) devices in the computing system, and nodes of the subgraphs include hub nodes connected to adjacent subgraphs. Each device stores attributes and node structure information of the hub nodes of the subgraphs into other devices, and software or hardware prefetch engine on the device prefetches attributes and node structure information associated with a sampled node. A prefetcher on a device interfacing with the interconnected (or coupled) devices may further prefetch attributes and node structure information of nodes of the subgraphs on other devices. A traffic monitor is provided on an interface device to monitor traffic. When the traffic is small, the interface device prefetches node attributes and node structure information.
-
公开(公告)号:US20220318592A1
公开(公告)日:2022-10-06
申请号:US17677927
申请日:2022-02-22
发明人: Tianchan GUAN , Yanhong WANG , Shuangchen LI , Heng LIU , Hongzhong ZHENG
摘要: A sampler for executing a graph neural network (GNN) model are disclosed. The sampler is configured to implement random sampling for neighbor nodes around a specified node of a GNN model, and performs: obtaining a quantity of neighbor nodes around the specified node and a target number of neighbor nodes to be sampled; dividing a range into a plurality of subranges based on the target number; generating random numbers; determining a plurality of integer values within the plurality of subranges based on the random numbers; determining index values of the target number of neighbor nodes to be sampled by matching index values of the neighbor nodes and the plurality of determined integer values; and writing the determined index values into an output buffer. The sampler provided in the present disclosure can uniformly sample the neighbor nodes around the specified node for the specified node.
-
公开(公告)号:US20220309249A1
公开(公告)日:2022-09-29
申请号:US17701455
申请日:2022-03-22
发明人: Xuming Lin , Zhongzhou Zhao , Shuiling He , Liming Pu , Ji Zhang
摘要: Data processing method, apparatus, electronic device, and computer storage medium are provided. The data processing method is used to generate a description information file related to a target object, and includes: obtaining a description framework and multiple types of multiple materials related to the target object, the description framework including attribute selection information corresponding to the target object; performing at least one type of processing on each material to obtain attribute information of the respective material, and the attribute information including an attribute level and an attribute content; selecting target materials whose attribute content and attribute level match the attribute selection information of the description framework; and generating a description information file according to the description framework and the target materials. The data processing method may automatically generate description information files.
-
公开(公告)号:US20220284990A1
公开(公告)日:2022-09-08
申请号:US17592924
申请日:2022-02-04
发明人: Ying Chi , Peiran Jiang
摘要: Prediction of an affinity between a drug and a target is disclosed. The drug and the target for interacting with the drug are acquired. An interaction is caused between the drug and the target to determine the compound. Topological structure graphs for labeling spatial characteristics of atoms in the compound are computed. An affinity between the drug and the target based on the topological structure graphs is determined.
-
公开(公告)号:US20240311196A1
公开(公告)日:2024-09-19
申请号:US18606882
申请日:2024-03-15
发明人: Haoran Li , Fei Xue , Ruiguang Zhong , Yuan Gao , Yuanwei Fang , Peiye Liu
IPC分类号: G06F9/50
CPC分类号: G06F9/5038 , G06F9/5044 , G06F2209/501
摘要: Some embodiments of this disclosure provide a method and an apparatus for generating a data flow policy. For example, a method for generating a data flow policy includes: obtaining a computational graph corresponding to a data processing task; generating an inter-stage data flow policy based on the computational graph and an execution cost; generating a plurality of intra-stage data flow policies corresponding to the plurality of pipeline stages based on the inter-stage data flow policy; and updating the execution cost based on the plurality of intra-stage data flow policies.
-
-
-
-
-
-
-
-
-