CALCULATOR AND ASSOCIATED METHOD
    71.
    发明公开

    公开(公告)号:US20230385373A1

    公开(公告)日:2023-11-30

    申请号:US18053524

    申请日:2022-11-08

    IPC分类号: G06F17/15 G06F7/72 G06F7/76

    摘要: The present application discloses a calculator and a method thereof. The calculator is configured to accelerate the number-theoretic transformation of a 2N-dimensional polynomial. The calculator includes a first coefficient memory, a second coefficient memory, a twiddle factor memory, a plurality of processing units and a data flow controller. In the odd-number rounds of coefficient computation operations, the processing units perform first calculation procedures to read coefficients from the first coefficient memory for modulo calculation, and perform first writing procedures to write output coefficients to the second coefficient memory. In even-number rounds of coefficient computation operations, the processing units performs second calculation procedures to read coefficients from the second coefficient memory for modulo calculations, and perform second writing procedures to write output coefficients to the first coefficient memory.

    GPU AND METHOD OF THE SAME
    72.
    发明公开

    公开(公告)号:US20230367741A1

    公开(公告)日:2023-11-16

    申请号:US17937973

    申请日:2022-10-04

    IPC分类号: G06F15/82 G06F9/30

    CPC分类号: G06F15/82 G06F9/3013

    摘要: The present application discloses a GPU and a method of the same. The GPU includes: a plurality of streaming multiprocessor (SMs), each including: a plurality of streaming processors (SPs), each including a register, wherein each SP has a predetermined upper bound of warp number, and the register has a predetermined upper bound of register capacity; and a global dispatcher, including: a register occupancy status table, for recording the warp number and an occupancy status of the register of each SP of each SM; a TB (TB) dispatch module, for dispatching the TB to a first SM of the SMs according to a warp type classification table and the register occupancy status table; and a warp dispatch module, for dispatching a plurality of warps to the plurality of SPs of the first SM according to the warp type classification table and the register occupancy status table.

    STREAM MULTIPLEPROCESSOR, GPU, AND RELATED METHOD

    公开(公告)号:US20230367630A1

    公开(公告)日:2023-11-16

    申请号:US18055205

    申请日:2022-11-14

    IPC分类号: G06F9/48 G06F9/52 G06F1/20

    CPC分类号: G06F9/4875 G06F9/52 G06F1/20

    摘要: A stream multiprocessor, a GPU, and related methods are provided. The stream multiprocessor executes thread blocks. Each thread block includes warps. The stream multiprocessor includes stream processors and a local dispatcher. Each stream processor executes one or more warps. The local dispatcher includes a warp state table, a warp resource detection unit and a warp launching unit. The warp state table records dispatching states and processing states of warps of the thread blocks. The warp resource detection unit selects all the first warps of a first thread block and at least one second warp of a second thread block according to hardware resources available to the stream multiprocessor and hardware resources required for thread blocks. The warp launching unit dispatches the first warps to idle stream processors and at least one second warp to at least one idle stream processor.

    Communication method, apparatus, electronic device, and storage medium

    公开(公告)号:US11689586B2

    公开(公告)日:2023-06-27

    申请号:US17737701

    申请日:2022-05-05

    发明人: Xiangjun Li

    摘要: Communication method, apparatus, electronic device and storage medium are provided. The communication method is applied to a video server having an IP network connection with a telephone network node, and includes: receiving a first video call including an identification of a first dialing device from the telephone network node based on the IP network connection; determining a first video channel with the first dialing device and second video channels with a first IP terminal device and a second IP terminal device, wherein the first dialing device and the telephone network node are located in a telephone network, and the first IP terminal device, the second IP terminal device and the video server are located in an IP network; and performing channel combination on the first video channel and the second video channels to establish a multi-party video call between the first dialing device and multiple IP terminal devices.

    SYSTEMS AND METHODS FOR PARALLEL PROCESSING
    75.
    发明公开

    公开(公告)号:US20230185749A1

    公开(公告)日:2023-06-15

    申请号:US17824814

    申请日:2022-05-25

    IPC分类号: G06F13/40

    摘要: A system includes a high-bandwidth inter-chip network (ICN) that allows communication between neural network processing units (NPUs) in the system. For example, the ICN allows an NPU to communicate with other NPUs on the same compute node (server) and also with NPUs on other compute nodes (servers). Communication can be at the direct memory access (DMA) command level and at the finer-grained load/store instruction level. The ICN system and the programming model allows NPUs in the system to communicate without using a traditional network (e.g., Ethernet) that uses a relatively narrow and slow Peripheral Component Interconnect Express (PCIe) bus.

    COMPUTER-IMPLEMENTED METHOD, SYSTEM, AND STORAGE MEDIUM FOR PREFETCHING IN A DISTRIBUTED GRAPH ARCHITECTURE

    公开(公告)号:US20220417324A1

    公开(公告)日:2022-12-29

    申请号:US17835911

    申请日:2022-06-08

    IPC分类号: H04L67/1097 G06F16/901

    摘要: Various embodiments of the present disclosure relate to a computer-implemented method, a system, and a storage medium, where a graph stored in a computing system is logically divided into subgraphs, the subgraphs are stored on different interconnected (or coupled) devices in the computing system, and nodes of the subgraphs include hub nodes connected to adjacent subgraphs. Each device stores attributes and node structure information of the hub nodes of the subgraphs into other devices, and software or hardware prefetch engine on the device prefetches attributes and node structure information associated with a sampled node. A prefetcher on a device interfacing with the interconnected (or coupled) devices may further prefetch attributes and node structure information of nodes of the subgraphs on other devices. A traffic monitor is provided on an interface device to monitor traffic. When the traffic is small, the interface device prefetches node attributes and node structure information.

    SAMPLER AND APPARATUS FOR EXECUTING GRAPH NEURAL NETWORK MODEL

    公开(公告)号:US20220318592A1

    公开(公告)日:2022-10-06

    申请号:US17677927

    申请日:2022-02-22

    IPC分类号: G06N3/04 G06F7/58

    摘要: A sampler for executing a graph neural network (GNN) model are disclosed. The sampler is configured to implement random sampling for neighbor nodes around a specified node of a GNN model, and performs: obtaining a quantity of neighbor nodes around the specified node and a target number of neighbor nodes to be sampled; dividing a range into a plurality of subranges based on the target number; generating random numbers; determining a plurality of integer values within the plurality of subranges based on the random numbers; determining index values of the target number of neighbor nodes to be sampled by matching index values of the neighbor nodes and the plurality of determined integer values; and writing the determined index values into an output buffer. The sampler provided in the present disclosure can uniformly sample the neighbor nodes around the specified node for the specified node.

    Data Processing Method, Apparatus, Electronic Device, and Computer Storage Medium

    公开(公告)号:US20220309249A1

    公开(公告)日:2022-09-29

    申请号:US17701455

    申请日:2022-03-22

    摘要: Data processing method, apparatus, electronic device, and computer storage medium are provided. The data processing method is used to generate a description information file related to a target object, and includes: obtaining a description framework and multiple types of multiple materials related to the target object, the description framework including attribute selection information corresponding to the target object; performing at least one type of processing on each material to obtain attribute information of the respective material, and the attribute information including an attribute level and an attribute content; selecting target materials whose attribute content and attribute level match the attribute selection information of the description framework; and generating a description information file according to the description framework and the target materials. The data processing method may automatically generate description information files.

    METHOD AND SYSTEM FOR PREDICTING AFFINITY BETWEEN DRUG AND TARGET

    公开(公告)号:US20220284990A1

    公开(公告)日:2022-09-08

    申请号:US17592924

    申请日:2022-02-04

    发明人: Ying Chi Peiran Jiang

    IPC分类号: G16C20/30 G16C20/50 G16C20/70

    摘要: Prediction of an affinity between a drug and a target is disclosed. The drug and the target for interacting with the drug are acquired. An interaction is caused between the drug and the target to determine the compound. Topological structure graphs for labeling spatial characteristics of atoms in the compound are computed. An affinity between the drug and the target based on the topological structure graphs is determined.

    METHOD AND DEVICE FOR GENERATING DATA FLOW POLICY

    公开(公告)号:US20240311196A1

    公开(公告)日:2024-09-19

    申请号:US18606882

    申请日:2024-03-15

    IPC分类号: G06F9/50

    摘要: Some embodiments of this disclosure provide a method and an apparatus for generating a data flow policy. For example, a method for generating a data flow policy includes: obtaining a computational graph corresponding to a data processing task; generating an inter-stage data flow policy based on the computational graph and an execution cost; generating a plurality of intra-stage data flow policies corresponding to the plurality of pipeline stages based on the inter-stage data flow policy; and updating the execution cost based on the plurality of intra-stage data flow policies.