专利检索 aee:"Institute of Computing Technology, Chinese Academy of Sciences" 第 1 页

1.

发明授权
Method and system for realizing FPGA server 有权

公开(公告)号：US11841733B2

公开(公告)日：2023-12-12

申请号：US17791511

申请日：2020-01-08

申请人： Institute of Computing Technology, Chinese Academy of Sciences

发明人： Ke Zhang , Yazhou Wang , Mingyu Chen , Yisong Chang , Ran Zhao , Yungang Bao

IPC分类号： G06F13/42

CPC分类号： G06F13/4282

摘要： A method and system for realizing a FPGA server, wherein centralized monitoring and managing all SoC FPGA compute nodes within the server by a motherboard, the motherboard comprising: a plurality of self-defined management interfaces for connecting the SoC FPGA compute nodes to supply power and data switch to the SoC FPGA compute nodes; a management network switch module for interconnecting the SoC FPGA compute nodes and supplying management; and a core control unit for managing the SoC FPGA compute nodes through the self-defined management interfaces and a self-defined management interface protocol, and acquiring operating parameters of the SoC FPGA compute nodes to manage and monitor the SoC FPGA compute nodes based on the management interface protocol.

2.

发明申请
METHOD AND SYSTEM FOR REALIZING FPGA SERVER 有权

公开(公告)号：US20230101208A1

公开(公告)日：2023-03-30

申请号：US17791511

申请日：2020-01-08

申请人： Institute of Computing Technology, Chinese Academy of Sciences

发明人： Ke ZHANG , Yazhou WANG , Mingyu CHEN , Yisong CHANG , Ran ZHAO , Yungang BAO

IPC分类号： G06F13/42

摘要： A method and system for realizing a FPGA server, wherein centralized monitoring and managing all SoC FPGA compute nodes within the server by a motherboard, the motherboard comprising: a plurality of self-defined management interfaces for connecting the SoC FPGA compute nodes to supply power and data switch to the SoC FPGA compute nodes; a management network switch module for interconnecting the SoC FPGA compute nodes and supplying management; and a core control unit for managing the SoC FPGA compute nodes through the self-defined management interfaces and a self-defined management interface protocol, and acquiring operating parameters of the SoC FPGA compute nodes to manage and monitor the SoC FPGA compute nodes based on the management interface protocol.

3.

发明授权
Fractal tree structure-based data transmit device and method, control device, and intelligent chip 有权

公开(公告)号：US11616662B2

公开(公告)日：2023-03-28

申请号：US17100570

申请日：2020-11-20

申请人： Institute of Computing Technology, Chinese Academy of Sciences

发明人： Jinhua Tao , Tao Luo , Shaoli Liu , Shijin Zhang , Yunji Chen

IPC分类号： H04L12/44 , H04L45/16 , H04L49/109

摘要： The present invention provides a fractal tree structure-based data transmit device and method, a control device, and an intelligent chip. The device comprises: a central node that is as a communication data center of a network-on-chip and used for broadcasting or multicasting communication data to a plurality of leaf nodes; the plurality of leaf nodes that are as communication data nodes of the network-on-chip and for transmitting the communication data to a central leaf node; and forwarder modules for connecting the central node with the plurality of leaf nodes and forwarding the communication data; the central node, the forwarder modules and the plurality of leaf nodes are connected in the fractal tree network structure, and the central node is directly connected to M the forwarder modules and/or leaf nodes, any the forwarder module is directly connected to M the next level forwarder modules and/or leaf nodes.

4.

发明申请
DATA PACKET CLASSIFICATION METHOD AND SYSTEM BASED ON CONVOLUTIONAL NEURAL NETWORK 有权

公开(公告)号：US20220374733A1

公开(公告)日：2022-11-24

申请号：US17761220

申请日：2019-12-27

申请人： INSTITUTE OF COMPUTING TECHNOLOGY, CHINESE ACADEMY OF SCIENCES

发明人： Gaogang XIE , Xinyi ZHANG , Penghao ZHANG

IPC分类号： G06N5/02 , G06K9/62 , G06N3/02

摘要： The disclosure provides a data packet classification method and system based on a convolutional neural network including merging each rule set in a training rule set to form a plurality of merging schemes, and determining an optimal merging scheme for each rule set in the training rule set on the basis of performance evaluation; converting a prefix combination distribution of each rule set in the training rule set and a target rule set into an image, and training a convolutional neural network model by taking the image and the corresponding optimal merging scheme as features; and classifying the target rule set on the basis of image similarity, and constructing a corresponding hash table for data packet classification.

5.

发明授权
Operation apparatus and method for acceleration chip for accelerating deep neural network algorithm 有权

公开(公告)号：US11488000B2

公开(公告)日：2022-11-01

申请号：US15770457

申请日：2016-06-17

申请人： Institute of Computing Technology, Chinese Academy of Sciences

发明人： Zhen Li , Shaoli Liu , Shijin Zhang , Tao Luo , Cheng Qian , Yunji Chen , Tianshi Chen

IPC分类号： G06N3/063 , G06N3/08 , G06N3/04 , G06F17/16

摘要： The present disclosure provides an operation apparatus and method for an acceleration chip for accelerating a deep neural network algorithm. The apparatus comprises: a vector addition processor module and a vector function value arithmetic unit and a vector multiplier-adder module wherein the three modules execute a programmable instruction, and interact with each other to calculate values of neurons and a network output result of a neural network, and a variation amount of a synaptic weight representing the interaction strength of the neurons on an input layer to the neurons on an output layer; and the three modules are all provided with an intermediate value storage region and perform read and write operations on a primary memory.

6.

发明申请
CONVOLUTIONAL NEURAL NETWORK COMPUTING METHOD AND SYSTEM BASED ON WEIGHT KNEADING 有权

公开(公告)号：US20210350214A1

公开(公告)日：2021-11-11

申请号：US17250892

申请日：2019-05-21

申请人： Institute of Computing Technology, Chinese Academy of Sciences

发明人： Xiaowei LI , Xin WEI , Hang LU

IPC分类号： G06N3/04 , G06F7/50 , G06F5/01 , H03M7/40

摘要： Disclosed embodiments relate to a convolutional neural network computing method and system based on weight kneading, comprising: arranging original weights in a computation sequence and aligning by bit to obtain a weight matrix, removing slack bits in the weight matrix, allowing essential bits in each column of the weight matrix to fill the vacancies according to the computation sequence to obtain an intermediate matrix, removing null rows in the intermediate matrix, obtain a kneading matrix, wherein each row of the kneading matrix serves as a kneading weight; obtaining positional information of the activation corresponding to each bit of the kneading weight; divides the kneading weight by bit into multiple weight segments, processing summation of the weight segments and the corresponding activations according to the positional information, and sending a processing result to an adder tree to obtain an output feature map by means of executing shift-and-add on the processing result.

7.

发明授权
Dynamic resources allocation method and system for guaranteeing tail latency SLO of latency-sensitive application 有权

公开(公告)号：US11977784B2

公开(公告)日：2024-05-07

申请号：US17908495

申请日：2020-07-06

申请人： Institute of Computing Technology, Chinese Academy of Sciences

发明人： Liuying Ma , Zhenqing Liu , Jin Xiong , Dejun Jiang

IPC分类号： G06F3/06

CPC分类号： G06F3/0659 , G06F3/0611 , G06F3/0622 , G06F3/067

摘要： The present invention proposes a dynamic resources allocation method and system for guaranteeing tail latency SLO of latency-sensitive applications. A plurality of request queues is created in a storage server node of a distributed storage system with different types of requests located in different queues, and thread groups are allocated to the request queues according to logical thread resources of the service node and target tail latency requirements, and thread resources are dynamically allocated in real time, and the thread group of each request queue is bound to physical CPU resources of the storage server node. The client sends an application's requests to the storage server node; the storage server node stores the request in a request queue corresponding to its type, uses the thread group allocated for the current queue to process the application's requests, and sends responses to the client.

8.

发明申请
SPLIT ACCUMULATOR FOR CONVOLUTIONAL NEURAL NETWORK ACCELERATOR 有权

公开(公告)号：US20210357735A1

公开(公告)日：2021-11-18

申请号：US17250890

申请日：2019-05-21

申请人： Institute of Computing Technology, Chinese Academy of Sciences

发明人： Xiaowei LI , Xin WEI , Hang LU

IPC分类号： G06N3/063 , G06N3/04

摘要： Disclosed embodiments relate to a split accumulator for a convolutional neural network accelerator, comprising: arranging original weights in a computation sequence and aligning by bit to obtain a weight matrix, removing slack bits in the weight matrix, allowing essential bits in each column of the weight matrix to fill the vacancies according to the computation sequence to obtain an intermediate matrix, removing null rows in the intermediate matrix, obtain a kneading matrix, wherein each row of the kneading matrix serves as a kneading weight; obtaining positional information of the activation corresponding to each bit of the kneading weight; divides the kneading weight by bit into multiple weight segments, processing summation of the weight segments and the corresponding activations according to the positional information, and sending a processing result to an adder tree to obtain an output feature map by means of executing shift-and-add on the processing result.

9.

发明申请
CONVOLUTIONAL NEURAL NETWORK ACCELERATOR 有权

公开(公告)号：US20210350204A1

公开(公告)日：2021-11-11

申请号：US17250889

申请日：2019-05-21

申请人： Institute of Computing Technology, Chinese Academy of Sciences

发明人： Xiaowei LI , Xin WEI , Hang LU

IPC分类号： G06N3/04

摘要： Disclosed embodiments relate to a convolutional neural network accelerator, comprising: arranging original weights in a computation sequence and aligning by bit to obtain a weight matrix, removing slack bits in the weight matrix, allowing essential bits in each column of the weight matrix to fill the vacancies according to the computation sequence to obtain an intermediate matrix, removing null rows in the intermediate matrix, obtain a kneading matrix, wherein each row of the kneading matrix serves as a kneading weight; obtaining positional information of the activation corresponding to each bit of the kneading weight; divides the kneading weight by bit into multiple weight segments, processing summation of the weight segments and the corresponding activations according to the positional information, and sending a processing result to an adder tree to obtain an output feature map by means of executing shift-and-add on the processing result.

10.

发明授权
Method for obtaining image reference block in a code of mode of fixed reference frame number 有权
标题翻译：用于在固定参考帧号的模式码中获取图像参考块的方法

公开(公告)号：US07974344B2

公开(公告)日：2011-07-05

申请号：US10584777

申请日：2004-07-08

申请人： Xiangyang Ji , Wen Gao , Siwei Ma , Debin Zhao , Yan Lu

发明人： Xiangyang Ji , Wen Gao , Siwei Ma , Debin Zhao , Yan Lu

IPC分类号： H04N7/12

CPC分类号： H04N19/577

摘要： A “rounding to zero” method can maintain the exact motion vector and can also be achieved by the method without division so as to improve the precision of calculating the motion vector, embody the motion of the object in video more factually, and obtain the more accurate motion vector prediction. Combining with the forward prediction coding and the backward prediction coding, the present invention realizes a new prediction coding mode, which can guarantee the high efficiency of coding in direct mode as well as is convenient for hardware realization, and gains the same effect as the conventional B frame coding.

摘要翻译： “舍入到零”方法可以保持精确的运动矢量，也可以通过不分割的方法来实现，以提高计算运动矢量的精度，更实际地体现视频中物体的运动，并获得更多准确的运动矢量预测。结合前向预测编码和后向预测编码，本发明实现了一种新的预测编码模式，可以保证直接模式下的编码效率高，便于硬件实现，并获得与常规编码相同的效果 B帧编码。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类