Method and system for realizing FPGA server

    公开(公告)号:US11841733B2

    公开(公告)日:2023-12-12

    申请号:US17791511

    申请日:2020-01-08

    IPC分类号: G06F13/42

    CPC分类号: G06F13/4282

    摘要: A method and system for realizing a FPGA server, wherein centralized monitoring and managing all SoC FPGA compute nodes within the server by a motherboard, the motherboard comprising: a plurality of self-defined management interfaces for connecting the SoC FPGA compute nodes to supply power and data switch to the SoC FPGA compute nodes; a management network switch module for interconnecting the SoC FPGA compute nodes and supplying management; and a core control unit for managing the SoC FPGA compute nodes through the self-defined management interfaces and a self-defined management interface protocol, and acquiring operating parameters of the SoC FPGA compute nodes to manage and monitor the SoC FPGA compute nodes based on the management interface protocol.

    METHOD AND SYSTEM FOR REALIZING FPGA SERVER

    公开(公告)号:US20230101208A1

    公开(公告)日:2023-03-30

    申请号:US17791511

    申请日:2020-01-08

    IPC分类号: G06F13/42

    摘要: A method and system for realizing a FPGA server, wherein centralized monitoring and managing all SoC FPGA compute nodes within the server by a motherboard, the motherboard comprising: a plurality of self-defined management interfaces for connecting the SoC FPGA compute nodes to supply power and data switch to the SoC FPGA compute nodes; a management network switch module for interconnecting the SoC FPGA compute nodes and supplying management; and a core control unit for managing the SoC FPGA compute nodes through the self-defined management interfaces and a self-defined management interface protocol, and acquiring operating parameters of the SoC FPGA compute nodes to manage and monitor the SoC FPGA compute nodes based on the management interface protocol.

    Fractal tree structure-based data transmit device and method, control device, and intelligent chip

    公开(公告)号:US11616662B2

    公开(公告)日:2023-03-28

    申请号:US17100570

    申请日:2020-11-20

    摘要: The present invention provides a fractal tree structure-based data transmit device and method, a control device, and an intelligent chip. The device comprises: a central node that is as a communication data center of a network-on-chip and used for broadcasting or multicasting communication data to a plurality of leaf nodes; the plurality of leaf nodes that are as communication data nodes of the network-on-chip and for transmitting the communication data to a central leaf node; and forwarder modules for connecting the central node with the plurality of leaf nodes and forwarding the communication data; the central node, the forwarder modules and the plurality of leaf nodes are connected in the fractal tree network structure, and the central node is directly connected to M the forwarder modules and/or leaf nodes, any the forwarder module is directly connected to M the next level forwarder modules and/or leaf nodes.

    DATA PACKET CLASSIFICATION METHOD AND SYSTEM BASED ON CONVOLUTIONAL NEURAL NETWORK

    公开(公告)号:US20220374733A1

    公开(公告)日:2022-11-24

    申请号:US17761220

    申请日:2019-12-27

    IPC分类号: G06N5/02 G06K9/62 G06N3/02

    摘要: The disclosure provides a data packet classification method and system based on a convolutional neural network including merging each rule set in a training rule set to form a plurality of merging schemes, and determining an optimal merging scheme for each rule set in the training rule set on the basis of performance evaluation; converting a prefix combination distribution of each rule set in the training rule set and a target rule set into an image, and training a convolutional neural network model by taking the image and the corresponding optimal merging scheme as features; and classifying the target rule set on the basis of image similarity, and constructing a corresponding hash table for data packet classification.

    CONVOLUTIONAL NEURAL NETWORK COMPUTING METHOD AND SYSTEM BASED ON WEIGHT KNEADING

    公开(公告)号:US20210350214A1

    公开(公告)日:2021-11-11

    申请号:US17250892

    申请日:2019-05-21

    摘要: Disclosed embodiments relate to a convolutional neural network computing method and system based on weight kneading, comprising: arranging original weights in a computation sequence and aligning by bit to obtain a weight matrix, removing slack bits in the weight matrix, allowing essential bits in each column of the weight matrix to fill the vacancies according to the computation sequence to obtain an intermediate matrix, removing null rows in the intermediate matrix, obtain a kneading matrix, wherein each row of the kneading matrix serves as a kneading weight; obtaining positional information of the activation corresponding to each bit of the kneading weight; divides the kneading weight by bit into multiple weight segments, processing summation of the weight segments and the corresponding activations according to the positional information, and sending a processing result to an adder tree to obtain an output feature map by means of executing shift-and-add on the processing result.

    SPLIT ACCUMULATOR FOR CONVOLUTIONAL NEURAL NETWORK ACCELERATOR

    公开(公告)号:US20210357735A1

    公开(公告)日:2021-11-18

    申请号:US17250890

    申请日:2019-05-21

    IPC分类号: G06N3/063 G06N3/04

    摘要: Disclosed embodiments relate to a split accumulator for a convolutional neural network accelerator, comprising: arranging original weights in a computation sequence and aligning by bit to obtain a weight matrix, removing slack bits in the weight matrix, allowing essential bits in each column of the weight matrix to fill the vacancies according to the computation sequence to obtain an intermediate matrix, removing null rows in the intermediate matrix, obtain a kneading matrix, wherein each row of the kneading matrix serves as a kneading weight; obtaining positional information of the activation corresponding to each bit of the kneading weight; divides the kneading weight by bit into multiple weight segments, processing summation of the weight segments and the corresponding activations according to the positional information, and sending a processing result to an adder tree to obtain an output feature map by means of executing shift-and-add on the processing result.

    CONVOLUTIONAL NEURAL NETWORK ACCELERATOR

    公开(公告)号:US20210350204A1

    公开(公告)日:2021-11-11

    申请号:US17250889

    申请日:2019-05-21

    IPC分类号: G06N3/04

    摘要: Disclosed embodiments relate to a convolutional neural network accelerator, comprising: arranging original weights in a computation sequence and aligning by bit to obtain a weight matrix, removing slack bits in the weight matrix, allowing essential bits in each column of the weight matrix to fill the vacancies according to the computation sequence to obtain an intermediate matrix, removing null rows in the intermediate matrix, obtain a kneading matrix, wherein each row of the kneading matrix serves as a kneading weight; obtaining positional information of the activation corresponding to each bit of the kneading weight; divides the kneading weight by bit into multiple weight segments, processing summation of the weight segments and the corresponding activations according to the positional information, and sending a processing result to an adder tree to obtain an output feature map by means of executing shift-and-add on the processing result.

    Method for obtaining image reference block in a code of mode of fixed reference frame number
    10.
    发明授权
    Method for obtaining image reference block in a code of mode of fixed reference frame number 有权
    用于在固定参考帧号的模式码中获取图像参考块的方法

    公开(公告)号:US07974344B2

    公开(公告)日:2011-07-05

    申请号:US10584777

    申请日:2004-07-08

    IPC分类号: H04N7/12

    CPC分类号: H04N19/577

    摘要: A “rounding to zero” method can maintain the exact motion vector and can also be achieved by the method without division so as to improve the precision of calculating the motion vector, embody the motion of the object in video more factually, and obtain the more accurate motion vector prediction. Combining with the forward prediction coding and the backward prediction coding, the present invention realizes a new prediction coding mode, which can guarantee the high efficiency of coding in direct mode as well as is convenient for hardware realization, and gains the same effect as the conventional B frame coding.

    摘要翻译: “舍入到零”方法可以保持精确的运动矢量,也可以通过不分割的方法来实现,以提高计算运动矢量的精度,更实际地体现视频中物体的运动,并获得更多 准确的运动矢量预测。 结合前向预测编码和后向预测编码,本发明实现了一种新的预测编码模式,可以保证直接模式下的编码效率高,便于硬件实现,并获得与常规编码相同的效果 B帧编码。