DELAYED LINK COMPRESSION SCHEME
    1.
    发明申请

    公开(公告)号:US20190042496A1

    公开(公告)日:2019-02-07

    申请号:US16140472

    申请日:2018-09-24

    摘要: Apparatus, systems and methods for implementing delayed decompression schemes. As a burst of packets comprising compressed packets and uncompressed packets are received over an interconnect link, they are buffered in a receive buffer without decompression. Subsequently, the packets are forwarded from the receive buffer to a consumer such as processor core, with the compressed packets being decompressed prior to reaching the processor core. Under a first delayed decompression approach, packets are decompressed when they are read from the receive buffer in conjunction with forwarding the uncompressed packet (or uncompressed data contained therein) to the consumer. Under a second delayed decompression scheme, the packets are read from the receive buffer and forwarded to a decompressor using a first datapath width matching the width of the packets, decompressed, and then forwarded to the consumer using a second datapath width matching the width of the uncompressed data.

    Diffusion and cryptographic-related operations
    2.
    发明授权
    Diffusion and cryptographic-related operations 有权
    扩散和加密相关操作

    公开(公告)号:US08363828B2

    公开(公告)日:2013-01-29

    申请号:US12368196

    申请日:2009-02-09

    IPC分类号: G06F21/00

    摘要: An embodiment includes at least one processing unit to perform at least first and second sets of diffusion-related operations to produce a resulting block from a data block, and that includes at least one stage and at least one other stage. The at least one stage is to select one of first operands and second operands input to the at least one other stage. The first and second operands are respectively associated with the first and second sets of operations, respectively. The at least one other stage involves arithmetic and logical operations common to both the first and second sets of operations. At least one other processing unit is to perform at least one set of cryptographic-related operations (different, at least in part, from the first and second sets of operations) on at least one of (1) another block to produce the data block and (2) the resulting block.

    摘要翻译: 一个实施例包括至少一个处理单元,用于执行至少第一和第二组扩散相关操作以从数据块产生结果块,并且其包括至少一个阶段和至少一个其他阶段。 所述至少一个级是选择输入至少一个其他级的第一操作数和第二操作数之一。 第一和第二操作数分别分别与第一和第二组操作相关联。 所述至少一个其他阶段涉及对于第一和第二组操作共同的算术和逻辑运算。 至少一个其他处理单元将在(1)另一个块中的至少一个上执行至少一组密码相关操作(至少部分地不同于第一和第二组操作),以产生数据块 和(2)得到的块。

    LOW-LATENCY LINK COMPRESSION SCHEMES
    3.
    发明申请

    公开(公告)号:US20190045031A1

    公开(公告)日:2019-02-07

    申请号:US16014690

    申请日:2018-06-21

    IPC分类号: H04L29/06 H04L12/863

    摘要: Methods and apparatus for low-latency link compression schemes. Under the schemes, selected packets or messages are dynamically selected for compression in view of current transmit queue levels. The latency incurred during compression and decompression is not added to the data-path, but sits on the side of the transmit queue. The system monitors the queue depth and, accordingly, initiates compression jobs based on the depth. Different compression levels may be dynamically selected and used based on queue depth. Under various schemes, either packets or messages are enqueued in the transmit queue or pointers to such packets and messages are enqueued. Additionally, packets/message may be compressed prior to being enqueued, or after being enqueued, wherein an original uncompressed packet is replaced with a compressed packet. Compressed and uncompressed packets may be stored in queues or buffers and transmitted using a different numbers of transmit cycles based on their compression ratios. The schemes may be implemented to improve the effective bandwidth of various types of links, including serial links, bus-type links, and socket-to-socket links in multi-socket systems.

    DIFFUSION AND CRYPTOGRAPHIC-RELATED OPERATIONS
    4.
    发明申请
    DIFFUSION AND CRYPTOGRAPHIC-RELATED OPERATIONS 有权
    扩展和与CRIPTO图相关的操作

    公开(公告)号:US20100205455A1

    公开(公告)日:2010-08-12

    申请号:US12368196

    申请日:2009-02-09

    IPC分类号: H04L9/00

    摘要: An embodiment includes at least one processing unit to perform at least first and second sets of diffusion-related operations to produce a resulting block from a data block, and that includes at least one stage and at least one other stage. The at least one stage is to select one of first operands and second operands input to the at least one other stage. The first and second operands are respectively associated with the first and second sets of operations, respectively. The at least one other stage involves arithmetic and logical operations common to both the first and second sets of operations. At least one other processing unit is to perform at least one set of cryptographic-related operations (different, at least in part, from the first and second sets of operations) on at least one of (1) another block to produce the data block and (2) the resulting block.

    摘要翻译: 一个实施例包括至少一个处理单元,用于执行至少第一和第二组扩散相关操作以从数据块产生结果块,并且其包括至少一个阶段和至少一个其他阶段。 所述至少一个级是选择输入至少一个其他级的第一操作数和第二操作数之一。 第一和第二操作数分别分别与第一和第二组操作相关联。 所述至少一个其他阶段涉及对于第一和第二组操作共同的算术和逻辑运算。 至少一个其他处理单元将在(1)另一个块中的至少一个上执行至少一组密码相关操作(至少部分地不同于第一和第二组操作),以产生数据块 和(2)得到的块。

    APPARATUS AND METHOD FOR EFFICIENTLY EXECUTING BOOLEAN FUNCTIONS
    6.
    发明申请
    APPARATUS AND METHOD FOR EFFICIENTLY EXECUTING BOOLEAN FUNCTIONS 审中-公开
    有效执行布尔函数的装置和方法

    公开(公告)号:US20140095845A1

    公开(公告)日:2014-04-03

    申请号:US13631807

    申请日:2012-09-28

    IPC分类号: G06F9/30

    摘要: An apparatus and method are described for performing efficient Boolean operations in a pipelined processor which, in one embodiment, does not natively support three operand instructions. For example, a processor according to one embodiment of the invention comprises: a set of registers for storing packed operands; Boolean operation logic to execute a single instruction which uses three or more source operands packed in the set of registers, the Boolean operation logic to read at least three source operands and an immediate value to perform a Boolean operation on the three source operands, wherein the Boolean operation comprises: combining a bit read from each of the three operands to form an index to the immediate value, the index identifying a bit position within the immediate value; reading the bit from the identified bit position of the immediate value; and storing the bit from the identified bit position of the immediate value in a destination register.

    摘要翻译: 描述了一种用于在流水线处理器中执行有效的布尔运算的装置和方法,其在一个实施例中不本地支持三个操作数指令。 例如,根据本发明的一个实施例的处理器包括:一组用于存储打包操作数的寄存器; 用于执行单个指令的布尔运算逻辑,其使用打包在该组寄存器中的三个或更多个源操作数,布尔运算逻辑读取至少三个源操作数,并且立即值对三个源操作数执行布尔运算,其中, 布尔操作包括:组合从三个操作数中的每一个读取的位以形成立即值的索引,该索引标识立即值内的位位置; 从识别的位置读取该位从立即值; 并将来自所识别的立即值的比特位置的比特存储在目的地寄存器中。

    Factoring based modular exponentiation
    7.
    发明授权
    Factoring based modular exponentiation 有权
    基于分数的模幂运算

    公开(公告)号:US07961877B2

    公开(公告)日:2011-06-14

    申请号:US11610886

    申请日:2006-12-14

    CPC分类号: G06F7/723

    摘要: The present disclosure provides a system and method for performing modular exponentiation. The method may include dividing a first polynomial into a plurality of segments and generating a first product by multiplying the plurality of segments of the first polynomial with a second polynomial. The method may also include generating a second product by shifting the contents of an accumulator with a factorization base. The method may further include adding the first product and the second product to yield a first intermediate result and reducing the first intermediate result to yield a second intermediate result. The method may also include generating a public key based on, at least in part, the second intermediate result. Of course, many alternatives, variations and modifications are possible without departing from this embodiment.

    摘要翻译: 本公开提供了一种用于执行模幂运算的系统和方法。 该方法可以包括将第一多项式划分成多个段,并通过将第一多项式的多个段乘以第二多项式来生成第一乘积。 该方法还可以包括通过用因式分解基座移位累加器的内容来产生第二乘积。 该方法还可以包括添加第一产物和第二产物以产生第一中间结果并减少第一中间结果以产生第二中间结果。 该方法还可以包括至少部分地基于第二中间结果生成公钥。 当然,在不脱离本实施例的情况下,可以进行许多替代,变化和修改。

    RESIDUE GENERATION
    8.
    发明申请
    RESIDUE GENERATION 失效
    残留生成

    公开(公告)号:US20100153829A1

    公开(公告)日:2010-06-17

    申请号:US12336029

    申请日:2008-12-16

    IPC分类号: H03M13/09 G06F7/72 G06F11/10

    CPC分类号: G06F7/724 H03M13/091

    摘要: In one embodiment, circuitry is provided to generate a residue based at least in part upon operations and a data stream generated based at least in part upon a packet. The operations may include at least one iteration of at least one reduction operation including (a) multiplying a first value with at least one portion of the data stream, and (b) producing a reduction by adding at least one other portion of the data stream to a result of the multiplying. The operations may include at least one other reduction operation including (c) producing another result by multiplying with a second value at least one portion of another stream based at least in part upon the reduction, (d) producing a third value by adding at least one other portion of the another stream to the another result, and (e) producing the residue by performing a Barrett reduction based at least in part upon the third value.

    摘要翻译: 在一个实施例中,提供电路以至少部分地基于至少部分地基于分组产生的操作和数据流来生成残差。 操作可以包括至少一个缩减操作的迭代,包括(a)将第一值与数据流的至少一部分相乘,以及(b)通过添加数据流的至少一个其他部分来产生减少 是乘法的结果。 所述操作可以包括至少一个其它减少操作,其包括(c)至少部分地基于所述减少,通过与另一个流的至少一部分乘以第二值来产生另一结果,(d)通过至少加入来产生第三值 另一个流的另一部分到另一个结果,以及(e)至少部分地基于第三个值执行巴雷特还原来产生残留物。

    Apparatus and method for generating a Galois-field syndrome
    9.
    发明授权
    Apparatus and method for generating a Galois-field syndrome 失效
    用于产生伽罗瓦氏综合征的装置和方法

    公开(公告)号:US07607068B2

    公开(公告)日:2009-10-20

    申请号:US11469222

    申请日:2006-08-31

    IPC分类号: G11C29/00

    摘要: The present disclosure provides an apparatus and method for generating a Galois-field syndrome. One exemplary method may include loading a first data byte from a first storage device to a first register and loading a second data byte from a second storage device to a second register; ANDing the most significant bit (MSB) of the first data byte and a Galois-field polynomial to generate a first intermediate output; XORing each bit of the first intermediate output with the least significant bits (LSBs) of the first data byte to generate a second intermediate output; MUXing the second intermediate output with each bit of the first data byte to generate a third intermediate output; XORing each bit of the third intermediate output with each bit of the second data byte to generate at a fourth intermediate output; and generating a RAID Q syndrome based on, at least in part, the fourth intermediate output. Of course, many alternatives, variations and modifications are possible without departing from this embodiment.

    摘要翻译: 本公开提供了一种用于产生伽罗瓦域综合征的装置和方法。 一个示例性方法可以包括将第一数据字节从第一存储设备加载到第一寄存器,并将第二数据字节从第二存储设备加载到第二寄存器; 将第一数据字节的最高有效位(MSB)和伽罗瓦域多项式进行比较以产生第一中间输出; 用第一数据字节的最低有效位(LSB)对第一中间输出的每个位进行异或,以产生第二中间输出; 将第二中间输出与第一数据字节的每个位进行多路复用以产生第三中间​​输出; 将第三中间输出的每个位与第二数据字节的每个位进行异或,以在第四中间输出处产生; 以及至少部分地基于第四中间输出产生RAID Q综合征。 当然,在不脱离本实施例的情况下,可以进行许多替代,变化和修改。

    Carry/Borrow Handling
    10.
    发明申请
    Carry/Borrow Handling 审中-公开
    携带/借款处理

    公开(公告)号:US20080148011A1

    公开(公告)日:2008-06-19

    申请号:US11610897

    申请日:2006-12-14

    IPC分类号: G06F9/30

    CPC分类号: G06F9/3001

    摘要: The present disclosure provides a system and method for performing carry/borrow handling. A method according to one embodiment may include generating a first result having a first carry or borrow from a first mathematical operation and storing the first carry or borrow and a first pointer address in a temporary register. The method may further include generating a second result having a second carry or borrow from a second mathematical operation and calling a subroutine configured to perform carry and borrow handling. The method may also include copying the first pointer address from the temporary register into a global variable. Of course, many alternatives, variations and modifications are possible without departing from this embodiment.

    摘要翻译: 本公开提供了用于执行进位/借用处理的系统和方法。 根据一个实施例的方法可以包括从第一数学运算产生具有第一进位或借位的第一结果,并将第一进位或借位以及第一指针地址存储在临时寄存器中。 该方法还可以包括从第二数学运算产生具有第二进位或借位的第二结果,并调用被配置为执行进位和借位处理的子程序。 该方法还可以包括将第一指针地址从临时寄存器复制到全局变量中。 当然,在不脱离本实施例的情况下,可以进行许多替代,变化和修改。