Method and apparatus for performing table lookup
    31.
    发明授权
    Method and apparatus for performing table lookup 有权
    执行表查找的方法和装置

    公开(公告)号:US09058284B1

    公开(公告)日:2015-06-16

    申请号:US13422979

    申请日:2012-03-16

    IPC分类号: G06F12/10

    摘要: Method and apparatus for performing table lookup are disclosed. In one embodiment, the method includes providing a lookup table, where the lookup table includes a plurality of translation modes and each translation mode includes a corresponding translation table tree supporting a plurality of page sizes. The method further includes receiving a search request from a requester, determining a translation table tree for conducting the search request, determining a lookup sequence based on the translation table tree, generating a search output using the lookup sequence, and transmitting the search output to the requester. The plurality of translation modes includes a first set of page sizes for 32-bit operating system software and a second set of page sizes for 64-bit operating system software. The plurality of page sizes includes non-global pages, global pages, and both non-global and global pages.

    摘要翻译: 公开了用于执行表查找的方法和装置。 在一个实施例中,该方法包括提供查找表,其中查找表包括多个翻译模式,并且每个翻译模式包括支持多个页面大小的对应的翻译表格树。 该方法还包括从请求者接收搜索请求,确定用于进行搜索请求的翻译表格树,基于转换表格树确定查找序列,使用查找序列生成搜索输出,并将搜索输出发送到 请求者 多个翻译模式包括用于32位操作系统软件的第一组页面大小和用于64位操作系统软件的第二组页面大小。 多个页面大小包括非全局页面,全局页面以及非全局页面和全局页面。

    Trace based deallocation of entries in a versioning cache circuit
    32.
    发明授权
    Trace based deallocation of entries in a versioning cache circuit 有权
    版本缓存电路中的条目的基于跟踪的解除分配

    公开(公告)号:US08051247B1

    公开(公告)日:2011-11-01

    申请号:US12030846

    申请日:2008-02-13

    IPC分类号: G06F9/00 G06F13/00

    摘要: A circuit for tracking memory operations with trace-based execution is disclosed. Each trace includes a sequence of operations that includes zero or more of the memory operations. The memory operations being executed form a set of active memory operations that have a predefined program order among them. At least some of the active memory operations access the memory in an execution order that is different from the program order. Checkpoint entries are associated with each trace. Each entry refers to a checkpoint location. Executing one of the active memory operations updates a checkpoint location. During the operation of the circuit, none of the operations of a given trace has any effect on the execution unit's architectural state prior to committing that trace. Each trace becomes eligible for commitment after all operations in the trace complete executing. After the trace is committed, all of the checkpoint entries associated with the trace are invalidated.

    摘要翻译: 公开了一种用于跟踪基于跟踪的执行的存储器操作的电路。 每个跟踪包括包括零个或多个存储器操作的一系列操作。 正在执行的存储器操作形成一组在其间具有预定义的程序顺序的活动存储器操作。 至少一些主动存储器操作以与程序顺序不同的执行顺序访问存储器。 检查点条目与每个跟踪相关联。 每个条目指的是检查点位置。 执行其中一个活动内存操作更新检查点位置。 在电路运行期间,在提交跟踪之前,给定跟踪的任何操作都不会对执行单元的架构状态产生任何影响。 轨迹中的所有操作完成执行后,每个轨迹都将有资格获得承诺。 提交跟踪后,与跟踪相关联的所有检查点条目都将失效。

    Trace based rollback of a speculatively updated cache
    34.
    发明授权
    Trace based rollback of a speculatively updated cache 有权
    推测更新的缓存的基于跟踪的回滚

    公开(公告)号:US07877630B1

    公开(公告)日:2011-01-25

    申请号:US12030852

    申请日:2008-02-13

    IPC分类号: G06F11/00

    摘要: This invention includes a circuit for tracking memory operations with trace-based execution. Each trace includes a sequence of operations that includes zero or more of the memory operations. Traces execute atomically and become eligible for commitment after all the operations in the trace have executed. The memory operations being executed form a set of active memory operations that have a predefined program order among them and corresponding ordering constraints. At least some of the active memory operations access the memory in an execution order that is different from the program order. Checkpoint entries are associated with each trace. Each entry refers to a checkpoint location. Memory operation ordering entries correspond to each one of the active memory operations. Rollback requests result in overwriting the checkpoint locations associated with the selected trace as well as the checkpoint locations associated with traces that are younger than the selected trace.

    摘要翻译: 本发明包括用于跟踪基于跟踪的执行的存储器操作的电路。 每个跟踪包括包括零个或多个存储器操作的一系列操作。 跟踪中的所有操作执行完毕后,跟踪将以原子方式执行并成为合格的承诺。 正在执行的存储器操作形成一组活动存储器操作,它们之间具有预定义的程序顺序和相应的排序限制。 至少一些主动存储器操作以与程序顺序不同的执行顺序访问存储器。 检查点条目与每个跟踪相关联。 每个条目指的是检查点位置。 存储器操作排序条目对应于每个活动存储器操作。 回滚请求导致覆盖与所选跟踪相关联的检查点位置以及与所选跟踪较年轻的跟踪关联的检查点位置。

    Trace optimization via fusing operations of a target architecture operation set
    35.
    发明授权
    Trace optimization via fusing operations of a target architecture operation set 有权
    通过目标架构操作集的融合操作进行跟踪优化

    公开(公告)号:US07797517B1

    公开(公告)日:2010-09-14

    申请号:US11561274

    申请日:2006-11-17

    IPC分类号: G06F7/38 G06F9/00 G06F9/44

    摘要: Reference architecture instructions are translated into target architecture operations. Sequences of operations, in a predicted execution order in some embodiments, form traces. In some embodiments, a trace is based on a plurality of basic blocks. In some embodiments, a trace is committed or aborted as a single entity. Sequences of operations are optimized by fusing collections of operations; fused operations specify a same observable function as respective collections, but advantageously enable more efficient processing. In some embodiments, a collection comprises multiple register operations. Fusing a register operation with a branch operation in a trace forms a fused reg-op/branch operation. In some embodiments, branch instructions translate into assert operations. Fusing an assert operation with another operation forms a fused assert operation. In some embodiments, fused operations only set architectural state, such as high-order portions of registers, that is subsequently read before being written.

    摘要翻译: 参考架构指令被转换为目标架构操作。 在一些实施例中,以预测的执行顺序的操作顺序形成轨迹。 在一些实施例中,迹线基于多个基本块。 在一些实施例中,将跟踪作为单个实体提交或中止。 通过融合操作集合来优化操作顺序; 融合操作指定与相应集合相同的可观察函数,但有利地实现更有效的处理。 在一些实施例中,集合包括多个寄存器操作。 使用跟踪中的分支操作对寄存器操作进行融合,形成融合的操作/分支操作。 在一些实施例中,分支指令转换为断言操作。 将assert操作与另一个操作进行融合形成一个融合的assert操作。 在一些实施例中,融合操作仅设置架构状态,诸如寄存器的高阶部分,其随后在写入之前被读取。

    Executing functions determined via a collection of operations from translated instructions
    36.
    发明授权
    Executing functions determined via a collection of operations from translated instructions 有权
    通过经翻译的指令的操作集合确定的执行功能

    公开(公告)号:US07681019B1

    公开(公告)日:2010-03-16

    申请号:US11561287

    申请日:2006-11-17

    IPC分类号: G06F9/455

    CPC分类号: G06F9/4552

    摘要: Reference architecture instructions are translated into target architecture operations. In some embodiments, an execution unit of a processor executes a function determined from a collection of operations, the function specifying functionality based on instructions, the collection selected from operations translated from the instructions. In further embodiments, the function is specified as a fused operation. Sequences of operations are optimized by fusing collections of operations; fused operations specify a same observable function as respective collections, but advantageously enable more efficient processing. In some embodiments, a collection comprises multiple register operations. Sequences of operations, in a predicted execution order in some embodiments, form traces. In some embodiments, fusing operations requires setting only final architectural state, such as final flag state; intermediate architectural state is used implicitly in a fused operation. In some embodiments, fused operations only set architectural state, such as high-order portions of registers, that is subsequently read before being written.

    摘要翻译: 参考架构指令被转换为目标架构操作。 在一些实施例中,处理器的执行单元执行从操作集合确定的功能,基于指令的功能指定功能,从指令转换的操作中选择的集合。 在另外的实施例中,该功能被指定为融合操作。 通过融合操作集合来优化操作顺序; 融合操作指定与相应集合相同的可观察函数,但有利地实现更有效的处理。 在一些实施例中,集合包括多个寄存器操作。 在一些实施例中,以预测的执行顺序的操作顺序形成轨迹。 在一些实施例中,融合操作需要仅设置最终架构状态,例如最终标志状态; 中间架构状态被隐含地用于融合操作。 在一些实施例中,融合操作仅设置架构状态,诸如寄存器的高阶部分,其随后在写入之前被读取。

    Microprocessor modified to perform inverse discrete cosine transform
operations on a one-dimensional matrix of numbers within a minimal
number of instructions
    38.
    发明授权
    Microprocessor modified to perform inverse discrete cosine transform operations on a one-dimensional matrix of numbers within a minimal number of instructions 有权
    微处理器被修改为对最小数量的指令中的数字的一维矩阵执行逆离散余弦变换操作

    公开(公告)号:US6141673A

    公开(公告)日:2000-10-31

    申请号:US318671

    申请日:1999-05-25

    摘要: A multimedia extension unit (MEU) is provided for performing various multimedia-type operations. The MEU can be coupled either through a coprocessor bus or a local central processing unit (CPU) bus to a conventional processor. The MEU employs vector registers, a vector arithmetic logic unit (ALU), and an operand routing unit (ORU) to perform a maximum number of the multimedia operations within as few instruction cycles as possible. Complex algorithms are readily performed by arranging operands upon the vector ALU in accordance with the desired algorithm flowgraph. The ORU aligns the operands within partitioned slots or sub-slots of the vector registers using vector instructions unique to the MEU. At the output of the ORU, operand pairs from vector source or destination registers can be easily routed and combined at the vector ALU. The vector instructions employ special load/store instructions in combination with numerous operational instructions to carry out concurrent multimedia operations on the aligned operands.

    摘要翻译: 提供多媒体扩展单元(MEU)用于执行各种多媒体类型操作。 MEU可以通过协处理器总线或本地中央处理单元(CPU)总线耦合到常规处理器。 MEU采用向量寄存器,向量算术逻辑单元(ALU)和操作数路由单元(ORU),以尽可能少的指令周期执行最大数量的多媒体操作。 通过根据期望的算法流程图将操作数布置在向量ALU上来容易地执行复杂算法。 ORU使用MAU特有的向量指令对齐向量寄存器的分区插槽或子时隙内的操作数。 在ORU的输出端,矢量源或目标寄存器的操作数对可以很容易地在矢量ALU中路由和组合。 向量指令采用特殊的加载/存储指令与许多操作指令相结合,对对齐的操作数执行并发的多媒体操作。

    Broadcast messaging and acknowledgment messaging for power management in a multiprocessor system
    40.
    发明授权
    Broadcast messaging and acknowledgment messaging for power management in a multiprocessor system 有权
    多处理器系统中的电源管理的广播消息和确认消息

    公开(公告)号:US09213643B2

    公开(公告)日:2015-12-15

    申请号:US13799268

    申请日:2013-03-13

    摘要: Various aspects provide for implementing a cache coherence protocol. A system comprises at least one processing component and a centralized controller. The at least one processing component comprises a cache controller. The cache controller is configured to manage a cache memory associated with a processor. The centralized controller is configured to communicate with the cache controller based on a power state of the processor.

    摘要翻译: 各个方面提供了实现高速缓存一致性协议。 系统包括至少一个处理部件和集中控制器。 所述至少一个处理组件包括高速缓存控制器。 高速缓存控制器被配置为管理与处理器相关联的高速缓冲存储器。 集中控制器被配置为基于处理器的功率状态与高速缓存控制器进行通信。