Low Latency Clock Gating Scheme for Power Reduction in Bus Interconnects
    1.
    发明申请
    Low Latency Clock Gating Scheme for Power Reduction in Bus Interconnects 审中-公开
    总线互连功率降低的低延迟时钟门控方案

    公开(公告)号:US20130117593A1

    公开(公告)日:2013-05-09

    申请号:US13290250

    申请日:2011-11-07

    IPC分类号: G06F1/32

    摘要: A System-on-a-Chip (SoC) comprising a controller, an activity counter, a reference pattern detection logic, a master pattern detection logic, an arbiter, a comparator, a tracker circuit, a delay cell circuit, and a request mask circuit coupled to a bus. The bus is configured to support master control. The controller is configured to cause components to enter a low power state. The activity counter is configured to monitor activity. The detection logics are configured to operate on an activity based clock or always on clock. The arbiter is configured to select an initiator. The comparator is configured to compare the output of the detection logics. The tracker circuit is configured to track selection of components. The delay cell circuit is configured to store output of components. The request mask circuit is configured to prevent request to arbiter or any arbiter selected request made from a previous clock cycle.

    摘要翻译: 一种片上系统(SoC),包括控制器,活动计数器,参考模式检测逻辑,主模式检测逻辑,仲裁器,比较器,跟踪器电路,延迟单元电路和请求掩码 电路耦合到总线。 总线配置为支持主控制。 控制器被配置为使组件进入低功率状态。 活动计数器配置为监视活动。 检测逻辑被配置为在基于活动的时钟上操作或者始终处于时钟上。 仲裁器被配置为选择启动器。 比较器配置为比较检测逻辑的输出。 跟踪器电路被配置为跟踪组件的选择。 延迟单元电路被配置为存储组件的输出。 请求屏蔽电路被配置为防止对仲裁器的请求或从先前时钟周期进行的任何仲裁器选择的请求。

    Weakly ordered processing systems and methods
    4.
    发明授权
    Weakly ordered processing systems and methods 有权
    处理系统和方法薄弱

    公开(公告)号:US07921249B2

    公开(公告)日:2011-04-05

    申请号:US12561381

    申请日:2009-09-17

    IPC分类号: G06F13/00

    摘要: The disclosure is directed to a weakly-ordered processing system and method of executing memory barriers in weakly-ordered processing system. The processing system includes memory and a master device configured to issue memory access requests, including memory barriers, to the memory. The processing system also includes a slave device configured to provide the master device access to the memory, the slave device being further configured to produce a signal indicating that an ordering constraint imposed by a memory barrier issued by the master device will be enforced, the signal being produced before the execution of all memory access requests issued by the master device to the memory before the memory barrier.

    摘要翻译: 本公开涉及一种弱有序处理系统和在弱有序处理系统中执行存储障碍的方法。 处理系统包括存储器和被配置为向存储器发出存储器访问请求(包括存储器障碍)的主设备。 处理系统还包括被配置为提供主设备对存储器的访问的从设备,从设备还被配置为产生指示由主设备发布的存储器障碍施加的排序约束将被强制的信号,信号 在执行由主设备发出的所有存储器访问请求之前在存储器屏障之前被产生到存储器。

    Efficient execution of memory barrier bus commands with order constrained memory accesses
    5.
    发明授权
    Efficient execution of memory barrier bus commands with order constrained memory accesses 有权
    具有订单约束的存储器访问的存储器障碍总线命令的高效执行

    公开(公告)号:US07917676B2

    公开(公告)日:2011-03-29

    申请号:US11397287

    申请日:2006-04-04

    IPC分类号: G06F13/00

    摘要: The disclosure is directed to a weakly-ordered processing system and method of executing memory barriers in weakly-ordered processing system. The processing system includes memory and a master device configured to issue memory access requests, including memory barriers, to the memory. The processing system also includes a slave device configured to provide the master device access to the memory, the slave device being further configured to produce a signal indicating that an ordering constraint imposed by a memory barrier issued by the master device will be enforced, the signal being produced before the execution of all memory access requests issued by the master device to the memory before the memory barrier.

    摘要翻译: 本公开涉及一种弱有序处理系统和在弱有序处理系统中执行存储障碍的方法。 处理系统包括存储器和被配置为向存储器发出存储器访问请求(包括存储器障碍)的主设备。 处理系统还包括被配置为提供主设备对存储器的访问的从设备,从设备还被配置为产生指示由主设备发布的存储器障碍施加的排序约束将被强制的信号,信号 在执行由主设备发出的所有存储器访问请求之前在存储器屏障之前被产生到存储器。

    Dynamic cache coherency snooper presence with variable snoop latency
    6.
    发明授权
    Dynamic cache coherency snooper presence with variable snoop latency 有权
    动态缓存一致性snooper存在与可变侦听延迟

    公开(公告)号:US06985972B2

    公开(公告)日:2006-01-10

    申请号:US10264163

    申请日:2002-10-03

    IPC分类号: G06F13/28 G06F12/00

    摘要: A data processing system with a snooper that is capable of dynamically enabling and disabling its snooping capabilities (i.e., snoop detect and response). The snooper is connected to a bus controller via a plurality of interconnects, including a snooperPresent signal, a snoop response signal and a snoop detect signal. When the snooperPresent signal is asserted, subsequent snoop requests are sent to the snooper, and the snooper is polled for a snoop response. Each snooper is capable of responding at different times (i.e., each snooper operates with different snoop latencies). The bus controller individually tracks the snoop response received from each snooper with the snooperPresent signal enabled. Whenever the snooper wishes to deactivate its snooping capabilities/operations, the snooper de-asserts the snooperPresent signal. The bus controller recognizes this as an indication that the snooper is unavailable. Thus, when the bus controller broadcasts subsequent snoop requests, the bus controller does not send the snoop request to the snooper.

    摘要翻译: 具有能够动态地启用和禁用其窥探能力(即,窥探检测和响应)的窥探者的数据处理系统。 窥探者通过多个互连连接到总线控制器,包括窥探信号,窥探响应信号和窥探检测信号。 当snooperPresent信号被断言时,后续的窥探请求被发送到snooper,并且窥探者被轮询以进行侦听响应。 每个窥探者都能够在不同的时间进行响应(即,每个窥探者使用不同的侦听延迟进行操作)。 总线控制器单独跟踪snooperPresent信号启用时从每个窥探者接收的窥探响应。 只要窥探者希望取消其窥探能力/操作,窥探者将断言snooperPresent信号。 总线控制器将此识别为snooper不可用的指示。 因此,当总线控制器广播后续的窥探请求时,总线控制器不向窥探者发送窥探请求。

    Scalable on-chip bus performance monitoring synchronization mechanism and method of use
    7.
    发明授权
    Scalable on-chip bus performance monitoring synchronization mechanism and method of use 有权
    可扩展的片上总线性能监控同步机制和使用方法

    公开(公告)号:US06857029B2

    公开(公告)日:2005-02-15

    申请号:US10137084

    申请日:2002-04-30

    IPC分类号: G06F1/12 G06F13/00

    CPC分类号: G06F1/12

    摘要: A bus performance monitoring mechanism for systems on a chip (SOC) is disclosed. The system comprises a muxing logic adapted to be coupled to a plurality of master devices, a plurality of slave devices, a plurality of generic signals and a plurality of control signals. The monitoring mechanism includes a plurality of control registers coupled to the muxing logic to allow for the selection of master, slave, generic and pipeline stage events to be counted. Finally, the monitoring mechanism includes synchronizing logic coupled to the plurality of registers for providing and receiving synchronizing signals to and from the monitors coupled thereto to allow for scalability. The scalable on-chip bus performance monitoring system in accordance with the present invention performs on-chip bus monitoring within a SOC implementation, while eliminating the pitfalls as described above. Through a minimalistic design approach, scalability is easily accomplished through the concept of using multiple monitor instances of these monitoring mechanisms within an SOC design while maintaining synchronization among them. Should an SOC design increase in size, scalability is achieved by simply adding additional monitor instance(s). The multiple monitor instances could then be connected in a “lego-like” fashion, allowing each to operate independently, or concurrently with one another via a scalable synchronization technique. For these designs where multiple monitor instances may be required, this enhances wireability by allowing the SOC designer to scatter the monitor instance locations virtually anywhere within the smaller areas of unused chip space, and simply wire the synchronization signals among the monitor instances to allow for synchronous operation.

    摘要翻译: 公开了一种用于芯片系统(SOC)的总线性能监视机制。 该系统包括适于耦合到多个主设备,多个从设备,多个通用信号和多个控制信号的多路复用逻辑。 监视机制包括耦合到多路复用逻辑的多个控制寄存器,以允许对主,从,通用和流水线级事件的选择进行计数。 最后,监视机制包括与多个寄存器耦合的同步逻辑,用于提供和接收与耦合到其的监视器同步信号以允许可扩展性。 根据本发明的可扩展的片上总线性能监视系统在SOC实现中执行片上总线监视,同时消除如上所述的陷阱。 通过简约的设计方法,可以通过在SOC设计中使用这些监视机制的多个监视器实例,同时保持它们之间的同步来实现可扩展性。 如果SOC设计的大小增加,则通过简单地添加附加的监视器实例来实现可扩展性。 然后,多个监视器实例可以以“lego-like”的方式连接,允许每个监视器实例通过可伸缩的同步技术独立地或彼此并行操作。 对于可能需要多个监视器实例的这些设计,通过允许SOC设计人员将监视器实例位置实际上分散在未使用的芯片空间的较小区域内的任何位置,从而提高了可线性,并且简单地将监视器实例之间的同步信号连接起来以允许同步 操作。

    Bus clock frequency scaling for a bus interconnect and related devices, systems, and methods
    8.
    发明授权
    Bus clock frequency scaling for a bus interconnect and related devices, systems, and methods 有权
    总线互连及相关设备,系统和方法的总线时钟频率缩放

    公开(公告)号:US09286257B2

    公开(公告)日:2016-03-15

    申请号:US13015657

    申请日:2011-01-28

    摘要: Bus clock frequency scaling for a bus interconnect and related devices, systems, and methods are disclosed. In one embodiment, the bus interconnect comprises an interconnect network configurable to connect a master port(s) to a slave port(s). A bus interconnect clock signal clocks the interconnect network. The controller is configured to receive bandwidth information related to traffic communicated over the master port(s) and the slave port(s). The controller is further configured to scale (e.g., increase or decrease) the frequency of the bus interconnect clock signal if the bandwidth of the master port(s) and/or the slave port(s) meets respective bandwidth condition(s), and/or if the latency of the master port(s) meets a respective latency condition(s) for the master port(s). The master port(s) and/or slave port(s) can also be reconfigured in response to a change in frequency of the bus interconnect clock signal to optimize performance and conserve power.

    摘要翻译: 公布了总线互连及相关设备,系统和方法的总线时钟频率缩放。 在一个实施例中,总线互连包括可配置为将主端口连接到从端口的互连网络。 总线互连时钟信号对互连网络进行时钟。 控制器被配置为接收与通过主端口和从端口传送的业务相关的带宽信息。 如果主端口和/或从​​端口的带宽满足相应的带宽条件,则控制器进一步被配置为扩展(例如,增加或减少)总线互连时钟信号的频率,以及 /或如果主端口的延迟满足主端口的相应等待时间条件。 主端口和/或从​​端口也可以被重新配置以响应总线互连时钟信号的频率变化,以优化性能并节省功率。

    Efficient Execution of Memory Barrier Bus Commands
    10.
    发明申请
    Efficient Execution of Memory Barrier Bus Commands 有权
    高效执行内存障碍总线命令

    公开(公告)号:US20100005208A1

    公开(公告)日:2010-01-07

    申请号:US12561381

    申请日:2009-09-17

    IPC分类号: G06F13/00

    摘要: The disclosure is directed to a weakly-ordered processing system and method of executing memory barriers in weakly-ordered processing system. The processing system includes memory and a master device configured to issue memory access requests, including memory barriers, to the memory. The processing system also includes a slave device configured to provide the master device access to the memory, the slave device being further configured to produce a signal indicating that an ordering constraint imposed by a memory barrier issued by the master device will be enforced, the signal being produced before the execution of all memory access requests issued by the master device to the memory before the memory barrier.

    摘要翻译: 本公开涉及一种弱有序处理系统和在弱有序处理系统中执行存储障碍的方法。 处理系统包括存储器和被配置为向存储器发出存储器访问请求(包括存储器障碍)的主设备。 处理系统还包括被配置为提供主设备对存储器的访问的从设备,从设备还被配置为产生指示由主设备发布的存储器障碍施加的排序约束将被强制的信号,信号 在执行由主设备发出的所有存储器访问请求之前在存储器屏障之前被产生到存储器。