-
公开(公告)号:US11157409B2
公开(公告)日:2021-10-26
申请号:US16717868
申请日:2019-12-17
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Derek E. Williams , Guy L. Guthrie , Hugh Shen , Luke Murray
IPC: G06F12/08 , G06F12/0817 , G06F12/0891 , G06F12/0842
Abstract: A cache memory includes a data array, a directory of contents of the data array that specifies coherence state information, and snoop logic that processes operations snooped from a system fabric by reference to the data array and the directory. The snoop logic, responsive to snooping on the system fabric a request of a first flush/clean memory access operation that specifies a target address, determines whether or not the cache memory has coherence ownership of the target address. Based on determining the cache memory has coherence ownership of the target address, the snoop logic services the request and thereafter enters a referee mode. While in the referee mode, the snoop logic protects a memory block identified by the target address against conflicting memory access requests by the plurality of processor cores until conclusion of a second flush/clean memory access operation that specifies the target address.
-
2.
公开(公告)号:US11573902B1
公开(公告)日:2023-02-07
申请号:US17405713
申请日:2021-08-18
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Hugh Shen , Guy L. Guthrie , Jeffrey A. Stuecheli , Luke Murray , Alexander Michael Taft , Bernard C. Drerup , Derek E. Williams
IPC: G06F12/08 , G06F3/06 , G06F12/0862 , G06F12/0811 , G06F12/0817
Abstract: A coherent data processing system includes a system fabric communicatively coupling a plurality of coherence participants and fabric control logic. The fabric control logic quantifies congestion on the system fabric based on coherence messages associated with commands issued on the system fabric. Based on the congestion on the system fabric, the fabric control logic determines a rate of request issuance applicable to a set of coherence participants among the plurality of coherence participants. The fabric control logic issues at least one rate command to set a rate of request issuance to the system fabric of the set of coherence participants.
-
公开(公告)号:US11354243B1
公开(公告)日:2022-06-07
申请号:US16950511
申请日:2020-11-17
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Derek E. Williams , Guy L. Guthrie , Hugh Shen , Sanjeev Ghai , Luke Murray
IPC: G06F12/08 , G06F13/40 , G06F9/30 , G06F12/0815 , G06F30/327 , G06F30/33
Abstract: A processing unit for a data processing system includes a processor core that issues memory access requests and a cache memory coupled to the processor core. The cache memory includes a reservation circuit that tracks reservations established by the processor core via load-reserve requests and a plurality of read-claim (RC) state machines for servicing memory access requests of the processor core. The cache memory, responsive to receipt from the processor core of a store-conditional request specifying a store target address, allocates an RC state machine among the plurality of RC state machines to process the store-conditional request and transfers responsibility for tracking a reservation for the store target address from the reservation circuit to the RC state machine.
-
公开(公告)号:US11163700B1
公开(公告)日:2021-11-02
申请号:US16862785
申请日:2020-04-30
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Derek E. Williams , Guy L. Guthrie , Hugh Shen , Luke Murray
IPC: G06F12/128
Abstract: An upper level cache receives from an associated processor core a plurality of memory access requests including at least first and second memory access requests of differing first and second classes. Based on class histories associated with the first and second classes of memory access requests, the upper level cache initiates, on the system interconnect fabric, a first interconnect transaction corresponding to the first memory access request without first issuing the first memory access request to the lower level cache via a private communication channel between the upper level cache and the lower level cache. The upper level cache initiates, on the system interconnect fabric, a second interconnect transaction corresponding to the second memory access request only after first issuing the second memory access request to the lower level cache via the private communication channel between the upper level cache and the lower level cache and receiving a response to the second memory access request from the lower level cache.
-
公开(公告)号:US11693788B1
公开(公告)日:2023-07-04
申请号:US17834505
申请日:2022-06-07
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Derek E. Williams , Guy L. Guthrie , Luke Murray , Hugh Shen
IPC: G06F12/1045 , G06F12/0811 , G06F12/1027
CPC classification number: G06F12/1045 , G06F12/0811 , G06F12/1027 , G06F2212/681
Abstract: An arbiter gathers translation invalidation requests assigned to state machines of a lower-level cache into a set for joint handling in a processor core. The gathering includes selection of one of the set of gathered translation invalidation requests as an end-of-sequence (EOS) request. The arbiter issues to the processor core a sequence of the gathered translation invalidation requests terminating with the EOS request. Based on receipt of each of the gathered requests, the processor core invalidates any translation entries providing translation for the addresses specified by the translation invalidation requests and marks memory-referent requests dependent on the invalidated translation entries. Based on receipt of the EOS request and in response to all of the marked memory-referent requests draining from the processor core, the processor core issues a completion request to the lower-level cache indicating completion of servicing by the processor core of the set of gathered translation invalidation requests.
-
公开(公告)号:US20230061030A1
公开(公告)日:2023-03-02
申请号:US17465228
申请日:2021-09-02
Applicant: International Business Machines Corporation
Inventor: Bryan Lloyd , Guy L. Guthrie , Susan E. Eisen , Dhivya Jeganathan , Luke Murray
Abstract: A first instruction for processing by a processor core is received. Whether the instruction is a larx is determined. Responsive to determining the instruction is a larx, whether a cacheline associated with the larx is locked is determined. Responsive to determining the cacheline associated with the larx is not locked, the cacheline associated with the larx is locked and a counter associated with a first thread of the processor core is started. The first thread is processing the first instruction.
-
公开(公告)号:US20230041702A1
公开(公告)日:2023-02-09
申请号:US17394173
申请日:2021-08-04
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: DEREK E. WILLIAMS , GUY L. GUTHRIE , Bernard C. Drerup , Hugh Shen , Alexander Michael Taft , Luke Murray , Richard Nicholas
IPC: G06F12/0811 , G06F12/121 , G06F30/32
Abstract: A data processing system includes a plurality of processor cores each supported by a respective one of a plurality of vertical cache hierarchies. Based on receiving on a system fabric a cache injection request requesting injection of a data into a cache line identified by a target real address, the data is written into a cache in a first vertical cache hierarchy among the plurality of vertical cache hierarchies. Based on a value in a field of the cache injection request, a distribute field is set in a directory entry of the first vertical cache hierarchy. Upon eviction of the cache line the first vertical cache hierarchy, a determination is made whether the distribute field is set. Based on determining the distribute field is set, a lateral castout of the cache line from the first vertical cache hierarchy to a second vertical cache hierarchy is performed.
-
公开(公告)号:US11561901B1
公开(公告)日:2023-01-24
申请号:US17394173
申请日:2021-08-04
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Derek E. Williams , Guy L. Guthrie , Bernard C. Drerup , Hugh Shen , Alexander Michael Taft , Luke Murray , Richard Nicholas
IPC: G06F12/0811 , G06F30/32 , G06F12/121
Abstract: A data processing system includes a plurality of processor cores each supported by a respective one of a plurality of vertical cache hierarchies. Based on receiving on a system fabric a cache injection request requesting injection of a data into a cache line identified by a target real address, the data is written into a cache in a first vertical cache hierarchy among the plurality of vertical cache hierarchies. Based on a value in a field of the cache injection request, a distribute field is set in a directory entry of the first vertical cache hierarchy. Upon eviction of the cache line the first vertical cache hierarchy, a determination is made whether the distribute field is set. Based on determining the distribute field is set, a lateral castout of the cache line from the first vertical cache hierarchy to a second vertical cache hierarchy is performed.
-
公开(公告)号:US12056050B2
公开(公告)日:2024-08-06
申请号:US18086501
申请日:2022-12-21
Applicant: International Business Machines Corporation
Inventor: Derek E. Williams , Luke Murray , Guy L. Guthrie , Hugh Shen
IPC: G06F12/0802 , G06F9/54 , G06F12/10
CPC classification number: G06F12/0802 , G06F9/542 , G06F12/10 , G06F2212/603
Abstract: A data processing system includes a master, a central request agent, and a plurality of snoopers communicatively coupled to a system fabric for communicating requests subject to retry. The master issues on the system fabric a multicast request intended for the plurality of snoopers. The central request agent receives the multicast request on the system fabric, assigns the multicast request to a particular state machine among a plurality of state machines in the central request agent, and provides the master a coherence response indicating successful completion of the multicast request. The central request agent repetitively issues on the system fabric a multicast request in association with a machine identifier identifying the particular state machine until a coherence response indicates the multicast request is successfully received by all of the plurality of snoopers.
-
公开(公告)号:US11775337B2
公开(公告)日:2023-10-03
申请号:US17465228
申请日:2021-09-02
Applicant: International Business Machines Corporation
Inventor: Bryan Lloyd , Guy L. Guthrie , Susan E. Eisen , Dhivya Jeganathan , Luke Murray
IPC: G06F16/176 , G06F9/52 , G06F9/48 , G06F9/30
CPC classification number: G06F9/4818 , G06F9/30043 , G06F9/30087 , G06F9/526
Abstract: A first instruction for processing by a processor core is received. Whether the instruction is a larx is determined. Responsive to determining the instruction is a larx, whether a cacheline associated with the larx is locked is determined. Responsive to determining the cacheline associated with the larx is not locked, the cacheline associated with the larx is locked and a counter associated with a first thread of the processor core is started. The first thread is processing the first instruction.
-
-
-
-
-
-
-
-
-