Patent search ap:("Apple Inc.") AND inv:"Robert D. Kenney" Page 2

11.

发明授权
Multi-channel data path circuitry 有权

公开(公告)号：US11422822B2

公开(公告)日：2022-08-23

申请号：US16870330

申请日：2020-05-08

Applicant: Apple Inc.

Inventor： Robert D. Kenney , Jason N. Dale

IPC: G06F9/38 , G06T1/20 , G06F9/54 , G06F15/80

Abstract: Techniques are disclosed relating to sharing datapath circuitry among multiple SIMD groups. In some embodiments, pipeline circuitry is configured to perform operations specified by instructions of first and second assigned SIMD groups. The pipeline circuitry may include first and second front-end circuitry configured to decode instructions of the respective SIMD groups. The pipeline circuitry may include shared execution circuitry configured to perform operations specified by the first and second assigned SIMD groups and arbitration circuitry configured to select an instruction from among at least the first and second front-end circuitry for assignment to the shared execution circuitry in a current cycle. The arbitration circuitry may select an instruction based on one or more of: stall counts, whether available instructions are being speculatively executed, whether ones of available instructions target a particular portion of the shared execution circuitry, numbers of execution cycles, and SIMD group ages.

12.

发明授权
Register file arbitration 有权

公开(公告)号：US11080055B2

公开(公告)日：2021-08-03

申请号：US16548797

申请日：2019-08-22

Applicant: Apple Inc.

Inventor： Robert D. Kenney , Terence M. Potter

IPC: G06F9/30

Abstract: Techniques are disclosed relating to arbitration among register file accesses. In some embodiments, an apparatus includes a register file configured to store operands for multiple client circuits and arbitration circuitry configured to select from among multiple received requests to access the register file. In some embodiments, the apparatus includes first interface circuitry configured to provide access requests from a first client circuit to the arbitration circuitry and supplemental interface circuitry configured to receive unsuccessful requests from the first client circuit and provide the received unsuccessful requests to the arbitration circuitry. The supplemental interface circuitry may provide additional catch-up bandwidth to clients that lose arbitration, which may result in fairness during bandwidth shortages.

13.

发明申请
SIMD Operand Permutation with Selection from among Multiple Registers 有权

公开(公告)号：US20210149679A1

公开(公告)日：2021-05-20

申请号：US16686060

申请日：2019-11-15

Applicant: Apple Inc.

Inventor： Christopher A. Burns , Liang-Kai Wang , Robert D. Kenney , Terence M. Potter

IPC: G06F9/38 , G06F9/30 , G06T1/20 , G06T1/60

Abstract: Techniques are disclosed relating to operand routing among SIMD pipelines. In some embodiments, an apparatus includes a set of multiple hardware pipelines configured to execute a single-instruction multiple-data (SIMD) instruction for multiple threads in parallel, wherein the instruction specifies first and second architectural registers. In some embodiments, the pipelines include execution circuitry configured to perform operations using one or more pipeline stages of the pipeline. In some embodiments, the pipelines include routing circuitry configured to select, based on the instruction, a first input operand for the execution circuitry from among: a value from the first architectural register from thread-specific storage for another pipeline and a value from the second architectural register from thread-specific storage for a thread assigned to another pipeline. In some embodiments, the routing circuitry may support a shift and fill instruction that facilitates storage of an arbitrary portion of a graphics frame in one or more registers.

14.

发明申请
Routing Circuitry for Permutation of Single-Instruction Multiple-Data Operands 有权

公开(公告)号：US20210055931A1

公开(公告)日：2021-02-25

申请号：US16548812

申请日：2019-08-22

Applicant: Apple Inc.

Inventor： Robert D. Kenney , Liang-Kai Wang , Terence M. Potter

IPC: G06F9/30 , G06F9/38 , G06F9/54

Abstract: Techniques are disclosed relating to routing circuitry configured to perform permute operations for operands of threads in a single-instruction multiple-data group. In some embodiments, an apparatus includes hierarchical operand routing circuitry configured to route operands between a set of single-instruction multiple-data (SIMD) pipelines based on a permute instruction. In some embodiments, the routing circuitry includes a first level and a second level. The first level may include a set of multiple crossbar circuits each configured to receive operands from a respective subset of the pipelines and output one or more of the received operands on multiple output lines based on the permute instruction, where the crossbar circuits support full permutation within a respective subset. A second level may be configured to select an operand from a previous level for each of the pipelines, and may select from among only a portion of output operands from the previous level to provide an operand for a respective pipeline.

15.

发明申请
Register File Arbitration 有权

公开(公告)号：US20210055929A1

公开(公告)日：2021-02-25

申请号：US16548797

申请日：2019-08-22

Applicant: Apple Inc.

Inventor： Robert D. Kenney , Terence M. Potter

IPC: G06F9/30 , G06F13/16 , G06F9/54

Abstract: Techniques are disclosed relating to arbitration among register file accesses. In some embodiments, an apparatus includes a register file configured to store operands for multiple client circuits and arbitration circuitry configured to select from among multiple received requests to access the register file. In some embodiments, the apparatus includes first interface circuitry configured to provide access requests from a first client circuit to the arbitration circuitry and supplemental interface circuitry configured to receive unsuccessful requests from the first client circuit and provide the received unsuccessful requests to the arbitration circuitry. The supplemental interface circuitry may provide additional catch-up bandwidth to clients that lose arbitration, which may result in fairness during bandwidth shortages.

16.

发明授权
SIMD operand permutation with selection from among multiple registers 有权

公开(公告)号：US12008377B2

公开(公告)日：2024-06-11

申请号：US18299452

申请日：2023-04-12

Applicant: Apple Inc.

Inventor： Christopher A. Burns , Liang-Kai Wang , Robert D. Kenney , Terence M. Potter

IPC: G06F15/80 , G06F9/30 , G06F9/38 , G06T1/20 , G06T1/60

CPC classification number: G06F9/3887 , G06F9/30098 , G06T1/20 , G06T1/60

Abstract: Techniques are disclosed relating to operand routing among SIMD pipelines. In some embodiments, an apparatus includes a set of multiple hardware pipelines configured to execute a single-instruction multiple-data (SIMD) instruction for multiple threads in parallel, wherein the instruction specifies first and second architectural registers. In some embodiments, the pipelines include execution circuitry configured to perform operations using one or more pipeline stages of the pipeline. In some embodiments, the pipelines include routing circuitry configured to select, based on the instruction, a first input operand for the execution circuitry from among: a value from the first architectural register from thread-specific storage for another pipeline and a value from the second architectural register from thread-specific storage for a thread assigned to another pipeline. In some embodiments, the routing circuitry may support a shift and fill instruction that facilitates storage of an arbitrary portion of a graphics frame in one or more registers.

17.

发明授权
SIMD operand permutation with selection from among multiple registers 有权

公开(公告)号：US11126439B2

公开(公告)日：2021-09-21

申请号：US16686060

申请日：2019-11-15

Applicant: Apple Inc.

Inventor： Christopher A. Burns , Liang-Kai Wang , Robert D. Kenney , Terence M. Potter

IPC: G06F15/80 , G06F9/38 , G06T1/60 , G06T1/20 , G06F9/30

Abstract: Techniques are disclosed relating to operand routing among SIMD pipelines. In some embodiments, an apparatus includes a set of multiple hardware pipelines configured to execute a single-instruction multiple-data (SIMD) instruction for multiple threads in parallel, wherein the instruction specifies first and second architectural registers. In some embodiments, the pipelines include execution circuitry configured to perform operations using one or more pipeline stages of the pipeline. In some embodiments, the pipelines include routing circuitry configured to select, based on the instruction, a first input operand for the execution circuitry from among: a value from the first architectural register from thread-specific storage for another pipeline and a value from the second architectural register from thread-specific storage for a thread assigned to another pipeline. In some embodiments, the routing circuitry may support a shift and fill instruction that facilitates storage of an arbitrary portion of a graphics frame in one or more registers.

18.

发明申请
Datapath Circuitry for Math Operations using SIMD Pipelines 有权

公开(公告)号：US20210109761A1

公开(公告)日：2021-04-15

申请号：US16597625

申请日：2019-10-09

Applicant: Apple Inc.

Inventor： Liang-Kai Wang , Robert D. Kenney , Terence M. Potter , Vinod Reddy Nalamalapu , Sivayya V. Ayinala

IPC: G06F9/38 , G06F9/30 , G06F17/16

Abstract: Techniques are disclosed relating to sharing operands among SIMD threads for a larger arithmetic operation. In some embodiments, a set of multiple hardware pipelines is configured to execute single-instruction multiple-data (SIMD) instructions for multiple threads in parallel, where ones of the hardware pipelines include execution circuitry configured to perform floating-point operations using one or more pipeline stages of the pipeline and first routing circuitry configured to select, from among thread-specific operands stored for the hardware pipeline and from one or more other pipelines in the set, a first input operand for an operation by the execution circuitry. In some embodiments, a device is configured to perform a mathematical operation on source input data structures stored across thread-specific storage for the set of hardware pipelines, by executing multiple SIMD floating-point operations using the execution circuitry and the first routing circuitry. This may improve performance and reduce power consumption for matrix multiply and reduction operations, for example.

19.

发明授权
Pipelined allocation for operand cache 有权

公开(公告)号：US10678548B2

公开(公告)日：2020-06-09

申请号：US16112614

申请日：2018-08-24

Applicant: Apple Inc.

Inventor： Robert D. Kenney , Terence M. Potter , Andrew M. Havlir , Sivayya V. Ayinala

IPC: G06F9/30 , G06F9/52 , G06F9/38

Abstract: Techniques are disclosed relating to controlling an operand cache in a pipelined fashion. An operand cache may cache operands fetched from the register file or generated by previous instructions to improve performance and/or reduce power consumption. In some embodiments, instructions are pipelined and separate tag information is maintained to indicate allocation of an operand cache entry and ownership of the operand cache entry. In some embodiments, this may allow an operand to remain in the operand cache (and potentially be retrieved or modified) during an interval between allocation of the entry for another operand and ownership of the entry by the other operand. This may improve operand cache efficiency by allowing the entry to be used while to retrieving the other operand from the register file, for example.

20.

发明申请
Pipelined Allocation for Operand Cache 审中-公开

公开(公告)号：US20200065104A1

公开(公告)日：2020-02-27

申请号：US16112614

申请日：2018-08-24

Applicant: Apple Inc.

Inventor： Robert D. Kenney , Terence M. Potter , Andrew M. Havlir , Sivayya V. Ayinala

IPC: G06F9/38 , G06F9/30

Abstract: Techniques are disclosed relating to controlling an operand cache in a pipelined fashion. An operand cache may cache operands fetched from the register file or generated by previous instructions to improve performance and/or reduce power consumption. In some embodiments, instructions are pipelined and separate tag information is maintained to indicate allocation of an operand cache entry and ownership of the operand cache entry. In some embodiments, this may allow an operand to remain in the operand cache (and potentially be retrieved or modified) during an interval between allocation of the entry for another operand and ownership of the entry by the other operand. This may improve operand cache efficiency by allowing the entry to be used while to retrieving the other operand from the register file, for example.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification