Patent search cpc:"G06F9/30098" Page 2

11.

发明申请
STORE NULLIFICATION IN THE TARGET FIELD 审中-公开
Title translation: 目标领域的存储失效

公开(公告)号：WO2017048641A1

公开(公告)日：2017-03-23

申请号：PCT/US2016/051402

申请日：2016-09-13

Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventor： BURGER, Douglas C. , SMITH, Aaron L.

IPC: G06F9/38 , G06F9/30

CPC classification number: G06F9/3016 , G06F9/268 , G06F9/30007 , G06F9/30021 , G06F9/30036 , G06F9/3004 , G06F9/30043 , G06F9/30047 , G06F9/3005 , G06F9/30058 , G06F9/30072 , G06F9/30076 , G06F9/30087 , G06F9/3009 , G06F9/30098 , G06F9/30101 , G06F9/30105 , G06F9/3013 , G06F9/30145 , G06F9/30167 , G06F9/30189 , G06F9/32 , G06F9/321 , G06F9/345 , G06F9/35 , G06F9/355 , G06F9/3557 , G06F9/3802 , G06F9/3804 , G06F9/3822 , G06F9/3824 , G06F9/383 , G06F9/3836 , G06F9/3838 , G06F9/3842 , G06F9/3848 , G06F9/3851 , G06F9/3853 , G06F9/3855 , G06F9/3859 , G06F9/3867 , G06F9/3891 , G06F9/466 , G06F9/528 , G06F11/36 , G06F11/3648 , G06F11/3656 , G06F12/0806 , G06F12/0811 , G06F12/0862 , G06F12/0875 , G06F12/1009 , G06F13/4221 , G06F15/7867 , G06F15/80 , G06F15/8007 , G06F2212/452 , G06F2212/602 , G06F2212/604 , G06F2212/62 , Y02D10/13 , Y02D10/14 , Y02D10/151

Abstract: Apparatus and methods are disclosed for nullifying memory store instructions identified in a target field of a nullification instruction. In some examples of the disclosed technology, an apparatus can include memory and one or more block-based processor cores configured to fetch and execute a plurality of instruction blocks. One of the cores can include a control unit configured, based at least in part on receiving a nullification instruction, to obtain an instruction identification for a memory access instruction of a plurality of memory access instructions, based on a target field of the nullification instruction. The memory access instruction associated with the instruction identification is nullified. The memory access instruction is in a first instruction block of the plurality of instruction blocks. Based on the nullified memory access instruction, a subsequent memory access instruction from the first instruction block is executed.

Abstract translation: 公开了用于使在无效指令的目标字段中标识的存储器存储指令无效的装置和方法。在所公开技术的一些示例中，装置可以包括被配置为获取和执行多个指令块的存储器和一个或多个基于块的处理器核心。核心之一可以包括基于至少部分地基于接收到无效指令而配置的控制单元，以基于无效指令的目标字段获得多个存储器访问指令的存储器访问指令的指令标识。与指令识别相关联的存储器访问指令无效。存储器访问指令位于多个指令块的第一指令块中。基于无效存储器访问指令，执行来自第一指令块的后续存储器访问指令。

12.

发明申请
OUT OF ORDER COMMIT 审中-公开
Title translation: 超出订单

公开(公告)号：WO2017048606A1

公开(公告)日：2017-03-23

申请号：PCT/US2016/051208

申请日：2016-09-12

Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventor： BURGER, Douglas C. , SMITH, Aaron L.

IPC: G06F9/46 , G06F9/52

CPC classification number: G06F9/3016 , G06F9/268 , G06F9/30007 , G06F9/30021 , G06F9/30036 , G06F9/3004 , G06F9/30043 , G06F9/30047 , G06F9/3005 , G06F9/30058 , G06F9/30072 , G06F9/30076 , G06F9/30087 , G06F9/3009 , G06F9/30098 , G06F9/30101 , G06F9/30105 , G06F9/3013 , G06F9/30145 , G06F9/30167 , G06F9/30189 , G06F9/32 , G06F9/321 , G06F9/345 , G06F9/35 , G06F9/355 , G06F9/3557 , G06F9/3802 , G06F9/3804 , G06F9/3822 , G06F9/3824 , G06F9/383 , G06F9/3836 , G06F9/3838 , G06F9/3842 , G06F9/3848 , G06F9/3851 , G06F9/3853 , G06F9/3855 , G06F9/3859 , G06F9/3867 , G06F9/3891 , G06F9/466 , G06F9/528 , G06F11/36 , G06F11/3648 , G06F11/3656 , G06F12/0806 , G06F12/0811 , G06F12/0862 , G06F12/0875 , G06F12/1009 , G06F13/4221 , G06F15/7867 , G06F15/80 , G06F15/8007 , G06F2212/452 , G06F2212/602 , G06F2212/604 , G06F2212/62 , Y02D10/13 , Y02D10/14 , Y02D10/151

Abstract: The disclosed technology can be used for executing and committing instruction blocks of a block-based processor architecture out-of-order. In one example of the disclosed technology, an apparatus can include a plurality of block-based processor cores which can include a first group of cores and a second group of cores. The first group of cores can be configured to commit instruction blocks of the set of instruction blocks in a sequential program order. The second group of cores can be configured to commit instruction blocks of the set of instruction blocks out-of-order relative to the sequential program order.

Abstract translation: 所公开的技术可以用于执行和提交无序的基于块的处理器架构的指令块。在所公开的技术的一个示例中，设备可以包括多个基于块的处理器核，其可以包括第一组核和第二组核。第一组核心可以被配置为以顺序的程序顺序提交该组指令块的指令块。可以将第二组核心配置为相对于顺序程序顺序提交指令块集无序的指令块。

13.

发明申请
MIXED-WIDTH SIMD OPERATIONS HAVING EVEN-ELEMENT AND ODD-ELEMENT OPERATIONS USING REGISTER PAIR FOR WIDE DATA ELEMENTS 审中-公开
Title translation: 使用寄存器对进行数据元素的混合宽度SIMD操作具有即时元素和空白元素操作

公开(公告)号：WO2017014892A1

公开(公告)日：2017-01-26

申请号：PCT/US2016/038487

申请日：2016-06-21

Applicant: QUALCOMM INCORPORATED

Inventor： MAHURIN, Eric Wayne , INGLE, Ajay Anant

IPC: G06F9/30

CPC classification number: G06F9/30036 , G06F9/3001 , G06F9/30032 , G06F9/30098 , G06F9/30109 , G06F9/30112

Abstract: Systems and methods relate to a mixed-width single instruction multiple data (SIMD) instruction which has at least a source vector operand comprising data elements of a first bit-width and a destination vector operand comprising data elements of a second bit-width, wherein the second bit-width is either half of or twice the first bit-width. Correspondingly, one of the source or destination vector operands is expressed as a pair of registers, a first register and a second register. The other vector operand is expressed as a single register. Data elements of the first register correspond to even-numbered data elements of the other vector operand expressed as a single register, and data elements of the second register correspond to data elements of the other vector operand expressed as a single register.

Abstract translation: 系统和方法涉及混合宽度单指令多数据（SIMD）指令，其具有至少包括第一位宽的数据元素和包含第二位宽的数据元素的目的地向量操作数的源向量操作数，其中第二个位宽是第一个位宽的一半或两倍。相应地，源或目标向量操作数之一被表示为一对寄存器，第一寄存器和第二寄存器。另一个向量操作数表示为单个寄存器。第一寄存器的数据元素对应于表示为单个寄存器的另一向量操作数的偶数数据元，第二寄存器的数据元对应于表示为单个寄存器的另一向量操作数的数据元。

14.

发明申请
PARALLELIZATION OF SCALAR OPERATIONS BY VECTOR PROCESSORS USING DATA-INDEXED ACCUMULATORS IN VECTOR REGISTER FILES, AND RELATED CIRCUITS, METHODS, AND COMPUTER-READABLE MEDIA 审中-公开
Title translation: 使用矢量寄存器文件中的数据索引累加器的矢量处理器和相关电路，方法和计算机可读介质的标量运算的并行化

公开(公告)号：WO2016014213A1

公开(公告)日：2016-01-28

申请号：PCT/US2015/038013

申请日：2015-06-26

Applicant: QUALCOMM INCORPORATED

Inventor： CODRESCU, Lucian , MAHURIN, Eric, Wayne

IPC: G06F9/30

CPC classification number: G06F15/82 , G06F9/3001 , G06F9/30098 , G06F9/30109 , G06F9/3012

Abstract: Parallelization of scalar operations by vector processors using data-indexed accumulators in vector register files, related circuits, methods, and computer-readable media are disclosed. In one aspect, a vector processor comprises a vector register file providing a plurality of write ports and a plurality of vector registers each providing a plurality of accumulators. The vector processor receives an input data vector. For each of the plurality of write ports, the vector processor executes vector operation(s) for accessing an input data value of the input data vector, and determining, based on the input data value, a register index for a vector register among the plurality of vector registers, and an accumulator index for an accumulator among the plurality of accumulators of the vector register. Based on the register index, a register value is retrieved from the register index, and a scalar operation is performed based on the register value and the accumulator index.

Abstract translation: 公开了使用向量寄存器文件，相关电路，方法和计算机可读介质中的数据索引累加器的矢量处理器的标量运算的并行化。一方面，向量处理器包括提供多个写入端口的向量寄存器文件和多个向量寄存器，每个向量寄存器提供多个累加器。向量处理器接收输入数据向量。对于多个写入端口中的每一个，向量处理器执行用于访问输入数据向量的输入数据值的向量操作，并且基于输入数据值，确定多个写入端口中的向量寄存器的寄存器索引矢量寄存器的多个累加器中的累加器的累加器索引。基于寄存器索引，从寄存器索引检索寄存器值，并且基于寄存器值和累加器索引执行标量运算。

15.

发明申请
FREEING PHYSICAL REGISTERS IN A MICROPROCESSOR 审中-公开
Title translation: 在微处理器中释放物理寄存器

公开(公告)号：WO2015142435A1

公开(公告)日：2015-09-24

申请号：PCT/US2015/014541

申请日：2015-02-05

Applicant: QUALCOMM INCORPORATED

Inventor： KRISHNA, Anil , WU, Weidan , NAVADA, Sandeep Suresh , CHOUDHARY, Niket Kumar , SMITH, Rodney Wayne

IPC: G06F9/38

CPC classification number: G06F9/30098 , G06F9/3832 , G06F9/3838 , G06F9/384 , G06F9/3861

Abstract: Physical register scrubbing in computer microprocessors. Most instructions in a computer program produce some output value that is destined for one or more architected registers. These architected destination registers are renamed, in the processor pipeline, to physical registers in order to improve performance by exposing more instruction level parallelism to the processor. In one aspect, a method comprises identifying, in a reorder buffer, a first instruction and a second instruction, without intervening potential pipeline flushers, that write to the same architected destination register, in order to free the physical register corresponding to the older of the two instructions.

Abstract translation: 计算机微处理器中的物理寄存器擦除。计算机程序中的大多数指令产生一些输出值，用于一个或多个架构化寄存器。这些架构化的目标寄存器在处理器流水线中被重命名为物理寄存器，以便通过向处理器暴露更多的指令级并行性来提高性能。在一个方面，一种方法包括在重排序缓冲器中识别第一指令和第二指令，而不间断地写入到同一架构目的寄存器的潜在流水线冲洗器，以便释放对应于较早的两个说明。

16.

发明申请
A DATA PROCESSING APPARATUS AND METHOD FOR PERFORMING SEGMENTED OPERATIONS 审中-公开
Title translation: 一种数据处理装置和执行分离操作的方法

公开(公告)号：WO2015118299A1

公开(公告)日：2015-08-13

申请号：PCT/GB2015/050132

申请日：2015-01-21

Applicant: ARM LIMITED

Inventor： EYOLE-MONONO, Mbou , REID, Alastair David , BÖTTCHER, Matthias Lothar , GABRIELLI, Giacomo

IPC: G06F9/30

CPC classification number: G06F9/30036 , G06F9/30014 , G06F9/30072 , G06F9/30076 , G06F9/30098 , G06F9/3887 , G06F9/3891

Abstract: A data processing apparatus and method are provided for performing segmented operations. The data processing apparatus comprises a vector register store for storing vector operands, and vector processing circuitry providing N lanes of parallel processing, and arranged to perform a segmented operation on up to N data elements provided by a specified vector operand, each data element being allocated to one of the N lanes. The up to N data elements forms a plurality of segments, and performance of the segmented operation comprises performing a separate operation on the data elements of each segment, the separate operation involving interaction between the lanes containing the data elements of the associated segment. Predicate generation circuitry is responsive to a compute descriptor instruction specifying an input vector operand comprising a plurality of segment descriptors, to generate per lane predicate information used by the vector processing circuitry when performing the segmented operation to maintain a boundary between each of the plurality of segments. As a result, interaction between lanes containing data elements from different segments is prevented. This allows very effective utilisation of the lanes of parallel processing within the vector processing circuitry to be achieved.

Abstract translation: 提供了一种用于执行分段操作的数据处理装置和方法。数据处理装置包括用于存储向量操作数的向量寄存器存储器和提供N个并行处理通道的向量处理电路，并且被布置为对由指定向量操作数提供的多达N个数据元素执行分段操作，每个数据元素被分配到N条车道之一。最多N个数据元素形成多个段，并且分段操作的执行包括对每个段的数据元素执行单独的操作，该单独操作涉及包含相关段的数据元素的通道之间的交互。谓词生成电路响应于指定包括多个段描述符的输入向量操作数的计算描述符指令，以在执行分割操作时生成由向量处理电路使用的每通道谓词信息，以维持多个段中的每个段之间的边界。结果，阻止了包含来自不同段的数据元素的通道之间的相互作用。这允许在矢量处理电路内非常有效地利用并行处理的通道。

17.

发明申请
DEBUGGING NON-DETERMINISTIC EMBEDDED SYSTEMS 审中-公开
Title translation: 调查非决定性嵌入式系统

公开(公告)号：WO2015061022A8

公开(公告)日：2015-07-16

申请号：PCT/US2014058999

申请日：2014-10-03

Applicant: PURDUE RESEARCH FOUNDATION

Inventor： BAGCHI SAURABH , CRETI MATTHEW EDWARD TAN , SUNDARAM VINAITHEERTHAN , EUGSTER PATRICK

IPC: G06F11/36

CPC classification number: G06F11/366 , G06F9/30098 , G06F9/3865 , G06F11/362 , G06F11/364 , H03M7/60

Abstract: An embedded device includes a processor executing instructions from module(s) in a code memory. The instructions specify: reading data from two non-deterministic registers (NDRs) of different types, compressing the data using respective, different compression algorithms, and storing the compressed data in a nonvolatile medium. A method of enabling debug tracing in a computer program product (CPP) includes locating instructions in the CPP that read NDRs, determining types of the NDRs, and adding instruction(s) to the CPP to compress the values read using compression algorithms corresponding to the respective NDR types. An emulator in a computer-readable medium receives emulation-target instructions (ETIs) and compressed NDR data, and emulates an execution sequence of the ETIs by determining NDR-reading instructions, determining a type of the NDR read by each, decompressing a portion of the NDR data using a type-specific decompressor, and updating emulated-machine state based on the decompressed portion.

Abstract translation: 嵌入式设备包括执行来自代码存储器中的模块的指令的处理器。该指令指定：从两种不同类型的非确定性寄存器（NDR）读取数据，使用相应的不同压缩算法压缩数据，并将压缩数据存储在非易失性介质中。一种启用计算机程序产品（CPP）中的调试跟踪的方法包括定位读取NDR的CPP中的指令，确定NDR的类型，以及向CPP添加指令以使用对应于各自的NDR类型。计算机可读介质中的仿真器接收仿真目标指令（ETI）和压缩的NDR数据，并且通过确定NDR读取指令来模拟ETI的执行顺序，确定由每个读取指令读取的NDR的类型，解压缩一部分使用类型特定解压缩器的NDR数据，以及基于解压缩部分更新仿真机状态。

18.

发明申请
METHODS, APPARATUS, INSTRUCTIONS AND LOGIC TO PROVIDE VECTOR SUB-BYTE DECOMPRESSION FUNCTIONALITY 审中-公开
Title translation: 方法，装置，说明和逻辑提供矢量子字节分解功能

公开(公告)号：WO2015017870A1

公开(公告)日：2015-02-05

申请号：PCT/US2014/055540

申请日：2014-09-13

Applicant: INTEL CORPORATION , ULIEL, Tal , OULD-AHMED-VALL, Elmoustapha , WILLHALM, Thomas , VALENTINE, Robert

Inventor： ULIEL, Tal , OULD-AHMED-VALL, Elmoustapha , WILLHALM, Thomas , VALENTINE, Robert

IPC: G06F15/80

CPC classification number: G06F9/30036 , G06F9/30032 , G06F9/30098 , G06F9/30109 , G06F9/30112 , G06F9/3016 , G06F9/3889

Abstract: Methods, apparatus, instructions and logic provide SIMD vector sub-byte decompression functionality. Embodiments include shuffling a first and second byte into the least significant portion of a first vector element, and a third and fourth byte into the most significant portion. Processing continues shuffling a fifth and sixth byte into the least significant portion of a second vector element, and a seventh and eighth byte into the most significant portion. Then by shifting the first vector element by a first shift count and the second vector element by a second shift count, sub-byte elements are aligned to the least significant bits of their respective bytes. Processors then shuffle a byte from each of the shifted vector elements' least significant portions into byte positions of a destination vector element, and from each of the shifted vector elements' most significant portions into byte positions of another destination vector element.

Abstract translation: 方法，装置，指令和逻辑提供SIMD矢量子字节解压缩功能。实施例包括将第一和第二字节混洗到第一向量元素的最低有效部分中，以及将第三和第四字节混入最重要部分。处理继续将第五和第六字节洗牌到第二向量元素的最低有效部分，并将第七和第八字节重新排列到最高有效部分。然后，通过将第一移位计数和第二向量元素移位第二移位计数，将子字节元素与它们各自的字节的最低有效位对齐。然后，处理器将来自移位向量元素的最小有效部分的每一个的字节从目的地向量元素的字节位置以及从每个移位向量元素的最高有效部分转移到另一目的地向量元素的字节位置。

19.

发明申请
SELECTIVELY CONTROLLING INSTRUCTION EXECUTION IN TRANSACTIONAL PROCESSING 审中-公开
Title translation: 选择性地控制交易处理中的指令执行

公开(公告)号：WO2013186722A3

公开(公告)日：2014-03-27

申请号：PCT/IB2013054813

申请日：2013-06-12

Applicant: IBM , IBM UK

Inventor： GREINER DAN , JACOBI CHRISTIAN , SLEGEL TIMOTHY , ROGERS ROBERT

IPC: G06F9/46

CPC classification number: G06F9/3861 , G06F9/3001 , G06F9/3004 , G06F9/30047 , G06F9/30076 , G06F9/30087 , G06F9/30098 , G06F9/30145 , G06F9/30181 , G06F9/3834 , G06F9/466

Abstract: Execution of instructions in a transactional environment is selectively controlled. A TRANSACTION BEGIN instruction initiates a transaction and includes controls that selectively indicate whether certain types of instructions are permitted to execute within the transaction. The controls include one or more of an allow access register modification control and an allow floating point operation control.

Abstract translation: 选择性地控制事务环境中的指令的执行。一个TRANSACTION BEGIN指令启动事务，并且包括有选择地指示某些类型的指令是否允许在事务内执行的控件。控制包括允许访问寄存器修改控制和允许浮点操作控制中的一个或多个。

20.

发明申请
SYSTEMS, APPARATUSES, AND METHODS FOR PERFORMING A SHUFFLE AND OPERATION (SHUFFLE-OP) 审中-公开
Title translation: 用于执行机器人和操作的系统，设备和方法（SHUFFLE-OP）

公开(公告)号：WO2014004050A2

公开(公告)日：2014-01-03

申请号：PCT/US2013/044799

申请日：2013-06-07

Applicant: INTEL CORPORATION

Inventor： ERMOLAEV, Igor , ELMOUSTAPHA, Ould-Ahmed-Vall , TOLL, Bret , CORBAL, Jesus , NARAIKIN, Andrey

CPC classification number: G06F9/30036 , G06F9/3001 , G06F9/30029 , G06F9/30032 , G06F9/30098 , G06F9/30145 , G06F9/3016

Abstract: Embodiments of systems, apparatuses, and methods for performing in a computer processor a data element shuffle and an operation on the shuffled data elements in response to a single data element shuffle and an operation instruction that includes a destination vector register operand, a first and second source vector register operands, an immediate value, and an opcode are described.

Abstract translation: 用于在计算机处理器中响应于单个数据元素随机播放的数据元素随机播放和对混洗数据元素的操作的系统，装置和方法的实施例，以及包括目的地向量寄存器操作数，第一和第二描述源向量寄存器操作数，立即值和操作码。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification