-
公开(公告)号:US20070079179A1
公开(公告)日:2007-04-05
申请号:US11240982
申请日:2005-09-30
申请人: Stephan Jourdan , Avinash Sodani , Michael Fetterman , Per Hammarlund , Ronak Singhal , Glenn Hinton
发明人: Stephan Jourdan , Avinash Sodani , Michael Fetterman , Per Hammarlund , Ronak Singhal , Glenn Hinton
IPC分类号: G06F11/00
CPC分类号: G06F9/3001 , G06F9/3012 , G06F9/3885
摘要: In one embodiment, the present invention includes a method for executing an operation on low order portions of first and second source operands using a first execution stack of a processor and executing the operation on high order portions of the first and second source operands using a second execution stack of the processor, where the operation in the second execution stack is staggered by one or more cycles from the operation in the first execution stack. Other embodiments are described and claimed.
摘要翻译: 在一个实施例中,本发明包括一种使用处理器的第一执行堆栈来执行第一和第二源操作数的低阶部分的操作的方法,并且使用第二和第二源操作数对第一和第二源操作数的高阶部分执行操作 处理器的执行堆栈,其中第二执行堆栈中的操作与第一执行堆栈中的操作交错一个或多个周期。 描述和要求保护其他实施例。
-
公开(公告)号:US07457938B2
公开(公告)日:2008-11-25
申请号:US11240982
申请日:2005-09-30
申请人: Stephan Jourdan , Avinash Sodani , Michael Fetterman , Per Hammarlund , Ronak Singhal , Glenn Hinton
发明人: Stephan Jourdan , Avinash Sodani , Michael Fetterman , Per Hammarlund , Ronak Singhal , Glenn Hinton
CPC分类号: G06F9/3001 , G06F9/3012 , G06F9/3885
摘要: In one embodiment, the present invention includes a method for executing an operation on low order portions of first and second source operands using a first execution stack of a processor and executing the operation on high order portions of the first and second source operands using a second execution stack of the processor, where the operation in the second execution stack is staggered by one or more cycles from the operation in the first execution stack. Other embodiments are described and claimed.
摘要翻译: 在一个实施例中,本发明包括一种用于使用处理器的第一执行堆栈来执行第一和第二源操作数的低阶部分的操作的方法,并且使用第二和第二源操作数对第一和第二源操作数的高阶部分执行操作 处理器的执行堆栈,其中第二执行堆栈中的操作与第一执行堆栈中的操作交错一个或多个周期。 描述和要求保护其他实施例。
-
公开(公告)号:US20070283129A1
公开(公告)日:2007-12-06
申请号:US11321022
申请日:2005-12-28
IPC分类号: G06F15/00
CPC分类号: G06F9/3017 , G06F9/30036 , G06F9/30192 , G06F9/3836
摘要: According to one embodiment, a method is disclosed. The method includes receiving a value at a vector length (VL) tracker and establishing a VL for subsequent micro-operations (μops) that are to be executed corresponding to the value.
摘要翻译: 根据一个实施例,公开了一种方法。 该方法包括:在向量长度(VL)跟踪器处接收一个值,并为对应于该值执行的后续微操作(muops)建立一个VL。
-
公开(公告)号:US07404065B2
公开(公告)日:2008-07-22
申请号:US11315964
申请日:2005-12-21
申请人: Stephan Jourdan , Per Hammarlund , Michael Fetterman , Michael P. Cornaby , Glenn Hinton , Avinash Sodani
发明人: Stephan Jourdan , Per Hammarlund , Michael Fetterman , Michael P. Cornaby , Glenn Hinton , Avinash Sodani
CPC分类号: G06F9/345 , G06F9/3017 , G06F9/325 , G06F9/3455 , G06F9/3844
摘要: In one embodiment, a method for flow optimization and prediction for vector streaming single instruction, multiple data (SIMD) extension (VSSE) memory operations is disclosed. The method comprises generating an optimized micro-operation (μop) flow for an instruction to operate on a vector if the instruction is predicted to be unmasked and unit-stride, the instruction to access elements in memory, and accessing via the optimized μop flow two or more of the elements at the same time without determining masks of the two or more elements. Other embodiments are also described.
摘要翻译: 在一个实施例中,公开了用于向量流单个指令,多数据(SIMD)扩展(VSSE)存储器操作的流优化和预测的方法。 该方法包括:如果预测指令是未屏蔽和单步的,则生成用于对矢量进行操作的指令的优化的微操作(muop)流程,访问存储器中的元件的指令以及经由优化的muop流2访问 或更多的元素,而不确定两个或更多个元件的掩模。 还描述了其它实施例。
-
公开(公告)号:US07457932B2
公开(公告)日:2008-11-25
申请号:US11323000
申请日:2005-12-30
申请人: Per Hammarlund , Stephan Jourdan , Michael Fetterman , Glenn Hinton , Sebastien Hily , Ronak Singhal
发明人: Per Hammarlund , Stephan Jourdan , Michael Fetterman , Glenn Hinton , Sebastien Hily , Ronak Singhal
IPC分类号: G06F12/00
CPC分类号: G06F9/30043 , G06F9/30032
摘要: A method is disclosed. The method includes scheduling a load operation at least twice the size of a maximum access supported by a memory device, dividing the load operation into a plurality of separate load operation segments having a size equivalent to the maximum access supported by the memory device, and performing each of the plurality of load operation segments. A further method is disclosed where a temporary register is used to minimize the number of memory accesses to support unaligned accesses.
摘要翻译: 公开了一种方法。 该方法包括将加载操作调度至少是由存储器件支持的最大访问大小的两倍,将加载操作划分成具有等于存储器设备支持的最大访问大小的多个单独的加载操作段,以及执行 多个加载操作段中的每一个。 公开了一种另外的方法,其中使用临时寄存器来最小化用于支持未对齐访问的存储器访问的数量。
-
公开(公告)号:US20070156990A1
公开(公告)日:2007-07-05
申请号:US11323000
申请日:2005-12-30
申请人: Per Hammarlund , Stephan Jourdan , Michael Fetterman , Glenn Hinton , Sebastien Hily , Ronak Singhal
发明人: Per Hammarlund , Stephan Jourdan , Michael Fetterman , Glenn Hinton , Sebastien Hily , Ronak Singhal
IPC分类号: G06F13/00
CPC分类号: G06F9/30043 , G06F9/30032
摘要: A method is disclosed. The method includes scheduling a load operation at least twice the size of a maximum access supported by a memory device, dividing the load operation into a plurality of separate load operation segments having a size equivalent to the maximum access supported by the memory device, and performing each of the plurality of load operation segments. A further method is disclosed where a temporary register is used to minimize the number of memory accesses to support unaligned accesses.
摘要翻译: 公开了一种方法。 该方法包括将加载操作调度至少是由存储器件支持的最大访问大小的两倍,将加载操作划分成具有等于存储器设备支持的最大访问大小的多个单独的加载操作段,以及执行 多个加载操作段中的每一个。 公开了一种另外的方法,其中使用临时寄存器来最小化用于支持未对齐访问的存储器访问的数量。
-
公开(公告)号:US08510536B2
公开(公告)日:2013-08-13
申请号:US13535685
申请日:2012-06-28
申请人: Stephan Jourdan , Michael Fetterman , Michael Cornaby , Per Hammarlund , Ronak Signhal , Glenn Hinton
发明人: Stephan Jourdan , Michael Fetterman , Michael Cornaby , Per Hammarlund , Ronak Signhal , Glenn Hinton
CPC分类号: G06F9/3824 , G06F9/30036 , G06F15/8084
摘要: Techniques for vector completion mask (VCM) handling are provided. A data structure includes a mask field for each operand of a particular operation. A processor attempts to execute the operation with multiple operands, which are identified in the data structure by the mask fields. If operands are successfully retrieved for execution with the operation, then the corresponding mask field within the data structure is cleared. The processor can reset if any field remains set within the data structure and can re-process the operation with operands that were not previously handled with the operation.
摘要翻译: 提供矢量完成掩码(VCM)处理技术。 数据结构包括用于特定操作的每个操作数的掩码字段。 处理器尝试通过掩码字段在数据结构中标识的多个操作数来执行操作。 如果成功检索操作数以执行操作,则数据结构中的相应掩码字段将被清除。 如果任何字段在数据结构中保持设置,并且可以使用以前未被操作的操作数重新处理操作,则处理器可以重置。
-
公开(公告)号:US20120272046A1
公开(公告)日:2012-10-25
申请号:US13535685
申请日:2012-06-28
申请人: Stephan Jourdan , Michael Fetterman , Michael Cornaby , Per Hammarlund , Ronak Signhal , Glenn Hinton
发明人: Stephan Jourdan , Michael Fetterman , Michael Cornaby , Per Hammarlund , Ronak Signhal , Glenn Hinton
IPC分类号: G06F9/302
CPC分类号: G06F9/3824 , G06F9/30036 , G06F15/8084
摘要: Techniques for vector completion mask (VCM) handling are provided. A data structure includes a mask field for each operand of a particular operation. A processor attempts to execute the operation with multiple operands, which are identified in the data structure by the mask fields. If operands are successfully retrieved for execution with the operation, then the corresponding mask field within the data structure is cleared. The processor can reset if any field remains set within the data structure and can re-process the operation with operands that were not previously handled with the operation.
摘要翻译: 提供矢量完成掩码(VCM)处理技术。 数据结构包括用于特定操作的每个操作数的掩码字段。 处理器尝试通过掩码字段在数据结构中标识的多个操作数来执行操作。 如果成功检索操作数以执行操作,则数据结构中的相应掩码字段将被清除。 如果任何字段在数据结构中保持设置,并且可以使用以前未被操作的操作数重新处理操作,则处理器可以重置。
-
公开(公告)号:US08239659B2
公开(公告)日:2012-08-07
申请号:US11529850
申请日:2006-09-29
申请人: Stephan Jourdan , Michael Fetterman , Michael Cornaby , Per Hammarlund , Ronak Signhal , Glenn Hinton
发明人: Stephan Jourdan , Michael Fetterman , Michael Cornaby , Per Hammarlund , Ronak Signhal , Glenn Hinton
CPC分类号: G06F9/3824 , G06F9/30036 , G06F15/8084
摘要: Techniques for vector completion mask (VCM) handling are provided. A data structure includes a mask field for each operand of a particular operation. A processor attempts to execute the operation with multiple operands, which are identified in the data structure by the mask fields. If operands are successfully retrieved for execution with the operation, then the corresponding mask field within the data structure is cleared. The processor can reset if any field remains set within the data structure and can re-process the operation with operands that were not previously handled with the operation.
摘要翻译: 提供矢量完成掩码(VCM)处理技术。 数据结构包括用于特定操作的每个操作数的掩码字段。 处理器尝试通过掩码字段在数据结构中标识的多个操作数来执行操作。 如果成功检索操作数以执行操作,则数据结构中的相应掩码字段将被清除。 如果任何字段在数据结构中保持设置,并且可以使用以前未被操作的操作数重新处理操作,则处理器可以重置。
-
公开(公告)号:US20080082785A1
公开(公告)日:2008-04-03
申请号:US11529850
申请日:2006-09-29
申请人: Stephan Jourdan , Michael Fetterman , Michael Cornaby , Per Hammarlund , Ronak Signhal , Glenn Hinton
发明人: Stephan Jourdan , Michael Fetterman , Michael Cornaby , Per Hammarlund , Ronak Signhal , Glenn Hinton
IPC分类号: G06F15/76
CPC分类号: G06F9/3824 , G06F9/30036 , G06F15/8084
摘要: Techniques for vector completion mask (VCM) handling are provided. A data structure includes a mask field for each operand of a particular operation. A processor attempts to execute the operation with multiple operands, which are identified in the data structure by the mask fields. If operands are successfully retrieved for execution with the operation, then the corresponding mask field within the data structure is cleared. The processor can reset if any field remains set within the data structure and can re-process the operation with operands that were not previously handled with the operation.
摘要翻译: 提供矢量完成掩码(VCM)处理技术。 数据结构包括用于特定操作的每个操作数的掩码字段。 处理器尝试通过掩码字段在数据结构中标识的多个操作数来执行操作。 如果成功检索操作数以执行操作,则数据结构中的相应掩码字段将被清除。 如果任何字段在数据结构中保持设置,并且可以使用以前未被操作的操作数重新处理操作,则处理器可以重置。
-
-
-
-
-
-
-
-
-