Arithmetic and control unit, arithmetic and control method and parallel processor

    公开(公告)号:EP2549379B1

    公开(公告)日:2018-11-21

    申请号:EP12177054.9

    申请日:2012-07-19

    发明人: Kyo, Shorin

    IPC分类号: G06F8/41 G06F9/30 G06F17/11

    摘要: An attribute group storage unit acquires and holds attribute groups set to respective data blocks. A scenario determination unit determines respective transfer systems of the respective blocks between a memory of the lowest hierarchy and a memory of another hierarchy based on those attribute groups and a configuration of an arithmetic unit which is the parallel processor, and controls the transfer of the respective data blocks according to the determined transfer systems, and the parallel arithmetic operation corresponding to the transfer. Each of the attribute groups is necessary to determine the transfer systems, and includes one or more attributes not depending on the configuration of the parallel processor. The attribute groups of the write blocks are set assuming that each of the write blocks has already been located in the memory of another hierarchy, and is transferred to the memory of the lowest hierarchy.

    INCREMENTAL PARALLEL PROCESSING OF DATA
    3.
    发明公开
    INCREMENTAL PARALLEL PROCESSING OF DATA 审中-公开
    INKREMENTELLE PARALLELE VERARBEITUNG VON DATEN

    公开(公告)号:EP3126974A1

    公开(公告)日:2017-02-08

    申请号:EP14752529.9

    申请日:2014-08-01

    申请人: Google Inc.

    IPC分类号: G06F9/50 G06F17/30 G06F9/45

    摘要: One example method includes identifying synchronous code including instructions specifying a computing operation to be performed on a set of data; transforming the synchronous code into a pipeline application including one or more pipeline objects; identifying a first input data set on which to execute the pipeline application; executing the pipeline application on a first input data set to produce a first output data set; after executing the pipeline application on the first input data set, identifying a second input data set on which to execute the pipeline application; determining a set of differences between the first input data set and second input data set; and executing the pipeline application on the set of differences to produce a second output data set.

    摘要翻译: 一个示例性方法包括识别包括指定要针对一组数据执行的计算操作的指令的同步代码; 将同步代码转换成包括一个或多个流水线对象的流水线应用程序; 识别在其上执行流水线应用的第一输入数据集; 在第一输入数据集上执行流水线应用以产生第一输出数据集; 在第一输入数据集上执行流水线应用之后,识别执行流水线应用的第二输入数据集; 确定所述第一输入数据集和所述第二输入数据集之间的差异集合; 以及对所述差异集执行所述流水线应用以产生第二输出数据集。

    PROCÉDÉ D'OPTIMISATION DE TRAITEMENT PARALLÈLE DE DONNÉES SUR UNE PLATEFORME MATÉRIELLE.
    5.
    发明公开
    PROCÉDÉ D'OPTIMISATION DE TRAITEMENT PARALLÈLE DE DONNÉES SUR UNE PLATEFORME MATÉRIELLE. 有权
    法来优化数据并行处理在硬件平台上

    公开(公告)号:EP2805234A1

    公开(公告)日:2014-11-26

    申请号:EP13700337.2

    申请日:2013-01-17

    申请人: Thales

    IPC分类号: G06F9/45

    摘要: The invention relates to a method for optimising the parallel processing of data on a hardware platform comprising at least one calculation unit comprising a plurality of processing units capable of executing a plurality of executable tasks in parallel, wherein all the data to be processed is broken down into subsets of data, a same sequence of operations being carried out on each subset of data. The method of the invention comprises obtaining (50, 52) the maximum number of subsets of data to be processed by a same sequence of operations, and a maximum number of tasks that can be executed in parallel by a calculation unit of the hardware platform, determining (54) at least two processing partitions, each of said processing partitions corresponding to the partition of all the data into a number of data groups, and to the assignment of at least one executable task, capable of executing said sequence of operations, to each subset of data from said data group, and selecting (60, 62) the processing partition that makes it possible to obtain an optimal measurement value depending on a predetermined criterion. Programming code instructions implementing said selected processing partition are then obtained. One use of the method of the invention is the selection of an optimal hardware platform according to a measurement of execution performance.

    摘要翻译: 本发明涉及一种用于优化在硬件平台上,包括至少一个计算单元,其包括能够执行并行可执行任务的复数,worin的所有数据的处理单元的多个数据的并行处理被处理被分解 入数据的子集,操作的相同序列被执行的数据的每个子集。 发明包括获得所述的方法(50,52),以由操作的相同序列要处理的数据的子集的最大数目,和任务的最大数量也可以并行通过硬件平台的计算单元来执行, 确定性开采(54)至少两个处理分区,每个分区的对应于所有数据的分割成多个数据组的所述处理的分区,并且向至少一个可执行任务的分配,能够执行操作的所述序列的,至 从所述dataGroup时,和选择(60,62)的处理的分区确实数据的每个子集,能够得到的值最佳的测量根据预定的标准。 的编程代码指令实现所述选择的处理的分区然后获得。 一个使用本发明的方法的一个最佳硬件平台gemäß的选择以执行性能的测量。

    TILE COMMUNICATION OPERATOR
    6.
    发明公开
    TILE COMMUNICATION OPERATOR 审中-公开
    BOX通信运营商

    公开(公告)号:EP2628075A2

    公开(公告)日:2013-08-21

    申请号:EP11832992.9

    申请日:2011-09-23

    发明人: RINGSETH, Paul F.

    IPC分类号: G06F9/30 G06F9/38 G06F9/06

    CPC分类号: G06F8/453 G06F8/314

    摘要: A high level programming language provides a tile communication operator that decomposes a computational space into sub-spaces (i.e., tiles) that may be mapped to execution structures (e.g., thread groups) of data parallel compute nodes. An indexable type with a rank and element type defines the computational space. For an input indexable type, the tile communication operator produces an output indexable type with the same rank as the input indexable type and an element type that is a tile of the input indexable type. The output indexable type provides a local view structure of the computational space that enables coalescing of global memory accesses in a data parallel compute node.

    INFORMATION PROCESSING APPARATUS, COMMUNICATION METHOD, COMMUNICATION PROGRAM AND INFORMATION PROCESSING SYSTEM
    9.
    发明公开
    INFORMATION PROCESSING APPARATUS, COMMUNICATION METHOD, COMMUNICATION PROGRAM AND INFORMATION PROCESSING SYSTEM 审中-公开
    信息资源管理系统,KOMMUNIKATIONSVERFAHREN,KOMMUNIKATIONSPROGRAMM UND INFORMATIONSVERARBEITUNGSSYSTEM

    公开(公告)号:EP3021210A1

    公开(公告)日:2016-05-18

    申请号:EP15190676.5

    申请日:2015-10-20

    申请人: FUJITSU LIMITED

    IPC分类号: G06F3/06 G06F9/45

    摘要: An information processing apparatus, among a plurality of information processing apparatuses, to which one of pieces of local data is assigned, the pieces of local data having been obtained by dividing global data shared by the plurality of information processing apparatuses, includes: a storage unit that includes a first storage area sectioned into prescribed units, and stores local data; a processor that executes a process including: detecting a plurality of continuous sections to which the target local data is to be written in a second storage area that is sectioned into the prescribed units in the different information processing apparatus, on the basis of storage area information that identifies data to which the target local data corresponds in the global data; and extracting as many pieces of local data as specified by the number of the continuous sections and transmitting the data to the different information processing apparatus.

    摘要翻译: 一种信息处理装置,在多个信息处理装置中,分配了一个本地数据,通过划分由多个信息处理装置共享的全局数据而获得的本地数据,包括:存储单元 其包括被划分为规定单位的第一存储区域并且存储本地数据; 执行处理的处理器,包括:基于存储区域信息,检测在不同信息处理装置中被划分为规定单元的第二存储区域中将要写入目标本地数据的多个连续区段 其识别目标本地数据在全局数据中对应的数据; 并且提取由连续段的数量指定的多条本地数据,并将数据发送到不同的信息处理设备。