-
公开(公告)号:EP2567313A1
公开(公告)日:2013-03-13
申请号:EP11778249.0
申请日:2011-05-04
申请人: Google Inc.
发明人: CHAMBERS, Craig D. , RANIWALA, Ashish , PERRY, Frances J. , HENRY, Robert R. , TIGANI, Jordan , ADAMS, Stephen R. , BRADSHAW, Robert , WEIZENBAUM, Nathan
CPC分类号: G06F9/45504 , G06F8/314 , G06F8/34 , G06F8/433 , G06F9/30 , G06F9/38 , G06F9/3851 , G06F9/3885 , G06F9/44 , G06F9/445 , G06F9/4494 , G06F9/45533 , G06F9/4843 , G06F16/24532 , G06F16/24547 , G06F21/577 , G06F21/62 , G06F21/6218 , G06F2221/034
摘要: A data parallel pipeline may specify multiple parallel data objects that contain multiple elements and multiple parallel operations that operate on the parallel data objects. Based on the data parallel pipeline, a dataflow graph of deferred parallel data objects and deferred parallel operations corresponding to the data parallel pipeline may be generated and one or more graph transformations may be applied to the dataflow graph to generate a revised dataflow graph that includes one or more of the deferred parallel data objects and deferred, combined parallel data operations. The deferred, combined parallel operations may be executed to produce materialized parallel data objects corresponding to the deferred parallel data objects.
摘要翻译: 数据并行流水线可以指定多个并行数据对象,其中包含对并行数据对象进行操作的多个元素和多个并行操作。 基于数据并行流水线,可以生成与数据并行流水线相对应的延迟并行数据对象和延迟并行操作的数据流图,并且可以将一个或多个图变换应用于数据流图,以生成修改后的数据流图,其中包括一个 或更多的延迟并行数据对象和延迟的并行数据操作。 可以执行延迟的组合并行操作以产生与延迟并行数据对象相对应的物化并行数据对象。
-
2.
公开(公告)号:EP3126974A1
公开(公告)日:2017-02-08
申请号:EP14752529.9
申请日:2014-08-01
申请人: Google Inc.
CPC分类号: G06F9/466 , G06F8/453 , G06F9/3867 , G06F9/3885 , G06F9/455 , G06F9/5066 , G06F11/3006 , G06F17/30463 , G06F17/30569
摘要: One example method includes identifying synchronous code including instructions specifying a computing operation to be performed on a set of data; transforming the synchronous code into a pipeline application including one or more pipeline objects; identifying a first input data set on which to execute the pipeline application; executing the pipeline application on a first input data set to produce a first output data set; after executing the pipeline application on the first input data set, identifying a second input data set on which to execute the pipeline application; determining a set of differences between the first input data set and second input data set; and executing the pipeline application on the set of differences to produce a second output data set.
摘要翻译: 一个示例性方法包括识别包括指定要针对一组数据执行的计算操作的指令的同步代码; 将同步代码转换成包括一个或多个流水线对象的流水线应用程序; 识别在其上执行流水线应用的第一输入数据集; 在第一输入数据集上执行流水线应用以产生第一输出数据集; 在第一输入数据集上执行流水线应用之后,识别执行流水线应用的第二输入数据集; 确定所述第一输入数据集和所述第二输入数据集之间的差异集合; 以及对所述差异集执行所述流水线应用以产生第二输出数据集。
-