-
公开(公告)号:US20160285711A1
公开(公告)日:2016-09-29
申请号:US14931006
申请日:2015-11-03
Applicant: Google Inc.
Inventor: Tyler Akidau , Robert Bradshaw , Ben Chambers , Craig Chambers , Reuven Lax , Daniel Mills , Frances Perry
IPC: H04L12/26
Abstract: A method includes receiving data corresponding one of streaming data or batch data and a content of the received data for computation. The method also includes determining an event time of the data for slicing the data, determining a processing time to output results of the received data, and emitting at least a portion of the results of the received data based on the processing time and the event time.
Abstract translation: 一种方法包括接收对应于流数据或批数据之一的数据以及用于计算的接收数据的内容。 该方法还包括确定所述数据的事件的时间切片中的数据,确定处理时间所接收到的数据的输出结果,并发射至少基于处理时间所接收到的数据的结果的一部分,并且所述事件的时间 。
-
公开(公告)号:US20160117188A1
公开(公告)日:2016-04-28
申请号:US14989867
申请日:2016-01-07
Applicant: Google Inc.
Inventor: Robert Bradshaw , Craig D. Chambers , Ezra Cooper , Ashish Raniwala , Frances J. Perry
CPC classification number: G06F9/466 , G06F8/453 , G06F9/3867 , G06F9/3885 , G06F9/455 , G06F9/5066 , G06F11/3006 , G06F17/30463 , G06F17/30569
Abstract: One example method includes identifying synchronous code including instructions specifying a computing operation to be performed on a set of data; transforming the synchronous code into a pipeline application including one or more pipeline objects; identifying a first input data set on which to execute the pipeline application; executing the pipeline application on a first input data set to produce a first output data set; after executing the pipeline application on the first input data set, identifying a second input data set on which to execute the pipeline application; determining a set of differences between the first input data set and second input data set; and executing the pipeline application on the set of differences to produce a second output data set.
Abstract translation: 一个示例性方法包括识别包括指定要对一组数据执行的计算操作的指令的同步代码; 将同步代码转换成包括一个或多个流水线对象的流水线应用程序; 识别在其上执行管线应用的第一输入数据集; 在第一输入数据集上执行流水线应用以产生第一输出数据集; 在第一输入数据集上执行流水线应用之后,识别执行流水线应用的第二输入数据集; 确定所述第一输入数据集和所述第二输入数据集之间的差异集合; 并且对所述差异集执行流水线应用以产生第二输出数据集。
-
公开(公告)号:US20150248304A1
公开(公告)日:2015-09-03
申请号:US14622556
申请日:2015-02-13
Applicant: Google Inc.
Inventor: Craig D. Chambers , Ashish Raniwala , Frances J. Perry , Stephen R. Adams , Robert R. Henry , Robert Bradshaw , Nathan Weizenbaum
CPC classification number: G06F9/45504 , G06F8/314 , G06F8/34 , G06F8/433 , G06F9/30 , G06F9/38 , G06F9/3851 , G06F9/3885 , G06F9/44 , G06F9/445 , G06F9/4494 , G06F9/45533 , G06F9/4843 , G06F17/30445 , G06F17/30471 , G06F21/577 , G06F21/62 , G06F21/6218 , G06F2221/034
Abstract: A data parallel pipeline may specify multiple parallel data objects that contain multiple elements and multiple parallel operations that operate on the parallel data objects. Based on the data parallel pipeline, a dataflow graph of deferred parallel data objects and deferred parallel operations corresponding to the data parallel pipeline may be generated and one or more graph transformations may be applied to the dataflow graph to generate a revised dataflow graph that includes one or more of the deferred parallel data objects and deferred, combined parallel data operations. The deferred, combined parallel operations may be executed to produce materialized parallel data objects corresponding to the deferred parallel data objects.
Abstract translation: 数据并行流水线可以指定多个并行数据对象,其中包含对并行数据对象进行操作的多个元素和多个并行操作。 基于数据并行流水线,可以生成与数据并行流水线相对应的延迟并行数据对象和延迟并行操作的数据流图,并且可以将一个或多个图变换应用于数据流图,以生成修改后的数据流图,其中包括一个 或更多的延迟并行数据对象和延迟的并行数据操作。 可以执行延迟的组合并行操作以产生与延迟并行数据对象相对应的物化并行数据对象。
-
公开(公告)号:US20170255491A1
公开(公告)日:2017-09-07
申请号:US15395225
申请日:2016-12-30
Applicant: Google Inc.
Inventor: Robert Bradshaw , Rafael de Jesús Fernández Moctezuma , Daniel Mills , Samuel Green McVeety , Samuel Carl Whittle , Andrei Maksimenka , Cosmin Ionel Arad , Mark Brian Shields , Harris Samuel Nover , Manuel Alfred Fahndrich , Jeffrey Paul Gardner , Mikhail Shmulyan , Reuven Lax , Ahmet Altay , Craig D. Chambers
IPC: G06F9/50
Abstract: A job that receives as input a data stream is executed. For the job, it is iteratively determined a backlog growth over a first period of time; a backlog volume; and whether to adjust a quantity of processing resources. For each iteration for which the quantity of processing resources allocated to the job are determined to be adjusted, adjusting the quantity of processing resources allocated to the job. For each iteration for which the quantity of processing resources allocated to the job are determined not to be adjusted, maintaining the quantity of processing resources allocated to the job.
-
公开(公告)号:US20170242715A1
公开(公告)日:2017-08-24
申请号:US15483044
申请日:2017-04-10
Applicant: Google Inc.
Inventor: Craig D. Chambers , Ashish Raniwala , Frances J. Perry , Stephen R. Adams , Robert R. Henry , Robert Bradshaw , Nathan Weizenbaum
CPC classification number: G06F9/45504 , G06F8/314 , G06F8/34 , G06F8/433 , G06F9/30 , G06F9/38 , G06F9/3851 , G06F9/3885 , G06F9/44 , G06F9/445 , G06F9/4494 , G06F9/45533 , G06F9/4843 , G06F17/30445 , G06F17/30471 , G06F21/577 , G06F21/62 , G06F21/6218 , G06F2221/034
Abstract: A data parallel pipeline may specify multiple parallel data objects that contain multiple elements and multiple parallel operations that operate on the parallel data objects. Based on the data parallel pipeline, a dataflow graph of deferred parallel data objects and deferred parallel operations corresponding to the data parallel pipeline may be generated and one or more graph transformations may be applied to the dataflow graph to generate a revised dataflow graph that includes one or more of the deferred parallel data objects and deferred, combined parallel data operations. The deferred, combined parallel operations may be executed to produce materialized parallel data objects corresponding to the deferred parallel data objects.
-
公开(公告)号:US20150277965A1
公开(公告)日:2015-10-01
申请号:US14231983
申请日:2014-04-01
Applicant: GOOGLE INC.
Inventor: Robert Bradshaw , Craig D. Chambers , Ezra Cooper , Ashish Raniwala , Frances J. Perry
IPC: G06F9/46
CPC classification number: G06F9/466 , G06F8/453 , G06F9/3867 , G06F9/3885 , G06F9/455 , G06F9/5066 , G06F11/3006 , G06F17/30463 , G06F17/30569
Abstract: One example method includes identifying synchronous code including instructions specifying a computing operation to be performed on a set of data; transforming the synchronous code into a pipeline application including one or more pipeline objects; identifying a first input data set on which to execute the pipeline application; executing the pipeline application on a first input data set to produce a first output data set; after executing the pipeline application on the first input data set, identifying a second input data set on which to execute the pipeline application; determining a set of differences between the first input data set and second input data set; and executing the pipeline application on the set of differences to produce a second output data set.
Abstract translation: 一个示例性方法包括识别包括指定要对一组数据执行的计算操作的指令的同步代码; 将同步代码转换成包括一个或多个流水线对象的流水线应用程序; 识别在其上执行管线应用的第一输入数据集; 在第一输入数据集上执行流水线应用以产生第一输出数据集; 在第一输入数据集上执行流水线应用之后,识别执行流水线应用的第二输入数据集; 确定所述第一输入数据集和所述第二输入数据集之间的差异集合; 并且对所述差异集执行流水线应用以产生第二输出数据集。
-
公开(公告)号:US10037187B2
公开(公告)日:2018-07-31
申请号:US14931006
申请日:2015-11-03
Applicant: Google Inc.
Inventor: Tyler Akidau , Robert Bradshaw , Ben Chambers , Craig Chambers , Reuven Lax , Daniel Mills , Frances Perry
IPC: G06F7/00 , H04L12/70 , H04L12/841
Abstract: A method includes receiving data corresponding one of streaming data or batch data and a content of the received data for computation. The method also includes determining an event time of the data for slicing the data, determining a processing time to output results of the received data, and emitting at least a portion of the results of the received data based on the processing time and the event time.
-
公开(公告)号:US09626202B2
公开(公告)日:2017-04-18
申请号:US14622556
申请日:2015-02-13
Applicant: Google Inc.
Inventor: Craig D. Chambers , Ashish Raniwala , Frances J. Perry , Stephen R. Adams , Robert R. Henry , Robert Bradshaw , Nathan Weizenbaum
IPC: G06F9/44 , G06F9/46 , G06F9/45 , G06F9/455 , G06F21/62 , G06F9/38 , G06F9/48 , G06F9/445 , G06F17/30 , G06F9/30
CPC classification number: G06F9/45504 , G06F8/314 , G06F8/34 , G06F8/433 , G06F9/30 , G06F9/38 , G06F9/3851 , G06F9/3885 , G06F9/44 , G06F9/445 , G06F9/4494 , G06F9/45533 , G06F9/4843 , G06F17/30445 , G06F17/30471 , G06F21/577 , G06F21/62 , G06F21/6218 , G06F2221/034
Abstract: A data parallel pipeline may specify multiple parallel data objects that contain multiple elements and multiple parallel operations that operate on the parallel data objects. Based on the data parallel pipeline, a dataflow graph of deferred parallel data objects and deferred parallel operations corresponding to the data parallel pipeline may be generated and one or more graph transformations may be applied to the dataflow graph to generate a revised dataflow graph that includes one or more of the deferred parallel data objects and deferred, combined parallel data operations. The deferred, combined parallel operations may be executed to produce materialized parallel data objects corresponding to the deferred parallel data objects.
-
公开(公告)号:US09268597B2
公开(公告)日:2016-02-23
申请号:US14231983
申请日:2014-04-01
Applicant: GOOGLE INC.
Inventor: Robert Bradshaw , Craig D. Chambers , Ezra Cooper , Ashish Raniwala , Frances J. Perry
IPC: G06T1/00 , G01N33/48 , G06Q50/00 , G06F7/00 , G06F17/00 , G06F17/30 , G06F15/173 , G06F9/26 , G06F15/00 , G01R31/28 , G06F9/44 , G06F9/46
CPC classification number: G06F9/466 , G06F8/453 , G06F9/3867 , G06F9/3885 , G06F9/455 , G06F9/5066 , G06F11/3006 , G06F17/30463 , G06F17/30569
Abstract: One example method includes identifying synchronous code including instructions specifying a computing operation to be performed on a set of data; transforming the synchronous code into a pipeline application including one or more pipeline objects; identifying a first input data set on which to execute the pipeline application; executing the pipeline application on a first input data set to produce a first output data set; after executing the pipeline application on the first input data set, identifying a second input data set on which to execute the pipeline application; determining a set of differences between the first input data set and second input data set; and executing the pipeline application on the set of differences to produce a second output data set.
Abstract translation: 一个示例性方法包括识别包括指定要对一组数据执行的计算操作的指令的同步代码; 将同步代码转换成包括一个或多个流水线对象的流水线应用程序; 识别在其上执行管线应用的第一输入数据集; 在第一输入数据集上执行流水线应用以产生第一输出数据集; 在第一输入数据集上执行流水线应用之后,识别执行流水线应用的第二输入数据集; 确定所述第一输入数据集和所述第二输入数据集之间的差异集合; 并且对所述差异集执行流水线应用以产生第二输出数据集。
-
公开(公告)号:US08959499B2
公开(公告)日:2015-02-17
申请号:US14033145
申请日:2013-09-20
Applicant: Google Inc.
Inventor: Craig D. Chambers , Ashish Raniwala , Frances J. Perry , Stephen R. Adams , Robert R. Henry , Robert Bradshaw , Nathan Weizenbaum
CPC classification number: G06F9/45504 , G06F8/314 , G06F8/34 , G06F8/433 , G06F9/30 , G06F9/38 , G06F9/3851 , G06F9/3885 , G06F9/44 , G06F9/445 , G06F9/4494 , G06F9/45533 , G06F9/4843 , G06F17/30445 , G06F17/30471 , G06F21/577 , G06F21/62 , G06F21/6218 , G06F2221/034
Abstract: A data parallel pipeline may specify multiple parallel data objects that contain multiple elements and multiple parallel operations that operate on the parallel data objects. Based on the data parallel pipeline, a dataflow graph of deferred parallel data objects and deferred parallel operations corresponding to the data parallel pipeline may be generated and one or more graph transformations may be applied to the dataflow graph to generate a revised dataflow graph that includes one or more of the deferred parallel data objects and deferred, combined parallel data operations. The deferred, combined parallel operations may be executed to produce materialized parallel data objects corresponding to the deferred parallel data objects.
Abstract translation: 数据并行流水线可以指定多个并行数据对象,其中包含对并行数据对象进行操作的多个元素和多个并行操作。 基于数据并行流水线,可以生成与数据并行流水线相对应的延迟并行数据对象和延迟并行操作的数据流图,并且可以将一个或多个图变换应用于数据流图,以生成修改后的数据流图,其中包括一个 或更多的延迟并行数据对象和延迟的并行数据操作。 可以执行延迟的组合并行操作以产生与延迟并行数据对象相对应的物化并行数据对象。
-
-
-
-
-
-
-
-
-