-
Publication No.: US20170293671A1
Publication Date: 2017-10-12
Application No.: US15480971
Application Date: 2017-04-06
Applicant: Google Inc.
Inventor: Philip Korn, Steven Euijong Whang, Natalya Fridman Noy, Sudip Roy, Neoklis Polyzotis, Alon Yitzchak Halevy, Christopher Olston
CPC classification number: G06F21/6218, G06F16/211, G06F16/215
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a catalog for multiple datasets, the method comprising accessing multiple extant data sets, the extant data sets including data sets that are independently generated and structurally dissimilar; organizing the data sets into collections, each data set in each collection belonging to the collection based on collection data associated with the data set; for each collection of data sets: determining, from a subset of the data sets that belong to the collection, metadata that describe the data sets that belong to the collection, wherein the metadata does not include the collection data, and attributing, to other data sets in the collection, the metadata determined from the subset of data sets; and generating, from the collections of data sets and the determined metadata, a catalog for the multiple datasets.
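Illustrative sketch (not part of the patent record): the steps in the abstract above can be approximated in a few lines of Python. Everything here is an assumption made for illustration: the data sets are plain dicts, the "collection data" is taken to be the parent directory of a hypothetical `path` field, the metadata is a set of hypothetical `fields` names, and the "subset" is simply the first `sample_size` members of each collection. The patent does not prescribe any of these choices.

```python
from collections import defaultdict
from typing import Callable, Dict, List


def dir_of(dataset: dict) -> str:
    """Hypothetical collection data: the parent directory of the data set's path."""
    return dataset["path"].rsplit("/", 1)[0]


def union_fields(subset: List[dict]) -> dict:
    """Hypothetical metadata: the sorted union of field names seen in the subset."""
    names = {f for ds in subset for f in ds.get("fields", [])}
    return {"fields": sorted(names)}


def build_catalog(
    datasets: List[dict],
    collection_key: Callable[[dict], str] = dir_of,
    infer_metadata: Callable[[List[dict]], dict] = union_fields,
    sample_size: int = 1,
) -> Dict[str, dict]:
    """Group data sets into collections, infer metadata from a subset of each
    collection, and attribute that metadata to every member of the collection."""
    # Step 1: organize the data sets into collections by their collection data.
    collections: Dict[str, List[dict]] = defaultdict(list)
    for ds in datasets:
        collections[collection_key(ds)].append(ds)

    # Steps 2 and 3: per collection, derive metadata from a subset only,
    # then attribute it to all members, including those not inspected.
    catalog: Dict[str, dict] = {}
    for key, members in collections.items():
        metadata = infer_metadata(members[:sample_size])
        for ds in members:
            catalog[ds["path"]] = {"collection": key, "metadata": metadata}
    return catalog
```

In this sketch a data set that carries no field information of its own still receives the metadata inferred from the sampled member of its collection, which is the attribution step the abstract describes.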
-
Publication No.: US20170286864A1
Publication Date: 2017-10-05
Application No.: US15091381
Application Date: 2016-04-05
Applicant: Google Inc.
Inventor: Noah Fiedel, Christopher Olston, Jeremiah Harmsen
IPC: G06N99/00
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for batching inputs to machine learning models. One of the methods includes receiving a stream of requests, each request identifying a respective input for processing by a first machine learning model; adding the respective input from each request to a first queue of inputs for processing by the first machine learning model; determining, at a first time, that a count of inputs in the first queue as of the first time equals or exceeds a maximum batch size and, in response: generating a first batched input from the inputs in the queue as of the first time so that a count of inputs in the first batched input equals the maximum batch size, and providing the first batched input for processing by the first machine learning model.
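Illustrative sketch (not part of the patent record): the size-triggered batching described in the abstract above might look roughly like the following. The class name `BatchingQueue`, its method names, and the representation of the model as a plain callable over a list of inputs are all assumptions made for illustration. The sketch covers only the "queue count equals or exceeds the maximum batch size" branch and ignores timeouts, threading, and multiple models.

```python
from collections import deque
from typing import Any, Callable, Deque, List, Optional


class BatchingQueue:
    """Queues inputs for one model and dispatches fixed-size batches."""

    def __init__(self, model: Callable[[List[Any]], Any], max_batch_size: int):
        if max_batch_size < 1:
            raise ValueError("max_batch_size must be at least 1")
        self._model = model  # hypothetical stand-in for the machine learning model
        self._max_batch_size = max_batch_size
        self._queue: Deque[Any] = deque()

    def enqueue(self, item: Any) -> None:
        """Add the input from one request to the queue."""
        self._queue.append(item)

    def pending(self) -> int:
        """Number of inputs currently waiting in the queue."""
        return len(self._queue)

    def maybe_dispatch(self) -> Optional[Any]:
        """If the queue holds at least max_batch_size inputs, form a batch of
        exactly max_batch_size inputs and run the model on it; otherwise do
        nothing and return None."""
        if len(self._queue) < self._max_batch_size:
            return None
        # Take the oldest inputs first so requests are served in arrival order.
        batch = [self._queue.popleft() for _ in range(self._max_batch_size)]
        return self._model(batch)
```

Inputs beyond the maximum batch size stay in the queue for a later batch, so a batch never exceeds the configured size.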
-