POST-HOC MANAGEMENT OF DATASETS
    1.
    发明申请

    公开(公告)号:US20170293671A1

    公开(公告)日:2017-10-12

    申请号:US15480971

    申请日:2017-04-06

    Applicant: Google Inc.

    CPC classification number: G06F21/6218 G06F16/211 G06F16/215

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a catalog for multiple datasets, the method comprising accessing multiple extant data sets, the extant data sets including data sets that are independently generated and structurally dissimilar; organizing the data sets into collections, each data set in each collection belonging to the collection based on collection data associated with the data set; for each collection of data sets: determining, from a subset of the data sets that belong to the collection, metadata that describe the data sets that belong to the collection, wherein the metadata does not include the collection data, and attributing, to other data sets in the collection, the metadata determined from the subset of data sets; and generating, from the collections of data sets and the determined metadata, a catalog for the multiple datasets.

    BATCHING INPUTS TO A MACHINE LEARNING MODEL
    2.
    发明申请

    公开(公告)号:US20170286864A1

    公开(公告)日:2017-10-05

    申请号:US15091381

    申请日:2016-04-05

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for batching inputs to machine learning models. One of the methods includes receiving a stream of requests, each request identifying a respective input for processing by a first machine learning model; adding the respective input from each request to a first queue of inputs for processing by the first machine learning model; determining, at a first time, that a count of inputs in the first queue as of the first time equals or exceeds a maximum batch size and, in response: generating a first batched input from the inputs in the queue as of the first time so that a count of inputs in the first batched input equals the maximum batch size, and providing the first batched input for processing by the first machine learning model.

Patent Agency Ranking