POST-HOC MANAGEMENT OF DATASETS
    1.
    发明申请

    公开(公告)号:US20170293671A1

    公开(公告)日:2017-10-12

    申请号:US15480971

    申请日:2017-04-06

    Applicant: Google Inc.

    CPC classification number: G06F21/6218 G06F16/211 G06F16/215

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a catalog for multiple datasets, the method comprising accessing multiple extant data sets, the extant data sets including data sets that are independently generated and structurally dissimilar; organizing the data sets into collections, each data set in each collection belonging to the collection based on collection data associated with the data set; for each collection of data sets: determining, from a subset of the data sets that belong to the collection, metadata that describe the data sets that belong to the collection, wherein the metadata does not include the collection data, and attributing, to other data sets in the collection, the metadata determined from the subset of data sets; and generating, from the collections of data sets and the determined metadata, a catalog for the multiple datasets.

Patent Agency Ranking