Automatic generation of composite datasets based on hierarchical fields
    1.
    发明授权
    Automatic generation of composite datasets based on hierarchical fields 有权
    基于分层字段自动生成复合数据集

    公开(公告)号:US09542446B1

    公开(公告)日:2017-01-10

    申请号:US14996179

    申请日:2016-01-14

    IPC分类号: G06F17/30

    摘要: Datasets are annotated with metadata including categories. Each category corresponds to one or more fields. A hierarchy mapping is generated to indicate a hierarchical relationship between different categories. A natural language query specifies a first granularity level indicating a particular category and one or more field values corresponding to the particular category. Based on the hierarchy mapping, one or more categories that are hierarchically related to the particular category are identified. Based on the metadata, two or more datasets that include at least one hierarchically related category is selected. Based on the first granularity level, one or more dataset filters are generated. The one or more dataset filters are translated to a second granularity level corresponding to the at least one hierarchically related category. The translated filters are applied to at least one of the selected datasets. The two or more datasets are joined to generate a composite dataset.

    摘要翻译: 数据集用包含类别的元数据进行注释。 每个类别对应一个或多个字段。 生成层次映射以指示不同类别之间的层次关系。 自然语言查询指定指示特定类别的第一粒度级别和对应于特定类别的一个或多个字段值。 基于层次映射,识别与特定类别分层相关的一个或多个类别。 基于元数据,选择包括至少一个层级相关类别的两个或多个数据集。 基于第一粒度级别,生成一个或多个数据集过滤器。 一个或多个数据集过滤器被转换为对应于至少一个层级相关类别的第二粒度级别。 已翻译的过滤器应用于所选数据集中的至少一个。 连接两个或更多数据集以生成复合数据集。

    Automatic generation of composite datasets based on hierarchical fields

    公开(公告)号:US10678860B1

    公开(公告)日:2020-06-09

    申请号:US15282780

    申请日:2016-09-30

    摘要: Datasets are annotated with metadata including categories. Each category corresponds to one or more fields. A hierarchy mapping is generated to indicate a hierarchical relationship between different categories. A natural language query specifies a first granularity level indicating a particular category and one or more field values corresponding to the particular category. Based on the hierarchy mapping, one or more categories that are hierarchically related to the particular category are identified. Based on the metadata, two or more datasets that include at least one hierarchically related category is selected. Based on the first granularity level, one or more dataset filters are generated. The one or more dataset filters are translated to a second granularity level corresponding to the at least one hierarchically related category. The translated filters are applied to at least one of the selected datasets. The two or more datasets are joined to generate a composite dataset.