Linking Data Elements Based on Similarity Data Values and Semantic Annotations
    2.
    发明申请
    Linking Data Elements Based on Similarity Data Values and Semantic Annotations 审中-公开
    基于相似性数据值和语义​​注释链接数据元素

    公开(公告)号:US20130332466A1

    公开(公告)日:2013-12-12

    申请号:US13491724

    申请日:2012-06-08

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864

    摘要: Data elements from data sources and having a data value set are linked by using hash functions to determine a dimensionally reduced instance signature for each data element based on all data values associated with that data element to yield a plurality of dimensionally reduced instance signatures of equivalent fixed size such that similarities among the data values in the data value sets across all data elements is maintained among the plurality of instance signatures. Candidate pairs of data elements to link are identified using the plurality of instance signatures in locality sensitive hash functions, and a similarity index is generated for each candidate pair using a pre-determined measure of similarity. Candidate pairs of data elements having a similarity index above a given threshold are linked.

    摘要翻译: 来自数据源并且具有数据值集合的数据元素通过使用散列函数来链接,以基于与该数据元素相关联的所有数据值来确定每个数据元素的尺寸上减小的实例签名,以产生多个等距固定的尺寸缩小的实例签名 大小,使得在多个实例签名之间保持跨所有数据元素的数据值中的数据值之间的相似性。 使用位置敏感哈希函数中的多个实例签名来识别要链接的候选数据元素对,并且使用预定的相似度测量为每个候选对生成相似性索引。 具有高于给定阈值的相似性指数的候选对的数据元素被链接。

    Automatically Reviewing Information Mappings Across Different Information Models
    4.
    发明申请
    Automatically Reviewing Information Mappings Across Different Information Models 有权
    自动查看不同信息模型的信息映射

    公开(公告)号:US20120036110A1

    公开(公告)日:2012-02-09

    申请号:US12851963

    申请日:2010-08-06

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30289

    摘要: A computer-implemented method, system, and program product for automatically reviewing a mapping between information models. The method includes: receiving a mapping between an element in the first information model to an element in the second information model. Each element is associated with an element identifier and an element value, and the mapping signifies a relationship between the element in the first information model and the element in the second information model. The method further includes comparing the received mapping against one or more known indications of suspicious mappings to determine if the received mapping resembles one of the indications of suspicious mappings. If the received mapping is determined to be suspicious, identifying the received mapping as one that requires review.

    摘要翻译: 一种计算机实现的方法,系统和程序产品,用于自动查看信息模型之间的映射。 该方法包括:接收第一信息模型中的元素与第二信息模型中的元素之间的映射。 每个元素与元素标识符和元素值相关联,并且映射表示第一信息模型中的元素与第二信息模型中的元素之间的关系。 该方法还包括将接收的映射与可疑映射的一个或多个已知指示进行比较,以确定所接收的映射是否类似于可疑映射的指示之一。 如果接收到的映射被确定为可疑,则将所接收的映射标识为需要审查的映射。

    Automatically reviewing information mappings across different information models
    5.
    发明授权
    Automatically reviewing information mappings across different information models 有权
    自动查看不同信息模型的信息映射

    公开(公告)号:US09330115B2

    公开(公告)日:2016-05-03

    申请号:US12851963

    申请日:2010-08-06

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/30289

    摘要: A computer-implemented method, system, and program product for automatically reviewing a mapping between information models. The method includes: receiving a mapping between an element in the first information model to an element in the second information model. Each element is associated with an element identifier and an element value, and the mapping signifies a relationship between the element in the first information model and the element in the second information model. The method further includes comparing the received mapping against one or more known indications of suspicious mappings to determine if the received mapping resembles one of the indications of suspicious mappings. If the received mapping is determined to be suspicious, identifying the received mapping as one that requires review.

    摘要翻译: 一种计算机实现的方法,系统和程序产品,用于自动查看信息模型之间的映射。 该方法包括:接收第一信息模型中的元素与第二信息模型中的元素之间的映射。 每个元素与元素标识符和元素值相关联,并且映射表示第一信息模型中的元素与第二信息模型中的元素之间的关系。 该方法还包括将接收的映射与可疑映射的一个或多个已知指示进行比较,以确定所接收的映射是否类似于可疑映射的指示之一。 如果接收到的映射被确定为可疑,则将所接收的映射标识为需要审查的映射。