CROSS OBJECTS DE-DUPLICATION
    11.
    Invention application

    Publication number: US20170286441A1

    Publication date: 2017-10-05

    Application number: US15085588

    Application date: 2016-03-30

    CPC classification number: G06F16/1748 G06F16/2365

    Abstract: Some embodiments of the present invention include a method for determining duplicate records in multiple objects and may include combining records associated with a first object with records associated with a second object to generate a third object, wherein the first object is related to the second object; performing de-duplication on the third object to generate a combined group of duplicate sets; and from the combined group of duplicate sets, identifying at least one duplicate set associated with both the first object and the second object based on the duplicate set having at least one record associated with the first object and at least one record associated with the second object.
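The three steps in this abstract (combine, de-duplicate, filter for sets spanning both objects) can be illustrated with a minimal sketch. It is not the patented implementation: the function name `cross_object_duplicates`, the `key` callback, and the use of a normalized email as the match rule are all assumptions made for illustration, since the abstract does not say how records are matched.

```python
from collections import defaultdict

def cross_object_duplicates(first, second, key):
    """Return duplicate sets that contain records from both objects.

    `key` is a hypothetical matching rule: records with equal keys are
    treated as duplicates. The abstract does not specify the real rule.
    """
    # Step 1: combine both objects into a "third object", tagging each
    # record with the object it came from.
    combined = [("first", r) for r in first] + [("second", r) for r in second]

    # Step 2: de-duplicate the combined object by grouping on the match key.
    groups = defaultdict(list)
    for source, record in combined:
        groups[key(record)].append((source, record))
    duplicate_sets = [g for g in groups.values() if len(g) > 1]

    # Step 3: keep only sets with at least one record from each object.
    return [
        g for g in duplicate_sets
        if {source for source, _ in g} == {"first", "second"}
    ]

# Illustrative data: "leads" and "contacts" are assumed example objects.
leads = [
    {"id": "L1", "email": "a@x.com"},
    {"id": "L2", "email": "A@x.com "},
    {"id": "L3", "email": "b@x.com"},
]
contacts = [
    {"id": "C1", "email": "b@x.com"},
    {"id": "C2", "email": "c@x.com"},
]
normalize_email = lambda r: r["email"].strip().lower()
cross_sets = cross_object_duplicates(leads, contacts, normalize_email)
```

With this data, L1 and L2 form a duplicate set inside the first object only, so they are filtered out; the one cross-object set is the pair L3 and C1.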

    BULK DEDUPLICATION DETECTION
    12.
    Invention application

    Publication number: US20170242868A1

    Publication date: 2017-08-24

    Application number: US15052382

    Application date: 2016-02-24

    CPC classification number: G06F17/30303 G06F7/32 G06F17/30489 G06F17/30598

    Abstract: Some embodiments of the present invention include a system and method for removing duplicate records from a group of records in a database system. The method includes generating a first cluster of records from the group of records, generating a second cluster of records from the group of records, identifying sets of duplicate records in the first cluster of records, and identifying sets of duplicate records in the second cluster of records. The method also includes merging at least two sets of duplicate records associated with both the first cluster and the second cluster of records to form a merged set of duplicate records. The merging is performed based on the at least two sets of duplicate records having a common record. Duplicate records in the group of records may then be removed by removing duplicate records from the merged set of duplicate records.
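The merge step described here, joining duplicate sets from different clusters whenever they share a record, can be sketched with a union-find structure. This is an assumed technique chosen for illustration, as the abstract does not name a data structure. The function names and the rule of keeping the smallest record ID as the survivor are likewise hypothetical.

```python
def merge_duplicate_sets(duplicate_sets):
    """Merge duplicate sets that share at least one record (union-find)."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    # Link every record in a set to the first record of that set; a record
    # that appears in two sets links those sets together.
    for dup_set in duplicate_sets:
        records = list(dup_set)
        for other in records[1:]:
            union(records[0], other)

    # Collect records under their final root.
    merged = {}
    for record in list(parent):
        merged.setdefault(find(record), set()).add(record)
    return list(merged.values())

def records_to_remove(merged_sets):
    """Keep one survivor per merged set (assumed rule: smallest ID)."""
    removed = set()
    for dup_set in merged_sets:
        removed |= dup_set - {min(dup_set)}
    return removed

# Duplicate sets found independently in two clusters (illustrative IDs).
sets_from_cluster_1 = [{"r1", "r2"}, {"r5", "r6"}]
sets_from_cluster_2 = [{"r2", "r3"}, {"r7", "r8"}]
merged = merge_duplicate_sets(sets_from_cluster_1 + sets_from_cluster_2)
to_remove = records_to_remove(merged)
```

Here `{"r1", "r2"}` and `{"r2", "r3"}` share `r2`, so they collapse into one merged set of three records, while the other two sets stay separate. Singleton sets never enter the result because only records that appear with at least one partner are linked.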
