Cross objects de-duplication
    11.
    发明授权

    公开(公告)号:US10949395B2

    公开(公告)日:2021-03-16

    申请号:US15085588

    申请日:2016-03-30

    Abstract: Some embodiments of the present invention include a method for determining duplicate records in multiple objects and may include combining records associated with a first object with records associated with a second object to generate a third object, wherein the first object is related to the second object; performing de-duplication on the third object to generate a combined group of duplicate sets; and from the combined group of duplicate sets, identifying at least one duplicate set associated with both the first object and the second object based on the duplicate set having at least one record associated with the first object and at least one record associated with the second object.

    USING ROW VALUE CONSTRUCTOR (RVC) BASED QUERIES TO GROUP RECORDS OF A DATABASE FOR MULTI-THREAD EXECUTION

    公开(公告)号:US20200250189A1

    公开(公告)日:2020-08-06

    申请号:US16855671

    申请日:2020-04-22

    Abstract: Methods for splitting a plurality of records into subsets for parallel processing by separate threads are disclosed. In one embodiment, a plurality of records are added to a database. The adding includes tracking, for each unique value found in the plurality of records within a selected one of the sequence of fields, a statistic based on the values found in the plurality of records within a next one of the sequence of fields. Then a plurality of queries are generated, each includes a row value constructor (RVC) with a logical operation, a specific one of the unique values found within the selected one of the sequence of fields, and, for that specific value, the tracked statistic. Then a plurality of threads are caused to execute that each submit one of the queries and operate on a subset of the plurality of records responsive to execution of that query.

    Methods and systems for performing data assessment

    公开(公告)号:US10733613B2

    公开(公告)日:2020-08-04

    申请号:US15397824

    申请日:2017-01-04

    Abstract: Some embodiments of the present invention include a method for performing data assessment. The method includes receiving, by a first server computing system, a request to assess first data stored in a storage device associated with a second server computing system, the request including one or more rules indicating how the first data is to be assessed; performing, by the first server computing system, an assessment of the first data based on the one or more rules and based on a data source associated with the first server computing system, the data source including data provided by one or more data providers, the assessment of the first data including assessing outdated information and missing information based on the data source; and providing, by the first server computing system, an assessment result from the assessment of the first data and an option to engage in an enrichment of the first data based on the assessment result and the data source.

    Merging along object hierarchies
    15.
    发明授权

    公开(公告)号:US10521406B2

    公开(公告)日:2019-12-31

    申请号:US15282486

    申请日:2016-09-30

    Abstract: Some embodiments of the present invention include a method for merging nodes in hierarchies and include receiving, by a database system, a request to merge a first node in a first hierarchy of a plurality of nodes into a second node, the first node in the first hierarchy having child nodes; determining, by the database system, whether the merge request violates each of a plurality of constraints, the plurality of constraints comprising circular dependency of nodes, deleted node and ordering of nodes; and re-parenting, by the database system, in response to a determination that the merge request does not violate each of the plurality of constraints, the child nodes of the first node in the first hierarchy of nodes to the second node in response to the merging of the first node into the second node.

    Bulk deduplication detection
    16.
    发明授权

    公开(公告)号:US10152497B2

    公开(公告)日:2018-12-11

    申请号:US15052382

    申请日:2016-02-24

    Abstract: Some embodiments of the present invention include a system and method for removing duplicate records from a group of records in a database system. The method includes generating a first cluster of records from the group of records, generating a second cluster of records from the group of records, identifying sets of duplicate records in the first cluster of records, and identifying sets of duplicate records in the second cluster of records. The method also includes merging at least two sets of duplicate records associated with both the first cluster and the second cluster of records to form a merged set of duplicate records. The merging is performed based on the at least two sets of duplicate records having a common record. Duplicate records in the group of records may then be removed by removing duplicate records from the merged set of duplicate records.

Patent Agency Ranking