Database schema for efficient data assessment

    Publication number: US10387388B2

    Publication date: 2019-08-20

    Application number: US15438434

    Application date: 2017-02-21

    Abstract: A computing system may create a database schema representing tables for data assessment, organization identification (ID), rule ID, data source ID, and data assessment record ID. The data assessment table may include a first field representing an organization ID, a second field representing a rule ID, and a third field representing a data source ID, wherein the fields correspond to records in the organization ID table, rule ID table, and data source ID table, respectively. The data assessment table may also include a fourth field representing a match status indicating whether a record in the data assessment record ID table matches a record of the data source associated with the third field, a fifth field representing a data assessment record ID and corresponding to a record in the data assessment record ID table, and a sixth field configured as a column family and representing a field of the business object.
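
The six fields of the data assessment table can be pictured as a single row type. The following is only an illustrative sketch of the layout the abstract describes, not the patented schema: the class name, field names, and the `check_references` helper are hypothetical, and the column family is approximated by a plain dictionary.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


@dataclass
class DataAssessmentRow:
    organization_id: str            # first field: refers to the organization ID table
    rule_id: str                    # second field: refers to the rule ID table
    data_source_id: str             # third field: refers to the data source ID table
    match_status: bool              # fourth field: did the record match the data source?
    data_assessment_record_id: str  # fifth field: refers to the record ID table
    # sixth field: column family approximated as a mapping of business-object fields
    business_object_fields: Dict[str, str] = field(default_factory=dict)


def check_references(row: DataAssessmentRow, organizations: Set[str],
                     rules: Set[str], data_sources: Set[str]) -> bool:
    """Return True when the first three fields point at existing records."""
    return (row.organization_id in organizations
            and row.rule_id in rules
            and row.data_source_id in data_sources)


row = DataAssessmentRow(
    organization_id="org-1",
    rule_id="rule-7",
    data_source_id="src-3",
    match_status=True,
    data_assessment_record_id="rec-42",
    business_object_fields={"company_name": "Acme"},
)
references_ok = check_references(row, {"org-1"}, {"rule-7"}, {"src-3"})
```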

    Matching objects using match rules and lookup key

    Publication number: US10235476B2

    Publication date: 2019-03-19

    Application number: US14723866

    Application date: 2015-05-28

    Abstract: Some embodiments of the present invention include a method for identifying match candidates in a database object and may include generating, by a database system, a match key associated with a lookup field of a database object. The method may further include activating, by the database system, a matching rule associated with the match key, and receiving, by the database system, a request to search for duplicate candidates in the database object. The request may include an input entity. The duplicate candidates in the database object may be identified by using the input entity and the matching rule.

    Bulk duplication detection supporting data encryption

    Publication number: US11354285B2

    Publication date: 2022-06-07

    Application number: US16848878

    Application date: 2020-04-15

    Abstract: Provided herein are systems, methods and computer program products for identifying duplicate records stored in a database system, comprising: generating a plurality of encrypted match indexes for each of a plurality of records stored in the database system, wherein each of the plurality of encrypted match indexes encrypts a value of each encryption-enabled field of a respective one of the plurality of records defined by at least one match rule; creating a cluster of records comprising at least one set containing at least two records of the plurality of records, the at least two records having respective encrypted match indexes corresponding to the at least one match rule; causing identification of duplicate records in the at least one set according to detection of records of the at least one set having respective match indexes matching the at least one match rule; and outputting an indication of the identified duplicate records.
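
The clustering step in this abstract depends on equal field values producing equal protected indexes. The sketch below illustrates that idea only; it swaps in a keyed hash (HMAC-SHA256) as a stand-in for the encryption the patent describes, and the key, the match rule fields, and the function names are all hypothetical.

```python
import hashlib
import hmac
from collections import defaultdict

SECRET_KEY = b"demo-key-not-for-production"  # hypothetical key, for illustration only
MATCH_RULE_FIELDS = ("email", "phone")       # hypothetical rule: exact match on both fields


def protected_match_index(record):
    """Deterministic keyed digest over the normalized match-rule fields."""
    normalized = "\x1f".join(record[f].strip().lower() for f in MATCH_RULE_FIELDS)
    return hmac.new(SECRET_KEY, normalized.encode("utf-8"), hashlib.sha256).hexdigest()


def duplicate_sets(records):
    """Cluster record ids by protected index; keep clusters with two or more records."""
    clusters = defaultdict(list)
    for rec in records:
        clusters[protected_match_index(rec)].append(rec["id"])
    return [ids for ids in clusters.values() if len(ids) >= 2]


records = [
    {"id": 1, "email": "a@x.com", "phone": "555-0100"},
    {"id": 2, "email": "A@X.com ", "phone": "555-0100"},
    {"id": 3, "email": "b@x.com", "phone": "555-0100"},
]
dups = duplicate_sets(records)
```

Because the comparison runs on the digests, the clustering code never needs the plaintext values once the indexes exist.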

    Matching objects using keys based on match rules

    Publication number: US09740743B2

    Publication date: 2017-08-22

    Application number: US14518145

    Application date: 2014-10-20

    CPC classification number: G06F17/30489

    Abstract: Matching objects using keys based on match rules is described. A system generates a match rule key based on a match rule, wherein the match rule specifies whether two objects match. The system creates candidate keys by applying the match rule key to data objects. The system creates a probe key by applying the match rule key to a probe object. The system determines whether the probe key matches a candidate key. The system determines whether the probe object matches a candidate object based on applying the match rule to the probe object and the candidate object if the probe key matches the candidate key corresponding to the candidate object. The system identifies the probe object and the candidate object as matching based on the match rule if the probe object matches the candidate object.
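
The key-then-rule flow in this abstract can be sketched as a two-stage lookup: a cheap key comparison narrows the candidates, and the full match rule runs only on those. This is a minimal illustration under assumed data; the match rule (same email and last name), the key derivation, and every function name are hypothetical.

```python
from collections import defaultdict


def match_rule_key(record):
    """Hypothetical key derived from the rule: email domain plus last-name initial."""
    domain = record["email"].strip().lower().split("@")[-1]
    return (domain, record["last_name"].strip().lower()[:1])


def match_rule(a, b):
    """Hypothetical full rule: same email and same last name, ignoring case."""
    return (a["email"].strip().lower() == b["email"].strip().lower()
            and a["last_name"].strip().lower() == b["last_name"].strip().lower())


def build_candidate_index(objects):
    """Create candidate keys by applying the match rule key to each data object."""
    index = defaultdict(list)
    for obj in objects:
        index[match_rule_key(obj)].append(obj)
    return index


def find_matches(probe, index):
    """Apply the full rule only to candidates whose key equals the probe key."""
    probe_key = match_rule_key(probe)
    return [c for c in index.get(probe_key, []) if match_rule(probe, c)]


data = [
    {"id": 1, "email": "Ann.Lee@example.com", "last_name": "Lee"},
    {"id": 2, "email": "al@example.com", "last_name": "Lewis"},
    {"id": 3, "email": "ann.lee@other.org", "last_name": "Lee"},
]
index = build_candidate_index(data)
probe = {"email": "ann.lee@example.com ", "last_name": "lee"}
matched_ids = [c["id"] for c in find_matches(probe, index)]
```

In this example records 1 and 2 share the probe's key, so the full rule is evaluated twice rather than three times, and only record 1 passes it.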

    Optimized subset processing for de-duplication

    Publication number: US10901996B2

    Publication date: 2021-01-26

    Application number: US15052556

    Application date: 2016-02-24

    Abstract: Some embodiments of the present invention include a method for identifying duplicate records from a group of records in a database system. The method includes generating a cluster of records from a group of records based on one or more keys; splitting the cluster of records into multiple subsets of records, with each subset having fewer records than the cluster, wherein the splitting is based on the number of records in the cluster exceeding a threshold; causing duplicate sets of records in each of the subsets of records to be identified, wherein a duplicate set of records includes one or more records, and wherein when a duplicate set of records includes two or more records, the two or more records are duplicates of one another; merging all of the duplicate sets of records identified from the multiple subsets of records to form a first group of duplicate sets of records; and forming a representative set of records by selecting a representative record from each of the duplicate sets in the first group of duplicate sets of records.
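
The split, detect, merge, and select steps can be sketched in a few functions. This is an illustration under simplifying assumptions, not the patented method: the fixed-size chunking, the greedy grouping, the choice of the first record as representative, and all names are hypothetical.

```python
def split_cluster(cluster, threshold):
    """Split only when the cluster exceeds the threshold (threshold must be >= 1)."""
    if len(cluster) <= threshold:
        return [cluster]
    return [cluster[i:i + threshold] for i in range(0, len(cluster), threshold)]


def find_duplicate_sets(records, is_duplicate):
    """Greedy grouping: a record joins the first set whose first member it duplicates."""
    sets = []
    for rec in records:
        for existing in sets:
            if is_duplicate(existing[0], rec):
                existing.append(rec)
                break
        else:
            sets.append([rec])  # a duplicate set may hold a single record
    return sets


def representative_set(cluster, threshold, is_duplicate):
    """Merge the duplicate sets of every subset, then keep one record per set."""
    merged = []
    for subset in split_cluster(cluster, threshold):
        merged.extend(find_duplicate_sets(subset, is_duplicate))
    return [dup_set[0] for dup_set in merged]


names = ["acme", "acme", "apex", "acme", "apex", "atlas"]
cluster = [{"id": i, "name": n} for i, n in enumerate(names)]
reps = representative_set(cluster, threshold=3,
                          is_duplicate=lambda a, b: a["name"] == b["name"])
rep_ids = [r["id"] for r in reps]
```

Here the six-record cluster is split into two subsets of three. Record 1 is folded into record 0 within the first subset, while records 3 and 4 remain as representatives because their duplicates sit in the other subset; the representative set is smaller than the cluster and could be compared again in a later pass.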

    Using row value constructor (RVC) based queries to group records of a database for multi-thread execution

    Publication number: US10671604B2

    Publication date: 2020-06-02

    Application number: US15884322

    Application date: 2018-01-30

    Abstract: Methods for splitting a plurality of records into subsets for parallel processing by separate threads are disclosed. In one embodiment, a plurality of records are added to a database. The adding includes tracking, for each unique value found in the plurality of records within a selected one of the sequence of fields, a statistic based on the values found in the plurality of records within a next one of the sequence of fields. Then a plurality of queries are generated, each including a row value constructor (RVC) with a logical operation, a specific one of the unique values found within the selected one of the sequence of fields, and, for that specific value, the tracked statistic. Then a plurality of threads are caused to execute, each submitting one of the queries and operating on the subset of the plurality of records returned in response to that query.
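
The abstract's three stages (track a statistic while inserting, build one RVC query per unique value, run each query on its own thread) can be sketched with SQLite, which supports row value comparisons from version 3.15. Everything specific here is an assumption made for illustration: the `records` table, the `org` and `seq` fields, and the choice of per-value minimum and maximum as the tracked statistic.

```python
import os
import sqlite3
import tempfile
from concurrent.futures import ThreadPoolExecutor

# A file-backed database so that each thread can open its own connection.
db_path = os.path.join(tempfile.mkdtemp(), "records.db")
rows = [("org_a", 1), ("org_a", 2), ("org_a", 7),
        ("org_b", 3), ("org_b", 4), ("org_c", 9)]

# Add the records while tracking, per unique `org`, the min and max of `seq`.
stats = {}
conn = sqlite3.connect(db_path)
conn.execute("CREATE TABLE records (org TEXT, seq INTEGER, PRIMARY KEY (org, seq))")
for org, seq in rows:
    conn.execute("INSERT INTO records VALUES (?, ?)", (org, seq))
    lo, hi = stats.get(org, (seq, seq))
    stats[org] = (min(lo, seq), max(hi, seq))
conn.commit()
conn.close()

# One query per unique value: an RVC comparison bounded by the tracked statistic.
SQL = ("SELECT org, seq FROM records "
       "WHERE (org, seq) >= (?, ?) AND (org, seq) <= (?, ?) "
       "ORDER BY org, seq")
queries = [(SQL, (org, lo, org, hi)) for org, (lo, hi) in sorted(stats.items())]


def run_query(query):
    """Executed by a worker thread: submit one query and return its subset."""
    sql, params = query
    worker_conn = sqlite3.connect(db_path)
    try:
        return worker_conn.execute(sql, params).fetchall()
    finally:
        worker_conn.close()


with ThreadPoolExecutor(max_workers=len(queries)) as pool:
    subsets = list(pool.map(run_query, queries))
```

Each thread receives a disjoint subset, and the row value comparison follows the ordering of the composite primary key on `(org, seq)`.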

    Using row value constructor (RVC) based queries to group records of a database for multi-thread execution

    Publication number: US11256690B2

    Publication date: 2022-02-22

    Application number: US16855671

    Application date: 2020-04-22

    Abstract: Methods for splitting a plurality of records into subsets for parallel processing by separate threads are disclosed. In one embodiment, a plurality of records are added to a database. The adding includes tracking, for each unique value found in the plurality of records within a selected one of the sequence of fields, a statistic based on the values found in the plurality of records within a next one of the sequence of fields. Then a plurality of queries are generated, each including a row value constructor (RVC) with a logical operation, a specific one of the unique values found within the selected one of the sequence of fields, and, for that specific value, the tracked statistic. Then a plurality of threads are caused to execute, each submitting one of the queries and operating on the subset of the plurality of records returned in response to that query.

    Methods and systems for protecting computing resources

    Publication number: US10237336B2

    Publication date: 2019-03-19

    Application number: US15388845

    Application date: 2016-12-22

    Abstract: Some embodiments of the present invention include a method for controlling requests to be transmitted to a server computing system. The method includes generating, by a first requester, a request to be transmitted to a first server computing system, the first requester being part of a first group of requesters, wherein each of the requesters in the first group is configured to access an indicator to determine whether the first server computing system is ready to receive its request, the indicator being set or reset at least based on a response received from the first server computing system, and the indicator being stored in a memory device associated with a second server computing system; and, based on the indicator indicating that the first server computing system is not ready, delaying, by the first requester, transmission of the request to the first server computing system.
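
The shared-indicator idea can be sketched with two requesters that consult one flag before contacting a server. This is an illustration only: the classes, the "busy"/"ok" responses, and the cool-down after which the indicator counts as ready again are hypothetical additions, and the indicator lives in local memory here rather than on a second server.

```python
class SharedIndicator:
    """Stands in for the indicator kept in memory on a second server (e.g. a cache)."""

    def __init__(self, cooldown):
        self.cooldown = cooldown      # assumed: seconds to hold off after a "busy" reply
        self.not_ready_until = 0.0

    def is_ready(self, now):
        return now >= self.not_ready_until

    def record_response(self, response, now):
        # Set or reset the indicator based on the first server's response.
        self.not_ready_until = now + self.cooldown if response == "busy" else 0.0


class FlakyServer:
    """Hypothetical first server: reports overload for its first `busy_calls` requests."""

    def __init__(self, busy_calls):
        self.busy_calls = busy_calls
        self.calls = 0

    def handle(self, request):
        self.calls += 1
        return "busy" if self.calls <= self.busy_calls else "ok"


class Requester:
    def __init__(self, indicator, server):
        self.indicator = indicator
        self.server = server

    def try_send(self, request, now):
        if not self.indicator.is_ready(now):
            return "delayed"          # hold the request instead of transmitting it
        response = self.server.handle(request)
        self.indicator.record_response(response, now)
        return response


indicator = SharedIndicator(cooldown=5.0)
server = FlakyServer(busy_calls=1)
first, second = Requester(indicator, server), Requester(indicator, server)
log = [
    first.try_send("r1", now=0.0),   # reaches the server, which reports overload
    second.try_send("r2", now=1.0),  # indicator says not ready, so the request waits
    second.try_send("r2", now=6.0),  # cool-down elapsed, request goes through
]
```

Passing `now` explicitly keeps the example deterministic; a real requester would read a clock and the indicator over the network.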

    Matching objects using keys based on match rules

    Invention application (status: granted)

    Publication number: US20160110354A1

    Publication date: 2016-04-21

    Application number: US14518145

    Application date: 2014-10-20

    CPC classification number: G06F17/30489

    Abstract: Matching objects using keys based on match rules is described. A system generates a match rule key based on a match rule, wherein the match rule specifies whether two objects match. The system creates candidate keys by applying the match rule key to data objects. The system creates a probe key by applying the match rule key to a probe object. The system determines whether the probe key matches a candidate key. The system determines whether the probe object matches a candidate object based on applying the match rule to the probe object and the candidate object if the probe key matches the candidate key corresponding to the candidate object. The system identifies the probe object and the candidate object as matching based on the match rule if the probe object matches the candidate object.
