MATCHING OBJECTS USING MATCH RULES AND LOOKUP KEY
    1.
    发明申请
    MATCHING OBJECTS USING MATCH RULES AND LOOKUP KEY 审中-公开
    使用匹配规则和查找关键词匹配对象

    公开(公告)号:US20160350440A1

    公开(公告)日:2016-12-01

    申请号:US14723866

    申请日:2015-05-28

    CPC classification number: G06F17/30979 G06F17/30303

    Abstract: Some embodiments of the present invention include a method for identifying match candidates in a database object and may include generating, by a database system, a match key associated with a lookup field of a database object. The method may further include activating, by the database system, a matching rule associated with the match key, and receiving, by the database system, a request to search for duplicate candidates in the database object. The request may include an input entity. The duplicate candidates in the database object may be identified by using the input entity and the matching rule.

    Abstract translation: 本发明的一些实施例包括用于识别数据库对象中的匹配候选的方法,并且可以包括由数据库系统生成与数据库对象的查找字段相关联的匹配密钥。 该方法可以进一步包括由数据库系统激活与匹配关键字相关联的匹配规则,以及由数据库系统接收在数据库对象中搜索重复候选的请求。 请求可以包括输入实体。 可以通过使用输入实体和匹配规则来识别数据库对象中的重复候选。

    METHODS AND SYSTEMS FOR PERFORMING DATA ASSESSMENT

    公开(公告)号:US20180189792A1

    公开(公告)日:2018-07-05

    申请号:US15397824

    申请日:2017-01-04

    CPC classification number: G06Q30/01 G06F16/215

    Abstract: Some embodiments of the present invention include a method for performing data assessment. The method includes receiving, by a first server computing system, a request to assess first data stored in a storage device associated with a second server computing system, the request including one or more rules indicating how the first data is to be assessed; performing, by the first server computing system, an assessment of the first data based on the one or more rules and based on a data source associated with the first server computing system, the data source including data provided by one or more data providers, the assessment of the first data including assessing outdated information and missing information based on the data source; and providing, by the first server computing system, an assessment result from the assessment of the first data and an option to engage in an enrichment of the first data based on the assessment result and the data source.

    DATABASE SCHEMA FOR EFFICIENT DATA ASSESSMENT

    公开(公告)号:US20180189330A1

    公开(公告)日:2018-07-05

    申请号:US15438434

    申请日:2017-02-21

    CPC classification number: G06F16/211 G06F16/2282

    Abstract: A computing system may create a database schema representing tables for data assessment, organization identification (ID), rule ID, data source ID, and data assessment ID. The data assessment table may include a first field representing an organization ID, a second field representing a rule ID, a third field representing a data source ID, wherein the fields correspond to a records in the organization ID table, rule ID table and data source ID table, respectively. The data assessment table may also include a fourth field representing a match status indicating whether a record in data assessment ID table matches a record of a data source associated with the third field, a fifth field representing a data assessment record ID and corresponding to a record in data assessment record ID table, and a sixth field configured as a column family and representing a field of the business object.

    USING ROW VALUE CONSTRUCTOR (RVC) BASED QUERIES TO GROUP RECORDS OF A DATABASE FOR MULTI-THREAD EXECUTION

    公开(公告)号:US20190236184A1

    公开(公告)日:2019-08-01

    申请号:US15884322

    申请日:2018-01-30

    Abstract: Methods for splitting a plurality of records into subsets for parallel processing by separate threads are disclosed. In one embodiment, a plurality of records are added to a database. The adding includes tracking, for each unique value found in the plurality of records within a selected one of the sequence of fields, a statistic based on the values found in the plurality of records within a next one of the sequence of fields. Then a plurality of queries are generated, each includes a row value constructor (RVC) with a logical operation, a specific one of the unique values found within the selected one of the sequence of fields, and, for that specific value, the tracked statistic. Then a plurality of threads are caused to execute that each submit one of the queries and operate on a subset of the plurality of records responsive to execution of that query.

    MERGING MULTIPLE GROUPS OF RECORDS CONTAINING DUPLICATES

    公开(公告)号:US20170293629A1

    公开(公告)日:2017-10-12

    申请号:US15094232

    申请日:2016-04-08

    CPC classification number: G06F16/1748 G06F16/2365 G06F16/24556

    Abstract: The method includes generating a first graphical representation of a first group of records, including a root node associated with a first representative record of a duplicate record in the first group, the first graphical representation associated with a rank corresponding to a height of the first graphical representation. The method further includes forming a map to reflect the first graphical representation and first representative record. A second graphical representation of a second group of records is generated including a root node associated with a second representative record in the second group. The method compares each of the records in the second group of records with the map to determine if there is a sharing of a duplicate record in the second group with the first group. The first and second graphical representations are merged based on the second group of records sharing a duplicate record with the first group.

    OPTIMIZED SUBSET PROCESSING FOR DE-DUPLICATION

    公开(公告)号:US20170242891A1

    公开(公告)日:2017-08-24

    申请号:US15052556

    申请日:2016-02-24

    CPC classification number: G06F16/24556 G06F7/32 G06F16/2455 G06F16/285

    Abstract: Some embodiments of the present invention include a method for identifying duplicate records from a group of records in a database system. The method includes generating a cluster of records from a group of records based on one or more keys; splitting the cluster of records into multiple subsets of records with each subset of records having fewer number of records than the cluster of records, wherein the splitting the cluster of records into multiple subsets of records is based on a number of records in the cluster of records exceeding a threshold; causing duplicate sets of records in each of the subsets of records to be identified, wherein a duplicate set of records includes one or more records, and wherein when a duplicate set of records includes two or more records, the two or more records are duplicates of one another; merging all of the duplicate sets of records identified from the multiple subsets of records forming a first group of duplicate sets of records; and forming a representative set of records based on selecting a representative record from each of the duplicate sets in the first group of duplicate sets of records.

    BULK DUPLICATION DETECTION SUPPORTING DATA ENCRYPTION

    公开(公告)号:US20210326311A1

    公开(公告)日:2021-10-21

    申请号:US16848878

    申请日:2020-04-15

    Abstract: Provided herein are system, methods and computer program products for identifying duplicate records stored in a database system, comprising: generating a plurality of encrypted match indexes for each of a plurality of records stored in the database system, each of the plurality of encrypted match indexes encrypts a value of each encryption enabled field of a respective one of the plurality of records defined by at least one match rule, creating a cluster of records comprising at least one set containing at least two records of the plurality of records, the at least two records having respective encrypted match indexes corresponding to the at least one match rule, causing identification of duplicate records in the at least one set according to detection of records of the at least one set having respective match indexes matching the at least one match rule, and outputting an indication of the identified duplicate records.

    METHODS AND SYSTEMS FOR PROTECTING COMPUTING RESOURCES

    公开(公告)号:US20180183671A1

    公开(公告)日:2018-06-28

    申请号:US15388845

    申请日:2016-12-22

    CPC classification number: H04L67/10 H04L43/0817

    Abstract: Some embodiments of the present invention include a method for controlling requests to be transmitted to a server computing system. The method generating, by a first requester, a request to be transmitted to a first server computing system, the first requester being part of a first group of requesters, each of the requesters in the first group of requesters is configured to access an indicator to determine whether the first server computing system is ready to receive its request, the indicator being set or reset at least based on a response received from the first server computing system, the indicator stored in a memory device associated with a second server computing system; and based on the indicator indicating that the first server computing system is not ready, delaying from transmitting, by the first requester, the request to the first server computing system.

    MERGING ALONG OBJECT HIERARCHIES
    9.
    发明申请

    公开(公告)号:US20180096003A1

    公开(公告)日:2018-04-05

    申请号:US15282486

    申请日:2016-09-30

    Abstract: Some embodiments of the present invention include a method for merging nodes in hierarchies and include receiving, by a database system, a request to merge a first node in a first hierarchy of a plurality of nodes into a second node, the first node in the first hierarchy having child nodes; determining, by the database system, whether the merge request violates each of a plurality of constraints, the plurality of constraints comprising circular dependency of nodes, deleted node and ordering of nodes; and re-parenting, by the database system, in response to a determination that the merge request does not violate each of the plurality of constraints, the child nodes of the first node in the first hierarchy of nodes to the second node in response to the merging of the first node into the second node.

    CROSS OBJECTS DE-DUPLICATION
    10.
    发明申请

    公开(公告)号:US20170286441A1

    公开(公告)日:2017-10-05

    申请号:US15085588

    申请日:2016-03-30

    CPC classification number: G06F16/1748 G06F16/2365

    Abstract: Some embodiments of the present invention include a method for determining duplicate records in multiple objects and may include combining records associated with a first object with records associated with a second object to generate a third object, wherein the first object is related to the second object; performing de-duplication on the third object to generate a combined group of duplicate sets; and from the combined group of duplicate sets, identifying at least one duplicate set associated with both the first object and the second object based on the duplicate set having at least one record associated with the first object and at least one record associated with the second object.

Patent Agency Ranking