METHOD AND SYSTEM FOR CREATING INDICES AND LOADING KEY-VALUE PAIRS FOR NOSQL DATABASES
    11.
    发明申请
    METHOD AND SYSTEM FOR CREATING INDICES AND LOADING KEY-VALUE PAIRS FOR NOSQL DATABASES 有权
    创建指标的方法和系统,并为NOSQL数据库加载键对

    公开(公告)号:US20130339366A1

    公开(公告)日:2013-12-19

    申请号:US13860220

    申请日:2013-04-10

    CPC classification number: G06F17/30587 G06F17/30303 G06F17/30321

    Abstract: Systems and methods are provided for creating indices and loading key-value pairs for NoSQL databases. Attributes are created that correspond to records in a NoSQL database based on corresponding record fields. An index is created based on the attributes. A memory is loaded with attributes that correspond to a subset of the index as keys in a key-value pair and identifiers that correspond to records that correspond to the attributes as values in the key-value pair. The attributes that correspond to the subset of the index are sorted in the memory. Any duplicate attributes are identified from the sorted attributes in the memory. Any identifiers that correspond to any duplicate attributes also identify records in the NoSQL database to be evaluated as potential duplicate records.

    Abstract translation: 系统和方法用于为NoSQL数据库创建索引和加载键值对。 根据相应的记录字段创建与NoSQL数据库中的记录相对应的属性。 基于属性创建索引。 存储器加载了与索引的子集对应的属性作为键值对中的键以及对应于作为键值对中的值的属性的记录的标识符。 对应于索引子集的属性在内存中排序。 从内存中排序的属性中识别出任何重复的属性。 与任何重复属性相对应的任何标识符也标识要被评估为潜在重复记录的NoSQL数据库中的记录。

    Cross objects de-duplication
    13.
    发明授权

    公开(公告)号:US10949395B2

    公开(公告)日:2021-03-16

    申请号:US15085588

    申请日:2016-03-30

    Abstract: Some embodiments of the present invention include a method for determining duplicate records in multiple objects and may include combining records associated with a first object with records associated with a second object to generate a third object, wherein the first object is related to the second object; performing de-duplication on the third object to generate a combined group of duplicate sets; and from the combined group of duplicate sets, identifying at least one duplicate set associated with both the first object and the second object based on the duplicate set having at least one record associated with the first object and at least one record associated with the second object.

    Augmenting match indices
    14.
    发明授权

    公开(公告)号:US10817549B2

    公开(公告)日:2020-10-27

    申请号:US15590371

    申请日:2017-05-09

    Abstract: System creates three tries based on values stored in first three fields by records. System associates node in third trie with record, based on value stored in third field by record. System associates node with first dispersion measure, based on values stored in first field by records associated with node, and with second dispersion measure, based on values stored in second field by records associated with node. System identifies branch sequence in third trie as key for prospective record, based on value stored in third field by prospective record. System uses key to identify a subset of records that match prospective record. If a count of the subset exceeds threshold, the system identifies other branch sequence in first trie or second trie as other key for prospective record, based on first dispersion measure and second dispersion measure. System uses the key and the other key to identify at least one record that matches prospective record.

    MANAGING AUTHORIZATION TOKENS FOR CALLING THIRD-PARTY VENDORS

    公开(公告)号:US20190230081A1

    公开(公告)日:2019-07-25

    申请号:US15877703

    申请日:2018-01-23

    Abstract: Managing authorization tokens for calling third-party vendors is described. A system identifies a current call from a client computing system to an API associated with a third-party vendor, the call including a configuration file for calling the API. The system determines, in response to a determination that a previous call was made to the API, whether a previous call was made to the API and whether a part of the configuration file in the current call matches a corresponding configuration file in the previous call. The system enables the API to authorize the current call by sending a copy of a previous authorization token to the client computing system, in response to a determination that the part of the configuration file in the current call matches the corresponding part of the configuration file in the previous call, the previous authorization being previously received from the API for the previous call.

    Bulk deduplication detection
    16.
    发明授权

    公开(公告)号:US10152497B2

    公开(公告)日:2018-12-11

    申请号:US15052382

    申请日:2016-02-24

    Abstract: Some embodiments of the present invention include a system and method for removing duplicate records from a group of records in a database system. The method includes generating a first cluster of records from the group of records, generating a second cluster of records from the group of records, identifying sets of duplicate records in the first cluster of records, and identifying sets of duplicate records in the second cluster of records. The method also includes merging at least two sets of duplicate records associated with both the first cluster and the second cluster of records to form a merged set of duplicate records. The merging is performed based on the at least two sets of duplicate records having a common record. Duplicate records in the group of records may then be removed by removing duplicate records from the merged set of duplicate records.

Patent Agency Ranking