-
Publication No.: US10949395B2
Publication Date: 2021-03-16
Application No.: US15085588
Application Date: 2016-03-30
Applicant: salesforce.com, inc.
Inventor: Dai Duong Doan, Arun Kumar Jagota, Chenghung Ker, Parth Vaishnav, Danil Dvinov, Dmytro Kudriavtsev
IPC: G06F16/174, G06F16/23
Abstract: Some embodiments of the present invention include a method for determining duplicate records in multiple objects and may include combining records associated with a first object with records associated with a second object to generate a third object, wherein the first object is related to the second object; performing de-duplication on the third object to generate a combined group of duplicate sets; and from the combined group of duplicate sets, identifying at least one duplicate set associated with both the first object and the second object based on the duplicate set having at least one record associated with the first object and at least one record associated with the second object.
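The three steps in this abstract (combine, de-duplicate, keep cross-object sets) can be sketched as follows. This is a minimal illustration, not the patented method: the function name `cross_object_duplicates`, the source tags, and the use of exact matching on a caller-supplied key in place of a real de-duplication engine are all assumptions.

```python
from collections import defaultdict

def cross_object_duplicates(first, second, key):
    """Return duplicate sets that contain records from both objects.

    first, second: lists of record dicts (hypothetical shape).
    key: function mapping a record to its match key (assumed exact matching).
    """
    # Step 1: combine both objects into a third collection, tagging each
    # record with the object it came from.
    combined = [("first", r) for r in first] + [("second", r) for r in second]

    # Step 2: de-duplicate the combined collection by grouping on the key.
    groups = defaultdict(list)
    for source, record in combined:
        groups[key(record)].append((source, record))
    duplicate_sets = [g for g in groups.values() if len(g) > 1]

    # Step 3: keep only sets with at least one record from each object.
    return [g for g in duplicate_sets
            if {source for source, _ in g} == {"first", "second"}]

leads = [{"id": "L1", "email": "a@x.com"},
         {"id": "L2", "email": "b@x.com"},
         {"id": "L3", "email": "b@x.com"}]
contacts = [{"id": "C1", "email": "A@x.com"}]
cross = cross_object_duplicates(leads, contacts, lambda r: r["email"].lower())
```

Here `L2` and `L3` form a duplicate set, but it is dropped because both records come from the same object; only the `L1`/`C1` set survives.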
-
Publication No.: US10942906B2
Publication Date: 2021-03-09
Application No.: US16026819
Application Date: 2018-07-03
Applicant: salesforce.com, inc.
Inventor: Alexandre Hersans, Swaroop Shere, Chenghung Ker, Parth Vijay Vaishnav, Assaf Ben-Gur, Victor Weilin Liu, Daniel McGarry, Samatha Sanikommu
IPC: G06F16/00, G06F16/215, G06F21/60, G06Q30/00, G06F16/22, G06F16/23, G06F16/2458
Abstract: Disclosed herein are system, method, and computer program product embodiments for detecting duplicates with exact and fuzzy matching on encrypted match indexes using an encryption key in a cloud computing platform. An embodiment operates by determining a match rule index value upon reception of a new record. The embodiment encrypts the match rule index value using the customer's encryption key and a deterministic encryption method and stores the encrypted match rule index value. Duplicate detection may be later performed by using the same deterministic encryption method to determine a ciphertext for a candidate entry and comparing the ciphertext to the stored encrypted match indexes.
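The key property described here is that the same input under the same key always yields the same protected value, so equality can be tested without decrypting. The sketch below illustrates only that property and only for exact matching. It swaps in HMAC-SHA256 (a keyed hash, not a reversible cipher) as a stand-in for the deterministic encryption method, and the match rule (normalized name plus email) and class name are hypothetical.

```python
import hashlib
import hmac

def match_index_value(record):
    # Hypothetical match rule: normalized name and email joined together.
    return f"{record['name'].strip().lower()}|{record['email'].strip().lower()}"

def protect(value, key):
    # Stand-in for deterministic encryption: a keyed hash that maps equal
    # inputs to equal outputs under the same key.
    return hmac.new(key, value.encode("utf-8"), hashlib.sha256).hexdigest()

class ProtectedMatchIndex:
    def __init__(self, key):
        self._key = key
        self._tokens = set()

    def add(self, record):
        # Store only the protected match index value, never the plaintext.
        self._tokens.add(protect(match_index_value(record), self._key))

    def is_duplicate(self, candidate):
        # Protect the candidate the same way and compare tokens.
        return protect(match_index_value(candidate), self._key) in self._tokens

index = ProtectedMatchIndex(key=b"customer-secret-key")
index.add({"name": "Ann Lee", "email": "ann@x.com"})
```

A candidate such as `{"name": " ann lee ", "email": "ANN@x.com"}` normalizes to the same match index value and is therefore reported as a duplicate.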
-
Publication No.: US20200250189A1
Publication Date: 2020-08-06
Application No.: US16855671
Application Date: 2020-04-22
Applicant: salesforce.com, inc.
Inventor: Danil Dvinov, Parth Vijay Vaishnav, Chenghung Ker
IPC: G06F16/2453, G06F16/2458, G06F16/242
Abstract: Methods for splitting a plurality of records into subsets for parallel processing by separate threads are disclosed. In one embodiment, a plurality of records are added to a database. The adding includes tracking, for each unique value found in the plurality of records within a selected one of the sequence of fields, a statistic based on the values found in the plurality of records within a next one of the sequence of fields. Then a plurality of queries are generated, each of which includes a row value constructor (RVC) with a logical operation, a specific one of the unique values found within the selected one of the sequence of fields, and, for that specific value, the tracked statistic. Then a plurality of threads are caused to execute, each of which submits one of the queries and operates on a subset of the plurality of records responsive to execution of that query.
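One way to picture the splitting is range partitioning on a two-field key. The sketch below is an interpretation under stated assumptions, not the claimed method: it assumes the tracked statistic is the maximum of the next field, uses in-memory tuples in place of a database, and relies on Python's lexicographic tuple comparison to mimic an SQL row value constructor such as `(f1, f2) <= ('b', 9)`. All function names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor

def track_max_next(records):
    # For each unique value of the first field, track the maximum value
    # seen in the next field (assumed choice of statistic).
    stats = {}
    for first, second in records:
        stats[first] = max(stats.get(first, second), second)
    return stats

def build_bounds(stats):
    # One (lower, upper] range per unique first-field value.
    uppers = sorted(stats.items())
    lowers = [None] + uppers[:-1]
    return list(zip(lowers, uppers))

def select(records, lower, upper):
    # Tuple comparison is lexicographic, like an RVC comparison in SQL.
    return [r for r in records
            if (lower is None or r > lower) and r <= upper]

def process_in_parallel(records, worker):
    bounds = build_bounds(track_max_next(records))

    def run(lower, upper):
        # Each thread "submits" its own range query and works on the result.
        return worker(select(records, lower, upper))

    with ThreadPoolExecutor() as pool:
        futures = [pool.submit(run, lo, hi) for lo, hi in bounds]
        return [f.result() for f in futures]

records = [("a", 1), ("a", 3), ("b", 2), ("c", 5), ("b", 9)]
counts = process_in_parallel(records, len)
```

Because the ranges are contiguous and non-overlapping, every record lands in exactly one subset.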
-
Publication No.: US10733613B2
Publication Date: 2020-08-04
Application No.: US15397824
Application Date: 2017-01-04
Applicant: salesforce.com, inc.
Inventor: Parth Vijay Vaishnav, Danil Dvinov, Chenghung Ker, Dai Duong Doan
IPC: G06F16/215, G06Q30/00
Abstract: Some embodiments of the present invention include a method for performing data assessment. The method includes receiving, by a first server computing system, a request to assess first data stored in a storage device associated with a second server computing system, the request including one or more rules indicating how the first data is to be assessed; performing, by the first server computing system, an assessment of the first data based on the one or more rules and based on a data source associated with the first server computing system, the data source including data provided by one or more data providers, the assessment of the first data including assessing outdated information and missing information based on the data source; and providing, by the first server computing system, an assessment result from the assessment of the first data and an option to engage in an enrichment of the first data based on the assessment result and the data source.
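The assessment step (flagging missing and outdated information against a reference data source) can be illustrated with a small sketch. The record shape, the `assess` function, and the idea that a rule is simply a list of field names to check are assumptions made for illustration; the abstract does not specify them.

```python
def assess(records, reference, fields):
    """Compare records against a reference data source.

    records: list of dicts with an "id" key (hypothetical shape).
    reference: dict mapping id to the provider's version of the record.
    fields: field names to assess (a stand-in for the request's rules).
    """
    report = {"missing": [], "outdated": []}
    for record in records:
        ref = reference.get(record["id"])
        if ref is None:
            continue  # No reference data, so nothing to assess against.
        for field in fields:
            value = record.get(field)
            if value in (None, ""):
                report["missing"].append((record["id"], field))
            elif field in ref and value != ref[field]:
                report["outdated"].append((record["id"], field))
    return report

customer_data = [{"id": 1, "phone": "", "city": "Austin"},
                 {"id": 2, "phone": "555-0100", "city": "Boston"}]
provider_data = {1: {"phone": "555-0199", "city": "Dallas"},
                 2: {"phone": "555-0100", "city": "Boston"}}
result = assess(customer_data, provider_data, ["phone", "city"])
```

The resulting report could then drive an offer to enrich the flagged fields from the same reference data.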
-
Publication No.: US10521406B2
Publication Date: 2019-12-31
Application No.: US15282486
Application Date: 2016-09-30
Applicant: salesforce.com, inc.
Inventor: Parth Vaishnav, Chenghung Ker, Daniel McGarry, Danil Dvinov
IPC: G06F16/215, G06F16/21, G06F16/2455, G06F16/28
Abstract: Some embodiments of the present invention include a method for merging nodes in hierarchies, the method including receiving, by a database system, a request to merge a first node in a first hierarchy of a plurality of nodes into a second node, the first node in the first hierarchy having child nodes; determining, by the database system, whether the merge request violates each of a plurality of constraints, the plurality of constraints comprising circular dependency of nodes, deleted node and ordering of nodes; and re-parenting, by the database system, in response to a determination that the merge request does not violate each of the plurality of constraints, the child nodes of the first node in the first hierarchy of nodes to the second node in response to the merging of the first node into the second node.
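A merge with constraint checks and re-parenting might look like the sketch below. It assumes the hierarchy is an acyclic parent map (`node -> parent`, with `None` for a root) and models only two of the named constraints, deleted nodes and circular dependency; the ordering constraint is not specified in the abstract and is left out. The function names are hypothetical.

```python
def is_descendant(parent_of, node, ancestor):
    # Walk up from node; True if ancestor is found on the way to the root.
    while node is not None:
        node = parent_of.get(node)
        if node == ancestor:
            return True
    return False

def merge_node(parent_of, deleted, first, second):
    """Merge `first` into `second`, re-parenting the children of `first`."""
    if first == second:
        raise ValueError("cannot merge a node into itself")
    if first in deleted or second in deleted:
        raise ValueError("deleted node")
    if is_descendant(parent_of, second, first):
        # Re-parenting would make `second` its own ancestor.
        raise ValueError("circular dependency")
    for child, parent in list(parent_of.items()):
        if parent == first:
            parent_of[child] = second
    parent_of.pop(first, None)
    deleted.add(first)

parent_of = {"root": None, "a": "root", "b": "root", "a1": "a", "a2": "a"}
deleted = set()
merge_node(parent_of, deleted, "a", "b")
```

After the call, `a1` and `a2` hang under `b`, and `a` is recorded as deleted so later merge requests that mention it are rejected.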
-
Publication No.: US10152497B2
Publication Date: 2018-12-11
Application No.: US15052382
Application Date: 2016-02-24
Applicant: salesforce.com, inc.
Inventor: Dai Duong Doan, Arun Kumar Jagota, Chenghung Ker, Parth Vaishnav, Danil Dvinov, Dmytro Kudriavtsev
Abstract: Some embodiments of the present invention include a system and method for removing duplicate records from a group of records in a database system. The method includes generating a first cluster of records from the group of records, generating a second cluster of records from the group of records, identifying sets of duplicate records in the first cluster of records, and identifying sets of duplicate records in the second cluster of records. The method also includes merging at least two sets of duplicate records associated with both the first cluster and the second cluster of records to form a merged set of duplicate records. The merging is performed based on the at least two sets of duplicate records having a common record. Duplicate records in the group of records may then be removed by removing duplicate records from the merged set of duplicate records.
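The merge step, in which duplicate sets from different clusters are joined whenever they share a record, can be sketched as follows. This is a minimal illustration that assumes records are represented by hashable IDs and that the per-cluster duplicate sets have already been computed; `merge_duplicate_sets` is a hypothetical name.

```python
def merge_duplicate_sets(duplicate_sets):
    """Merge any duplicate sets that share at least one common record."""
    merged = []  # Invariant: the sets in this list are pairwise disjoint.
    for current in duplicate_sets:
        current = set(current)
        keep = []
        for existing in merged:
            if existing & current:
                current |= existing   # Shared record: absorb the set.
            else:
                keep.append(existing)
        merged = keep + [current]
    return merged

# Duplicate sets found in two clusters built from the same group of records.
first_cluster_sets = [{1, 2}, {5, 6}]
second_cluster_sets = [{2, 3}, {7, 8}]
merged_sets = merge_duplicate_sets(first_cluster_sets + second_cluster_sets)
```

Record `2` appears in a set from each cluster, so `{1, 2}` and `{2, 3}` collapse into `{1, 2, 3}`; removing all but one record from each merged set then removes the duplicates from the group.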