-
公开(公告)号:US20210157847A1
公开(公告)日:2021-05-27
申请号:US17163081
申请日:2021-01-29
Applicant: salesforce.com, inc.
Inventor: Nathan Irace Burke , Kexin Xie , Xingyu Wang , Wanderley Liu , David Yourdon
IPC: G06F16/906 , G06K9/62 , G06F17/18
Abstract: A data processing server may receive a set of data objects for frequent pattern (FP) analysis. The set of data objects may be analyzed using an attribute diversity technique. For the set of data attributes of the set of data objects, the server may arrange the attributes in one or more dimensions. The server may initialize a set of centroids on data points and identify mean values of nearby data points. Based on an iteration of the mean value calculation, the server may identify a set of attributes corresponding to final mean values as being groups of similarly frequent attributes. These groups of similarly frequent attributes may be analyzed using an FP analysis procedure to identify frequent patterns of data attributes.
-
公开(公告)号:US20200322307A1
公开(公告)日:2020-10-08
申请号:US16506773
申请日:2019-07-09
Applicant: salesforce.com, inc.
Inventor: Yuxi Zhang , Kexin Xie , Sheng Loong Su , Shrestha Basu Mallick
Abstract: A cloud platform supports a digital communication system that identifies recommended communication frequencies based on past communication data. The cloud platform may support blending of weights applied to different engagement rates. Based on the weights, the system identifies recommended frequency ranges to maximize engagement rates, including the blended engagement rate using a redistribution simulation process.
-
公开(公告)号:US12236264B2
公开(公告)日:2025-02-25
申请号:US17163386
申请日:2021-01-30
Applicant: salesforce.com, inc.
Inventor: Yuxi Zhang , Kexin Xie
Abstract: Systems, devices, and techniques are disclosed for data shards for distributed processing. Data sets of data for users may be received. The data sets may belong to separate groups. User identifiers in the data sets may be hashed to generate hashed identifiers for the data sets. The user identifiers in the data sets may be replaced with the hashed identifiers. The data sets may be split to generate shards. The data sets may be split into the same number of shards. Merged shards may be generated by merging the shards using a separate running process for each of the merged shards. The merged shards may be generated using shards from more than one of the two or more data sets. An operation may be performed on all of the merged shards.
-
公开(公告)号:US20220207407A1
公开(公告)日:2022-06-30
申请号:US17134430
申请日:2020-12-27
Applicant: salesforce.com, inc.
Inventor: Yuxi Zhang , Kexin Xie
Abstract: Systems, devices, and techniques are disclosed for localization of machine learning models trained with global data. Data sets of event data for users may be received. The data sets may belong to separate groups. The data sets of event data may be combined to generate a global data set. A matrix factorization model may be trained using the global data set to generate a globally trained matrix factorization model. A localization group data set may be generated including event data from the global data set for users from a first of the groups. The globally trained matrix factorization model may be trained with the localization group data set to generate a localized matrix factorization model for the first of the groups.
-
公开(公告)号:US11275768B2
公开(公告)日:2022-03-15
申请号:US16120067
申请日:2018-08-31
Applicant: salesforce.com, inc.
Inventor: Yacov Salomon , Kexin Xie , Wanderley Liu
IPC: G06F16/28 , G06F16/2458 , G06F16/906 , G06N5/02
Abstract: Methods, systems, and devices supporting differential support for frequent pattern (FP) analysis are described. Some database systems may analyze data sets to determine FPs of data attributes within the data sets. However, if data distributions for different types of data attributes vary greatly, more frequent data attribute types may skew the FPs away from the less frequent types. To reduce the noise of common attributes while maintaining sensitivity to the less common attributes, the database system may implement multiple minimum support (e.g., frequency) thresholds. For example, the database system may adaptively categorize the different data attribute types into data categories based on their distributions and may dynamically determine support thresholds for the categories. Using different minimum support thresholds for different data categories allows the system to filter out data attribute patterns based on the distributions of the data attribute types in the pattern.
-
公开(公告)号:US11061937B2
公开(公告)日:2021-07-13
申请号:US16144715
申请日:2018-09-27
Applicant: salesforce.com, inc.
Inventor: Yacov Salomon , Jonathan Purnell , Wanderley Liu , Kexin Xie
IPC: G06F16/00 , G06F16/28 , G06Q30/02 , G06F16/907 , G06F16/9535 , G06F16/34
Abstract: A database system performs lookalike analysis on a data set including a plurality of user identifiers, which are associated with one or more attribute records. The database system classifies the user identifiers into one or more segments of user identifiers based on the attribute records. The database system performs Linear Discriminant Analysis (LDA) to calculate a measure of importance of the attribute records relative to the one or more segments. The database system auto-correlates the attribute records based on the numbers of attribute records in the user identifier population and the one or more segments. The database system identifies a set of user identifiers relative to one or more segments using the measures of importance and the auto-correlated parameters.
-
公开(公告)号:US20210157974A1
公开(公告)日:2021-05-27
申请号:US16901656
申请日:2020-06-15
Applicant: salesforce.com, inc.
Inventor: Kexin Xie , Gokhan Cagrici , Daniel Keith Wilson , Shrestha Basu Mallick , Jonathan Daniel Showers Belkowitz , Jason Lestina , James Brewer , Daniel Louis Gasperut , Jeffrey Allen Zickgraf , Greg Lyman , Michael Ronald Brewer , Evan Black , Austin Rauschuber , Victoria Schultz , Matthew David Trepina , Peter Stadlinger
IPC: G06F40/166 , H04L12/58 , G06F40/216 , G06N20/00
Abstract: Methods, systems, and devices supporting data processing are described. In some systems, a data processing platform may support communication message analysis using machine learning. For example, a system may receive a set of communication messages (e.g., social media messages) and perform a machine learning process on the message contents and message interaction data to train a machine learned model. The system may further receive a subject line for a communication message for analysis, input the subject line into the machine learned model, and receive, as an output of the machine learned model, an engagement score based on the subject line. The engagement score may indicate an estimated probability that a user receiving the communication message opens the communication message (e.g., based on the subject line). A user—or the system—may modify the subject line based on the analysis to improve the engagement score.
-
公开(公告)号:US10963519B2
公开(公告)日:2021-03-30
申请号:US16355996
申请日:2019-03-18
Applicant: salesforce.com, inc.
Inventor: Nathan Irace Burke , Kexin Xie , Xingyu Wang , Wanderley Liu , David Yourdon
IPC: G06F16/906 , G06K9/62 , G06F17/18 , G06Q30/00
Abstract: A data processing server may receive a set of data objects for frequent pattern (FP) analysis. The set of data objects may be analyzed using an attribute diversity technique. For the set of data attributes of the set of data objects, the server may arrange the attributes in one or more dimensions. The server may initialize a set of centroids on data points and identify mean values of nearby data points. Based on an iteration of the mean value calculation, the server may identify a set of attributes corresponding to final mean values as being groups of similarly frequent attributes. These groups of similarly frequent attributes may be analyzed using an FP analysis procedure to identify frequent patterns of data attributes.
-
公开(公告)号:US11556595B2
公开(公告)日:2023-01-17
申请号:US17163081
申请日:2021-01-29
Applicant: salesforce.com, inc.
Inventor: Nathan Irace Burke , Kexin Xie , Xingyu Wang , Wanderley Liu , David Yourdon
IPC: G06F16/906 , G06F17/18 , G06K9/62 , G06Q30/00
Abstract: A data processing server may receive a set of data objects for frequent pattern (FP) analysis. The set of data objects may be analyzed using an attribute diversity technique. For the set of data attributes of the set of data objects, the server may arrange the attributes in one or more dimensions. The server may initialize a set of centroids on data points and identify mean values of nearby data points. Based on an iteration of the mean value calculation, the server may identify a set of attributes corresponding to final mean values as being groups of similarly frequent attributes. These groups of similarly frequent attributes may be analyzed using an FP analysis procedure to identify frequent patterns of data attributes.
-
公开(公告)号:US11475207B2
公开(公告)日:2022-10-18
申请号:US16901656
申请日:2020-06-15
Applicant: salesforce.com, inc.
Inventor: Kexin Xie , Gokhan Cagrici , Daniel Keith Wilson , Shrestha Basu Mallick , Jonathan Daniel Showers Belkowitz , Jason Lestina , James Brewer , Daniel Louis Gasperut , Jeffery Allen Zickgraf , Greg Lyman , Michael Ronald Brewer , Evan Black , Austin Rauschuber , Victoria Schultz , Matthew David Trepina , Peter Stadlinger
IPC: G06F40/166 , G06N20/00 , G06F40/216 , H04L51/52
Abstract: Methods, systems, and devices supporting data processing are described. In some systems, a data processing platform may support communication message analysis using machine learning. For example, a system may receive a set of communication messages (e.g., social media messages) and perform a machine learning process on the message contents and message interaction data to train a machine learned model. The system may further receive a subject line for a communication message for analysis, input the subject line into the machine learned model, and receive, as an output of the machine learned model, an engagement score based on the subject line. The engagement score may indicate an estimated probability that a user receiving the communication message opens the communication message (e.g., based on the subject line). A user—or the system—may modify the subject line based on the analysis to improve the engagement score.
-
-
-
-
-
-
-
-
-