Data segmentation using clustering

    公开(公告)号:US12124483B2

    公开(公告)日:2024-10-22

    申请号:US18181831

    申请日:2023-03-10

    IPC分类号: G06F16/28

    CPC分类号: G06F16/287

    摘要: Method includes obtaining sample records from dataset associated with user and including records associated with identifiers customers of user; executing first clustering using sample records, to obtain first set of clusters for first identifiers associated with sample records, first clustering using features associated with first identifiers; providing visualization of first set of clusters; determining whether user input for optimizing first set of clusters provided in visualization is received; when user input for optimizing first set of clusters is not received, determining first information related to first set of clusters as final result information; when user input for optimizing first set of clusters is received: executing second clustering using sample records, to obtain second set of clusters for first identifiers, second clustering using features associated with first identifiers, and determining second information related to second set of clusters as final result information; and clustering entire dataset using final result information.

    HIERARCHICAL VISUALIZATION OF CLUSTERED DATASETS

    公开(公告)号:US20240168979A1

    公开(公告)日:2024-05-23

    申请号:US17989202

    申请日:2022-11-17

    IPC分类号: G06F16/28 G06F16/22

    CPC分类号: G06F16/287 G06F16/2246

    摘要: Systems, methods, and other embodiments associated with converting a static cluster data table to a graphical hierarchical tree are described. In one embodiment, a method includes recursively traversing the static cluster data table to identify a root cluster, identify child clusters from the root cluster and child clusters from each other that define parent-child relationships, and identify decision segments that caused a segment split of cluster data. A 2-dimensional visual hierarchy is generated and displayed in a graphical form using a plurality of nodes that represent the root cluster and the child clusters along with path lines that connect the nodes. The 2-dimensional visual hierarchy displays a hierarchical visualization of the static cluster data table that shows an order of decision segments that occurred to segment a dataset and how the dataset was segmented by a clustering algorithm leading to a final cluster of a leaf node.

    SCORING CORRELATED INDEPENDENT VARIABLES FOR ELIMINATION FROM A DATASET

    公开(公告)号:US20230351211A1

    公开(公告)日:2023-11-02

    申请号:US17733420

    申请日:2022-04-29

    发明人: Mridul Kumar Nath

    IPC分类号: G06N5/04 G06N5/02

    CPC分类号: G06N5/022 G06N5/041

    摘要: Techniques are disclosed as an optimization data system for eliminating correlated independent variables programmatically from data with ranked exclusion scores. The system can obtain an initial dataset comprising variables, determine a set of correlation values by analyzing linear correlation between the variables, generate a correlation matrix using at least in part the set of correlation values and corresponding variables from the initial data, calculate exclusion scores for the variables in the correlation matrix that exhibit multicollinearity, and update the initial dataset by removing at least one variable with the highest exclusion score from the variables to generate an updated dataset comprising optimized variables. The steps for correlation and elimination of variables are iterated until an updated dataset without any correlation is obtained and then a machine learning model may be trained using the updated dataset.

    Below-the-line thresholds tuning with machine learning

    公开(公告)号:US11651375B2

    公开(公告)日:2023-05-16

    申请号:US17347940

    申请日:2021-06-15

    摘要: Systems, methods, and other embodiments for ML-Based automated below-the-line threshold tuning include, in one embodiment, training an ML model to predict probabilities that an event is fraudulent on a set of events (i) sampled from a set of historic events labeled by an alerting engine as either above-the-line events or below-the-line events on either side of a threshold line indicating that an event is suspicious, and (ii) confirmed to be either fraudulent or not fraudulent; determining that the alerting engine should be tuned based on differences between probability values predicted for the events by the trained machine learning model and the labels applied to the events; generating a tuned threshold value for the threshold line based at least in part on the probability values predicted by the machine learning model; and tuning the alerting engine by replacing a threshold value with the tuned threshold value to adjust the threshold line.

    Computing framework for compliance report generation

    公开(公告)号:US11544669B2

    公开(公告)日:2023-01-03

    申请号:US15632482

    申请日:2017-06-26

    摘要: Systems, methods, and other embodiments associated with a framework for compliance report generation are described. In one embodiment, a method includes receiving a data source definition of a set of data sources comprising data for populating compliance reports. The example method may also include retrieving a compliance report definition for a compliance report for a reporting entity. The example method may also include constructing and rendering a user interface populated with a set of user interface elements generated based upon the set of data sources and the compliance report definition. The example method may also include generating the compliance report according to the compliance report definition. The compliance report is populated with data from the set of data sources. The compliance report is sent over a computing network to a remote computing device of the reporting entity.

    Dependency graph-controlled object and compute pipeline migration

    公开(公告)号:US10969929B2

    公开(公告)日:2021-04-06

    申请号:US16430566

    申请日:2019-06-04

    摘要: Control migration of a state machine using a dependency graph interface by: analyzing a state machine to determine objects and dependencies between the objects; generating a dependency graph that represents the objects and the dependencies between the objects, wherein the objects are represented by selectable icons; displaying the dependency graph on a display device; in response to a selection of a particular selectable icon, providing a migration option for an object represented by the selectable icon, wherein the migration option includes at least a selection between either a deep copy or a shallow copy for the object represented by the selectable icon; accepting and storing a selection of the migration option for the object represented by the particular selectable icon; and migrating the state machine to a target environment based at least in part on performing the migration option for the object represented by the particular selectable icon.

    COMPUTERIZED CONTROL OF EXECUTION PIPELINES

    公开(公告)号:US20210055972A1

    公开(公告)日:2021-02-25

    申请号:US17089906

    申请日:2020-11-05

    IPC分类号: G06F9/50 G06F9/48

    摘要: Systems, methods, and other embodiments associated with controlling an execution pipeline are described. In one embodiment, a method includes generating an execution pipeline for executing a plurality of tasks. The example method may also include evaluating execution definitions of the tasks to identify execution properties of the plurality of tasks. The example method may also include assigning each task to an execution environment selected from a set of execution environments based upon execution properties of the task matching execution properties of the execution environments. The example method may also include controlling the execution pipeline to execute each task within the assigned execution environments.

    Computing scenario forecasts using electronic inputs

    公开(公告)号:US10460010B2

    公开(公告)日:2019-10-29

    申请号:US15201871

    申请日:2016-07-05

    摘要: Systems, methods, and other embodiments associated with computing scenario forecasts according to electronic inputs are described. In one embodiment, a method includes, in response to receiving a signal that triggers data collection, collecting electronic data from one or more electronic databases by aggregating the electronic data into data structures of a processing table. The electronic data defines historic values of a set of instruments. The method also includes computing projected values for each of the set of instruments according to correlations identified in the historic values. The projected values form primary forecasts that model expected future values of the set of instruments. The method includes, in response to receiving electronic inputs including scenario variables of a scenario that affects the primary forecasts, generating scenario forecasts for the set of instruments according to the scenario variables and the projected values to identify how the scenario influences the primary forecasts.

    Computer system and method for executing applications with new data structures

    公开(公告)号:US10152318B2

    公开(公告)日:2018-12-11

    申请号:US15420332

    申请日:2017-01-31

    IPC分类号: G06F9/44 G06F8/656

    摘要: Systems, methods, and other embodiments associated with introducing a new data structure to an executing application are described. In one embodiment, a method includes executing an application as an executing application to process data of a data structure maintained according to a data model. The example method may also include receiving a new data structure definition of a new data structure to define for the data model. The example method may also include performing impact analysis to determine whether the executing application is capable of processing data of the new data structure. The example method may also include updating the data model to include the new data structure definition to create an updated data model. The example method may also include generating control instructions to instruct the executing application to utilize data from the new data structure according to the updated data model.