Web smart exploration and management in browser

    公开(公告)号:US11748436B2

    公开(公告)日:2023-09-05

    申请号:US17483714

    申请日:2021-09-23

    摘要: In an approach for detecting web browsing subject-oriented event interactions and intelligently organizing web pages based on insights from important interactions for better exploration and efficient management, a processor extracts time series data associated with a plurality of web browsing events based on browsing historical actions of a user. A processor identifies the subject of each web browsing event. A processor determines major events based on the time series data and subjects of the plurality of web browsing events. A processor organizes the plurality of web browsing events based on subject hierarchy and timeline from the time series data. A processor highlights one or more uniform resource locators based on the subject hierarchy and timeline.

    INCREMENTAL MACHINE LEARNING FOR A PARAMETRIC MACHINE LEARNING MODEL

    公开(公告)号:US20230137184A1

    公开(公告)日:2023-05-04

    申请号:US17453540

    申请日:2021-11-04

    IPC分类号: G06N20/00 G06K9/62

    摘要: A method, system, and computer program product for incremental machine learning for a parametric machine learning model are disclosed. The method may include processing samples comprising historical samples and new samples with an existing parametric machine learning model to obtain at least one prediction residual of each of the samples, wherein the existing parametric machine learning model was trained based on the historical samples. The method may further include clustering the samples based on the at least one prediction residual of each of the samples and features of each of the samples. The method may further include sampling samples in each cluster to ensure that each cluster includes substantially similar number of sampled samples. The method may further include updating the existing parametric machine learning model to obtain an updated parametric machine learning model based on sampled samples in each cluster.

    Feature Generation for Training Data Sets Based on Unlabeled Data

    公开(公告)号:US20230073137A1

    公开(公告)日:2023-03-09

    申请号:US17447258

    申请日:2021-09-09

    IPC分类号: G06N20/00 G06K9/62

    摘要: A computer implemented method for machine learning model training. A number of processor units creates a cluster model comprising labeled samples and unlabeled samples. The number of processor units identifies cluster information for the labeled samples from the cluster model. The number of processor units adds a set of new features to a set of original features for the labeled samples using the cluster information to form an extended set of features for the labeled samples, wherein the labeled samples with the set of original features and the set of new features form a training data set for training a machine learning model.

    ARTIFICIAL INTELLIGENCE MODEL GENERATION USING DATA WITH DESIRED DIAGNOSTIC CONTENT

    公开(公告)号:US20220101044A1

    公开(公告)日:2022-03-31

    申请号:US17035816

    申请日:2020-09-29

    IPC分类号: G06K9/62 G06N20/00

    摘要: A computer receives a general predictive model and training data. The computer builds a clustering feature tree model to condense the training data into data groups. The computer applies a leave-one-out evaluation method to determine an impact value for each data groups with regard to said general predictive model. The computer identifies a diagnostic category for each data group selected from a list of categories including model-harmful data, model-neutral data, and model-helping data, in accordance with said impact value. The computer removes data in groups labelled as model-harmful from the training data and builds a modified general predictive model based on data in groups labelled as model-neutral or model-helping.

    Efficient execution of a decision tree

    公开(公告)号:US12093838B2

    公开(公告)日:2024-09-17

    申请号:US17027688

    申请日:2020-09-21

    摘要: Embodiments of the present disclosure relate to a method, system, and computer program product for efficient execution of a decision tree. According to the method, respective target values of a plurality of attributes of a target entity are obtained. Representations of a plurality of leaf nodes of a decision tree are obtained. Each of the representations indicates respective statistic values of a plurality of attributes of historical entities and a statistic prediction result determined from historical prediction results output at a respective one of the plurality of leaf nodes for the historical entities. Distance measures between the target entity and the plurality of leaf nodes are determined based on the target values and the statistic values indicated by the representations of the plurality of leaf nodes. A target prediction result for the target entity is determined based on the distance measures and the statistic prediction results of the historical entities.

    PARTIAL IMPORTANCE OF INPUT VARIABLE OF PREDICTIVE MODELS

    公开(公告)号:US20230394326A1

    公开(公告)日:2023-12-07

    申请号:US17804935

    申请日:2022-06-01

    IPC分类号: G06N5/02

    CPC分类号: G06N5/022

    摘要: Embodiments of the present disclosure relate to a method, system, and computer program product for predictive models. According to the method, a processor may provide a first list including at least one input variable of a predictive model and a second list including a plurality of variables of the predictive model. For each of input variables in the second list, the processor may determine contribution of the input variable to prediction of the predictive model with respect to the at least one input variable in the first list. The processor may update the first list by moving an input variable in the second list into the first list based on the determined contribution of the plurality of input variables. The processor may render one or more of input variables in the updated first list based on an order of the input variables in the updated first list.