GENERATING NATURAL LANGUAGE MODEL INSIGHTS FOR DATA CHARTS USING LIGHT LANGUAGE MODELS DISTILLED FROM LARGE LANGUAGE MODELS

    公开(公告)号:US20240320421A1

    公开(公告)日:2024-09-26

    申请号:US18338033

    申请日:2023-06-20

    Applicant: Adobe Inc.

    CPC classification number: G06F40/186

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating naturally phrased insights about data charts using light language models distilled from large language models. To synthesize training data for the light language model, in some embodiments, the disclosed systems leverage insight templates for prompting a large language model for generating naturally phrased insights. In some embodiments, the disclosed systems anonymize and augment the synthesized training data to improve the accuracy and robustness of model predictions. For example, the disclosed systems anonymize training data by injecting noise into data charts before prompting the large language model for generating naturally phrased insights from insight templates. In some embodiments, the disclosed systems further augment the (anonymized) training data by splitting or partitioning data charts into folds that act as individual data charts.

    TIME-SERIES ANOMALY DETECTION
    14.
    发明公开

    公开(公告)号:US20240169258A1

    公开(公告)日:2024-05-23

    申请号:US18057883

    申请日:2022-11-22

    Applicant: Adobe Inc.

    CPC classification number: G06N20/00

    Abstract: In implementations of systems for time-series anomaly detection, a computing device implements an anomaly system to receive, via a network, time-series data describing continuously observed values separated by a period of time. The anomaly system computes updated estimated parameters of a predictive model for the time-series data by performing a rank one update on previously estimated parameters of the predictive model. An uncertainty interval for a future observed value is generated using the predictive model with the updated estimated parameters. The anomaly system determines that an observed value corresponding to the future observed value is outside of the uncertainty interval. An indication is generated that the observed value is an anomaly.

    DETERMINING DATA LOSS FOR INTERNET TRAFFIC DATA

    公开(公告)号:US20240119831A1

    公开(公告)日:2024-04-11

    申请号:US17938742

    申请日:2022-10-07

    Applicant: Adobe Inc.

    CPC classification number: G08G1/0145 G08G1/0133 G08G1/0141

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media that determine internet traffic data loss from internet traffic data including bulk ingested data utilizing an internet traffic forecasting model. In particular, the disclosed systems detect that observed internet traffic data includes bulk ingested internet traffic data. In addition, the disclosed systems determine a predicted traffic volume for an outage period from the bulk ingested internet traffic data utilizing an internet traffic forecasting model. The disclosed systems further generate a decomposed predicted traffic volume for the outage period. The disclosed systems also determine an internet traffic data loss for the outage period from the decomposed predicted traffic volume while calibrating for pattern changes and late data from previous periods.

    Automatic identification of impermissable account sharing

    公开(公告)号:US10992972B1

    公开(公告)日:2021-04-27

    申请号:US16731406

    申请日:2019-12-31

    Applicant: ADOBE INC.

    Abstract: The present disclosure relates to a method for detecting impermissible account sharing among user accounts of a streaming media service including the steps of determining a plurality of locations accessed by a given user account of the user accounts; determining a device access count for each of the locations, the device access count indicating how many times the corresponding location was accessed by at least one device associated with the given user account; identifying one of the locations having the highest device access count as a base location; calculating a risk coefficient for each remaining location; generating a sharing score for the given user account by summing the risk coefficients; and determining impermissible account sharing of the given user account has occurred when the sharing score exceeds a threshold.

    SELECTING ATTRIBUTES BY PROGRESSIVE SAMPLING TO GENERATE DIGITAL PREDICTIVE MODELS

    公开(公告)号:US20210117816A1

    公开(公告)日:2021-04-22

    申请号:US17136727

    申请日:2020-12-29

    Applicant: Adobe Inc.

    Abstract: The present disclosure includes methods and systems for generating digital predictive models by progressively sampling a repository of data samples. In particular, one or more embodiments of the disclosed systems and methods identify initial attributes for predicting a target attribute and utilize the initial attributes to identify a coarse sample set. Moreover, the disclosed systems and methods can utilize the coarse sample set to identify focused attributes pertinent to predicting the target attribute. Utilizing the focused attributes, the disclosed systems and methods can identify refined data samples and utilize the refined data samples to identify final attributes and generate a digital predictive model.

    FACILITATING EFFICIENT AND EFFECTIVE ANOMALY DETECTION VIA MINIMAL HUMAN INTERACTION

    公开(公告)号:US20220292074A1

    公开(公告)日:2022-09-15

    申请号:US17200522

    申请日:2021-03-12

    Applicant: ADOBE INC.

    Abstract: Embodiments of the present technology provide systems, methods, and computer storage media for facilitating anomaly detection. In some embodiments, a prediction model is generated using a training data set. The prediction model is used to predict an expected value for a latest (current) timestamp, which is used to determine that the incoming observed data value is an anomaly. Based on the incoming observed data value determined to be the anomaly or not, a corrected data value is generated to be included in the training data set. Thereafter, the training data set having the corrected data value is used to update the prediction model for use in determining whether a subsequent observed data value is anomalous. Such a process may be performed in an iterative manner to maintain optimized training data and prediction model.

    Selecting attributes by progressive sampling to generate digital predictive models

    公开(公告)号:US10885441B2

    公开(公告)日:2021-01-05

    申请号:US15388922

    申请日:2016-12-22

    Applicant: Adobe Inc.

    Abstract: The present disclosure includes methods and systems for generating digital predictive models by progressively sampling a repository of data samples. In particular, one or more embodiments of the disclosed systems and methods identify initial attributes for predicting a target attribute and utilize the initial attributes to identify a coarse sample set. Moreover, the disclosed systems and methods can utilize the coarse sample set to identify focused attributes pertinent to predicting the target attribute. Utilizing the focused attributes, the disclosed systems and methods can identify refined data samples and utilize the refined data samples to identify final attributes and generate a digital predictive model.

    SOFTWARE COMPONENT DEFECT PREDICTION USING CLASSIFICATION MODELS THAT GENERATE HIERARCHICAL COMPONENT CLASSIFICATIONS

    公开(公告)号:US20200241861A1

    公开(公告)日:2020-07-30

    申请号:US16259454

    申请日:2019-01-28

    Applicant: Adobe Inc.

    Abstract: Systems and methods for facilitating updates to software programs via machine-learning techniques are disclosed. In an example, an application generates a feature vector from a textual description of a software defect by applying a topic model to the textual description. The application uses the feature vector and one or more machine-learning models configured to predict classifications and sub-classifications of the textual description. The application integrates the classifications and the sub-classifications into a final classification of the textual description that indicates a software component responsible for causing the software defect. The final classification is usable for correcting the software defect.

Patent Agency Ranking