GENERALIZED NONLINEAR MIXED EFFECT MODELS VIA GAUSSIAN PROCESSES

    公开(公告)号:US20200380407A1

    公开(公告)日:2020-12-03

    申请号:US16430243

    申请日:2019-06-03

    IPC分类号: G06N20/00 G06N7/00

    摘要: In an example embodiment, training data is obtained, the training data comprising values for a plurality of different features. Then a global machine learned model is trained using a first machine learning algorithm by feeding the training data into the first machine learning algorithm during a fixed effect training process. A non-linear first random effects machine learned model is trained by feeding a subset of the training data into a second machine learning algorithm, the subset of the training data being limited to training data corresponding to a particular value of one of the plurality of different features.

    STREAM PROCESSING IN SEARCH DATA PIPELINES
    3.
    发明申请

    公开(公告)号:US20200293536A1

    公开(公告)日:2020-09-17

    申请号:US16816882

    申请日:2020-03-12

    IPC分类号: G06F16/2455

    摘要: Architecture that decomposes of one or more monolithic data concepts into atomic concepts and related atomic concept dependencies, and provides streaming data processing that processes individual or separate (atomic) data concepts and defined atomic dependencies. The architecture can comprise data-driven data processing that enables the plug-in of new data concepts with minimal effort. Efficient processing of the data concepts is enabled by streaming only required data concepts and corresponding dependencies and enablement of the seamless configuration of data processing between stream processing systems and batch processing systems as a result of data concept decomposition. Incremental and non-incremental metric processing enables realtime access and monitoring of operational parameters and queries.

    Compact entity identifier embeddings

    公开(公告)号:US11768874B2

    公开(公告)日:2023-09-26

    申请号:US16225888

    申请日:2018-12-19

    摘要: The disclosed embodiments provide a system for processing data. During operation, the system applies a first set of hash functions to a first entity identifier (ID) for a first entity to generate a first set of hash values. Next, the system produces a first set of intermediate vectors from the first set of hash values and a first set of lookup tables by matching each hash value in the first set of hash values to an entry in a corresponding lookup table in the first set of lookup tables. The system then performs an element-wise aggregation of the first set of intermediate vectors to produce a first embedding. Finally, the system outputs the first embedding for use by a machine learning model.