Identifying clusters with anomaly detection

    公开(公告)号:US11743280B1

    公开(公告)日:2023-08-29

    申请号:US17877817

    申请日:2022-07-29

    申请人: INTUIT INC.

    IPC分类号: H04L9/40

    CPC分类号: H04L63/1425 H04L63/102

    摘要: A method identifying clusters with anomaly detection. The method includes aggregating a set of events, of a user, to generate a user vector in response to identifying an event of the set of events. The method further includes aggregating a set of user vectors to a periodic vector for a time period. The method further includes processing a set of periodic vectors to generate a periodic distance. The method further includes selecting the time period, corresponding to the periodic vector, using the periodic distance and a threshold. The method further includes processing the set of user vectors to generate clusters of user vectors, wherein the set of user vectors includes the event during the time period. The method further includes processing the clusters of user vectors to identify a selected cluster and performing an action to a set of user accounts corresponding to the selected cluster.

    Extracting structural information using machine learning

    公开(公告)号:US11816912B1

    公开(公告)日:2023-11-14

    申请号:US18326735

    申请日:2023-05-31

    申请人: INTUIT INC.

    IPC分类号: G06V30/414 G06V10/77

    CPC分类号: G06V30/414 G06V10/7715

    摘要: The present disclosure provides techniques for extracting structural information using machine learning. One example method includes receiving electronic data indicating one or more pages, constructing, for each page of the one or more pages, a tree based on the page, wherein each level of the tree includes one or more nodes corresponding to elements in a level of elements in the page, encoding, for each page of the one or more pages, a value of each node of the tree for the page into a vector using a first machine learning model, sampling a plurality of pairs of vectors from the one or more trees for the one or more pages, wherein a given pair of vectors corresponds to values of nodes in a same tree, training a second machine learning model using the plurality of pairs, and combining each vector with weights of the second machine learning model.

    FEATURE RANDOMIZATION FOR SECURING MACHINE LEARNING MODELS

    公开(公告)号:US20220237482A1

    公开(公告)日:2022-07-28

    申请号:US17159463

    申请日:2021-01-27

    申请人: Intuit Inc.

    IPC分类号: G06N5/04 G06N20/00

    摘要: Feature randomization for securing machine learning models includes receiving an event, and altering, responsive to receiving the event, a threshold pseudo-randomly to generate an altered threshold value. Feature randomization further includes applying the altered threshold value to a threshold-dependent feature to generate an altered threshold-dependent feature value. The altered threshold-dependent feature value determined at least in part from the event. Feature randomization further includes executing a machine learning model, on the event and the altered threshold-dependent feature value, to generate a predicted event type for the event.