CALIBRATED MODEL INTERVENTION WITH CONFORMAL THRESHOLD

    Publication No.: US20240330772A1

    Publication Date: 2024-10-03

    Application No.: US18618757

    Filing Date: 2024-03-27

    IPC Classification: G06N20/00

    CPC Classification: G06N20/00

    Abstract: A classification model is calibrated with a conformal threshold to establish a known error rate for its classifications. Rather than using the model outputs directly, the outputs are processed into conformal scores, and each score is compared with the conformal threshold to determine whether a data sample is a member of the corresponding class. When exactly one class passes the conformal threshold for a data sample, the action associated with that class can be applied confidently, with a known error rate. When zero classes or multiple classes pass the threshold, this may indicate sufficient uncertainty in the model prediction, and the data sample may be escalated to another decision mechanism, such as manual review or a more complex classification model.
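The abstract does not say how the conformal score or threshold is computed. As a minimal sketch, assuming a standard split-conformal setup, the flow might look like the following. The score definition (1 minus the probability assigned to a class), the function names `conformal_threshold`, `prediction_set`, and `decide`, and the parameter `alpha` (target error rate) are illustrative assumptions, not taken from the application.

```python
import math

def conformal_threshold(cal_probs, cal_labels, alpha):
    """Estimate a threshold from held-out calibration data (split conformal)."""
    # Assumed nonconformity score: 1 - probability given to the true class.
    scores = sorted(1.0 - p[y] for p, y in zip(cal_probs, cal_labels))
    n = len(scores)
    # Finite-sample corrected rank: ceil((n + 1) * (1 - alpha)).
    rank = math.ceil((n + 1) * (1.0 - alpha))
    if rank > n:
        # Too few calibration samples for this alpha: every class passes.
        return float("inf")
    return scores[rank - 1]

def prediction_set(probs, threshold):
    """Return the classes whose conformal score passes the threshold."""
    return [c for c, p in enumerate(probs) if 1.0 - p <= threshold]

def decide(probs, threshold):
    """Act when exactly one class passes; otherwise escalate the sample."""
    classes = prediction_set(probs, threshold)
    if len(classes) == 1:
        return ("act", classes[0])
    return ("escalate", classes)
```

Under these assumptions, a call such as `decide(model_probs, threshold)` returns `("act", class_index)` when exactly one class passes, and `("escalate", passing_classes)` when the set is empty or holds several classes, which corresponds to the manual-review or larger-model path described in the abstract.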

    TEXT-CONDITIONED VIDEO REPRESENTATION
    2.
    Invention Publication

    Publication No.: US20230351753A1

    Publication Date: 2023-11-02

    Application No.: US17894738

    Filing Date: 2022-08-24

    IPC Classification: G06V20/40

    CPC Classification: G06V20/47 G06V20/41

    Abstract: A text-video recommendation model determines the relevance of a text to a video in a text-video pair (e.g., as a relevance score) using a text embedding and a text-conditioned video embedding. The text-conditioned video embedding is a representation of the video used for evaluating the relevance of the video to the text, where the representation itself is a function of the text it is evaluated for. As such, the input text may be used to weight or attend to different frames of the video in determining the text-conditioned video embedding. The representation of the video may thus differ depending on the input text it is compared against. The text-conditioned video embedding may be determined in various ways, such as from the set of frames most similar to the input text (the top-k frames) or with an attention function based on query, key, and value projections.
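The two pooling strategies named in the abstract can be sketched as follows. This is an illustrative sketch only: the use of cosine similarity, mean pooling over the top-k frames, scaled dot-product attention, and all function and parameter names (`topk_video_embedding`, `attention_video_embedding`, `w_q`, `w_k`, `w_v`) are assumptions rather than details from the application. Embeddings are plain lists of floats and are assumed to be nonzero.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cosine(a, b):
    # Assumes both vectors are nonzero.
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))

def matvec(m, v):
    return [dot(row, v) for row in m]

def weighted_sum(weights, vectors):
    dim = len(vectors[0])
    return [sum(w * v[i] for w, v in zip(weights, vectors)) for i in range(dim)]

def topk_video_embedding(text, frames, k):
    """Average the k frame embeddings most similar to the text embedding."""
    ranked = sorted(frames, key=lambda f: cosine(text, f), reverse=True)[:k]
    return weighted_sum([1.0 / len(ranked)] * len(ranked), ranked)

def attention_video_embedding(text, frames, w_q, w_k, w_v):
    """Pool frames with attention: query from the text, keys/values from frames."""
    q = matvec(w_q, text)
    keys = [matvec(w_k, f) for f in frames]
    values = [matvec(w_v, f) for f in frames]
    logits = [dot(q, key) / math.sqrt(len(q)) for key in keys]
    peak = max(logits)  # subtract the max for a numerically stable softmax
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return weighted_sum([e / total for e in exps], values)

def relevance(text, video_embedding):
    """Relevance score of the text to the text-conditioned video embedding."""
    return cosine(text, video_embedding)
```

In either variant the same list of frame embeddings yields a different video embedding for each input text, which is the property the abstract describes. A trained model would learn the projection matrices; passing identity matrices is enough to exercise the sketch.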