Patent search: ap:("THE TORONTO-DOMINION BANK") AND inv:"Noël Vouitsis" (page 1)

1.

Published invention application
CALIBRATED MODEL INTERVENTION WITH CONFORMAL THRESHOLD (status: under examination, published)

Publication number: US20240330772A1

Publication date: 2024-10-03

Application number: US18618757

Filing date: 2024-03-27

Applicant: THE TORONTO-DOMINION BANK

Inventors: Jesse Cole Cresswell, Noël Vouitsis, Yi Sui

IPC classification: G06N20/00

CPC classification: G06N20/00

Abstract: A classification model is calibrated with a conformal threshold so that its classifications carry a known error rate. Rather than using the model outputs directly, the classification model outputs are processed into a conformal score that is compared with a conformal threshold to determine whether a data sample is a member of a class. When exactly one class passes the conformal threshold for inclusion, an action associated with that class can be applied confidently with a known error rate. When zero classes or multiple classes pass the threshold, this may indicate sufficient uncertainty in the model prediction, and the data sample may be escalated to another decision mechanism, such as manual review or a more complex classification model.
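
The decision rule described in the abstract above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the method claimed in the application: the abstract does not say how the conformal score is computed, so the sketch assumes the common split-conformal choice of one minus the predicted probability of a class, and calibrates the threshold on a held-out labelled set. The names `conformal_threshold`, `prediction_set`, `decide`, and the parameter `alpha` (target error rate) are hypothetical.

```python
import math


def conformal_threshold(cal_probs, cal_labels, alpha):
    """Calibrate a threshold on held-out data (split-conformal, assumed here)."""
    # Assumed conformal score: 1 - probability assigned to the true class.
    scores = sorted(1.0 - p[y] for p, y in zip(cal_probs, cal_labels))
    n = len(scores)
    # Finite-sample-corrected rank of the score to use as the threshold.
    rank = math.ceil((n + 1) * (1.0 - alpha))
    if rank > n:
        # Too few calibration samples for this alpha: every class is included.
        return float("inf")
    return scores[rank - 1]


def prediction_set(probs, threshold):
    """Return the classes whose conformal score passes the threshold."""
    return [k for k, p in enumerate(probs) if 1.0 - p <= threshold]


def decide(probs, threshold):
    """Act when exactly one class passes; otherwise escalate."""
    classes = prediction_set(probs, threshold)
    if len(classes) == 1:
        return ("act", classes[0])
    # Zero or multiple classes: hand off to manual review or another model.
    return ("escalate", classes)


# Toy calibration data: three samples, two classes (illustrative values only).
cal_probs = [[0.9, 0.1], [0.2, 0.8], [0.6, 0.4]]
cal_labels = [0, 1, 0]
threshold = conformal_threshold(cal_probs, cal_labels, alpha=0.5)

confident = decide([0.95, 0.05], threshold)   # one class passes
uncertain = decide([0.5, 0.5], threshold)     # no class passes
ambiguous = decide([0.5, 0.5], 0.6)           # both classes pass
```

With a threshold calibrated this way, the single-class case is the only one in which the associated action is applied automatically; both the empty set and the multi-class set are routed to the escalation path.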

2.

Published invention application
TEXT-CONDITIONED VIDEO REPRESENTATION (status: under examination, published)

Publication number: US20230351753A1

Publication date: 2023-11-02

Application number: US17894738

Filing date: 2022-08-24

Applicant: THE TORONTO-DOMINION BANK

Inventors: Satya Krishna Gorti, Junwei Ma, Guangwei Yu, Maksims Volkovs, Keyvan Golestan Irani, Noël Vouitsis

IPC classification: G06V20/40

CPC classification: G06V20/47, G06V20/41

Abstract: A text-video recommendation model determines the relevance of a text to a video in a text-video pair (e.g., as a relevance score) using a text embedding and a text-conditioned video embedding. The text-conditioned video embedding is a representation of the video used for evaluating the relevance of the video to the text, where the representation itself is a function of the text it is evaluated for. As such, the input text may be used to weight or attend to different frames of the video in determining the text-conditioned video embedding. The representation of the video may thus differ for different input texts. The text-conditioned video embedding may be determined in various ways, such as from the set of frames most similar to the input text (the top-k frames) or with an attention function using query, key, and value projections.
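
The two pooling strategies named in the abstract above can be sketched as follows. This is an illustrative sketch only, with several assumptions the abstract does not state: frame and text embeddings are plain lists of floats with nonzero norm, top-k frames are chosen by cosine similarity and mean-pooled, the attention variant uses scaled dot-product attention, and the projection matrices `w_q`, `w_k`, `w_v` are hypothetical parameters (identity matrices in the example).

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cosine(a, b):
    # Assumes both vectors have nonzero norm.
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))


def matvec(matrix, vec):
    # matrix is a list of rows; returns matrix @ vec.
    return [dot(row, vec) for row in matrix]


def topk_video_embedding(text_emb, frame_embs, k):
    """Mean-pool the k frames most similar to the text."""
    ranked = sorted(frame_embs, key=lambda f: cosine(text_emb, f), reverse=True)
    top = ranked[:k]
    return [sum(f[i] for f in top) / len(top) for i in range(len(top[0]))]


def attention_video_embedding(text_emb, frame_embs, w_q, w_k, w_v):
    """Text is the query; frames supply the keys and values."""
    query = matvec(w_q, text_emb)
    keys = [matvec(w_k, f) for f in frame_embs]
    values = [matvec(w_v, f) for f in frame_embs]
    scale = math.sqrt(len(query))
    logits = [dot(query, key) / scale for key in keys]
    # Numerically stable softmax over frames.
    peak = max(logits)
    exps = [math.exp(l - peak) for l in logits]
    total = sum(exps)
    weights = [e / total for e in exps]
    return [sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))]


# Toy 2-dimensional embeddings (illustrative values only).
frames = [[1.0, 0.0], [0.0, 1.0], [0.8, 0.6]]
text_a = [1.0, 0.0]
text_b = [0.0, 1.0]
identity = [[1.0, 0.0], [0.0, 1.0]]

video_for_a = topk_video_embedding(text_a, frames, k=2)
video_for_b = topk_video_embedding(text_b, frames, k=2)
attended_a = attention_video_embedding(text_a, frames[:2], identity, identity, identity)
score_a = cosine(text_a, video_for_a)  # relevance score for the (text_a, video) pair
```

The same three frames yield different video embeddings for `text_a` and `text_b`, which is the sense in which the representation is a function of the text it is evaluated for.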