-
公开(公告)号:US20240282131A1
公开(公告)日:2024-08-22
申请号:US18421672
申请日:2024-01-24
Applicant: Google LLC
Inventor: Jie Ren , Zhe Liu , James Urquhart Allingham , Michael Ward Dusenberry , Dustin Tran , Yin Cui , Balaji Lakshminarayanan , Xiuye Gu
IPC: G06V20/70 , G06F40/40 , G06V10/74 , G06V10/764 , G06V10/776
CPC classification number: G06V20/70 , G06F40/40 , G06V10/761 , G06V10/764 , G06V10/776
Abstract: Systems and methods for zero-shot prompt ensembling for zero-shot classification with text-image models can include utilizing a pre-trained text-image model to perform downstream tasks based on prompt-based weighting. The systems and methods may adjust for frequency-based bias and may automatically determine different prompt associations with a given downstream task. The systems and methods can aggregate weighted text embeddings and then determine a classification output based on similarity measures between an image embedding and the aggregated weighted text embeddings.