-
公开(公告)号:US12211276B2
公开(公告)日:2025-01-28
申请号:US17526969
申请日:2021-11-15
Applicant: QUALCOMM Technologies, Inc.
Inventor: Christen Maximilian Filtenborg , Deepak Kumar Gupta
Abstract: A computer-implemented method for tracking with visual object constraints includes receiving a lingual constraint and a video. A word embedding is generated based on the lingual constraint. A set of features is extracted for one or more frames of the video. The word embedding is cross-correlated to the set of features for the one or more frames of the video. A prediction indicating whether the lingual constraint is in the one or more frames of the video is generated based on the cross-correlation.