-
公开(公告)号:US11335093B2
公开(公告)日:2022-05-17
申请号:US16966102
申请日:2019-06-12
Applicant: Google LLC
Inventor: Abhinav Shrivastava , Alireza Fathi , Sergio Guadarrama Cotado , Kevin Patrick Murphy , Carl Martin Vondrick
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing visual tracking. In one aspect, a method comprises receiving: (i) one or more reference video frames, (ii) respective reference labels for each of a plurality of reference pixels in the reference video frames, and (iii) a target video frame. The reference video frames and the target video frame are processed using a colorization machine learning model to generate respective pixel similarity measures between each of (i) a plurality of target pixels in the target video frame, and (ii) the reference pixels in the reference video frames. A respective target label is determined for each target pixel in the target video frame, comprising: combining (i) the reference labels for the reference pixels in the reference video frames, and (ii) the pixel similarity measures.
-
公开(公告)号:US20210089777A1
公开(公告)日:2021-03-25
申请号:US16966102
申请日:2019-06-12
Applicant: Google LLC
Inventor: Abhinav Shrivastava , Alireza Fathi , Sergio Guadarrama Cotado , Kevin Patrick Murphy , Carl Martin Vondrick
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing visual tracking. In one aspect, a method comprises receiving: (i) one or more reference video frames, (ii) respective reference labels for each of a plurality of reference pixels in the reference video frames, and (iii) a target video frame. The reference video frames and the target video frame are processed using a colorization machine learning model to generate respective pixel similarity measures between each of (i) a plurality of target pixels in the target video frame, and (ii) the reference pixels in the reference video frames. A respective target label is determined for each target pixel in the target video frame, comprising: combining (i) the reference labels for the reference pixels in the reference video frames, and (ii) the pixel similarity measures.
-
公开(公告)号:US20240338387A1
公开(公告)日:2024-10-10
申请号:US18626268
申请日:2024-04-03
Applicant: Google LLC
Inventor: Ahmet Iscen , Alireza Fathi , Cordelia Luise Schmid
IPC: G06F16/28 , G06F16/242
CPC classification number: G06F16/285 , G06F16/2438
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing a classification task on a data item. In particular, a system classifies an input data item using key and value embeddings of memory data items.
-
-