-
公开(公告)号:US20230281979A1
公开(公告)日:2023-09-07
申请号:US18006078
申请日:2020-08-03
Applicant: Xuhui JIA , Raviteja VEMULAPALLI , Yukun ZHU , Bradley Ray GREEN , Bardia DOOSTI , Ching-Hui CHEN , Google LLC
Inventor: Xuhui Jia , Raviteja Vemulapalli , Bradley Ray Green , Bardia Doosti , Ching-Hui Chen
IPC: G06V10/82 , G06V10/776
CPC classification number: G06V10/82 , G06V10/776
Abstract: Systems and methods of the present disclosure are directed to a method for training a machine-learned visual attention model. The method can include obtaining image data that depicts a head of a person and an additional entity. The method can include processing the image data with an encoder portion of the visual attention model to obtain latent head and entity encodings. The method can include processing the latent encodings with the visual attention model to obtain a visual attention value and processing the latent encodings with a machine-learned visual location model to obtain a visual location estimation. The method can include training the models by evaluating a loss function that evaluates differences between the visual location estimation and a pseudo visual location label derived from the image data and between the visual attention value and a ground truth visual attention label.
-
2.
公开(公告)号:US20230113131A1
公开(公告)日:2023-04-13
申请号:US17909579
申请日:2020-03-05
Applicant: Shawn O'Banion , Wenhuan WEI , Yukun ZHU , Google LLC
Inventor: Shawn Ryan O'Banion , Wenhuan Wei , Yukun Zhu
Abstract: The present disclosure is directed to systems and methods for performing automated labeling of images. Labeled images can be used to train machine-learned models to infer image attributes such as quality for suggesting user actions.
-