-
公开(公告)号:US12053301B2
公开(公告)日:2024-08-06
申请号:US17339128
申请日:2021-06-04
Applicant: Google LLC
Inventor: Avneesh Sud , Steven Hickson , Vivek Kwatra , Nicholas Dufour
IPC: A61B5/00 , A61B5/16 , G01S3/00 , G06F3/01 , G06F18/214 , G06F18/2413 , G06N3/04 , G06N3/045 , G06N3/08 , G06V10/44 , G06V10/764 , G06V10/82 , G06V20/20 , G06V40/16 , H04N23/60 , H04N23/611 , G06V40/19 , H04N13/344
CPC classification number: A61B5/6803 , A61B5/165 , G01S3/00 , G06F3/013 , G06F18/214 , G06F18/2413 , G06N3/04 , G06N3/045 , G06N3/08 , G06V10/454 , G06V10/764 , G06V10/82 , G06V20/20 , G06V40/174 , H04N23/60 , H04N23/611 , A61B5/163 , A61B5/7267 , A61B5/7275 , G06V40/19 , H04N13/344
Abstract: Images of a plurality of users are captured concurrently with the plurality of users evincing a plurality of expressions. The images are captured using one or more eye tracking sensors implemented in one or more head mounted devices (HMDs) worn by the plurality of first users. A machine learnt algorithm is trained to infer labels indicative of expressions of the users in the images. A live image of a user is captured using an eye tracking sensor implemented in an HMD worn by the user. A label of an expression evinced by the user in the live image is inferred using the machine learnt algorithm that has been trained to predict labels indicative of expressions. The images of the users and the live image can be personalized by combining the images with personalization images of the users evincing a subset of the expressions.
-
公开(公告)号:US20200027002A1
公开(公告)日:2020-01-23
申请号:US16511637
申请日:2019-07-15
Applicant: Google LLC
Inventor: Steven Hickson , Anelia Angelova , Irfan Aziz Essa , Rahul Sukthankar
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining a clustering of images into a plurality of semantic categories. In one aspect, a method comprises: training a categorization neural network, comprising, at each of a plurality of iterations: processing an image depicting an object using the categorization neural network to generate (i) a current prediction for whether the image depicts an object or a background region, and (ii) a current embedding of the image; determining a plurality of current cluster centers based on the current values of the categorization neural network parameters, wherein each cluster center represents a respective semantic category; and determining a gradient of an objective function that includes a classification loss and a clustering loss, wherein the clustering loss depends on a similarity between the current embedding of the image and the current cluster centers.
-
公开(公告)号:US11042729B2
公开(公告)日:2021-06-22
申请号:US15831823
申请日:2017-12-05
Applicant: Google LLC
Inventor: Avneesh Sud , Steven Hickson , Vivek Kwatra , Nicholas Dufour
IPC: G06K9/00 , A61B5/16 , G06N3/08 , G06N3/04 , H04N5/232 , G06K9/62 , G06F3/01 , A61B5/00 , G01S3/00 , G06K9/46 , H04N13/344
Abstract: Images of a plurality of users are captured concurrently with the plurality of users evincing a plurality of expressions. The images are captured using one or more eye tracking sensors implemented in one or more head mounted devices (HMDs) worn by the plurality of first users. A machine learnt algorithm is trained to infer labels indicative of expressions of the users in the images. A live image of a user is captured using an eye tracking sensor implemented in an HMD worn by the user. A label of an expression evinced by the user in the live image is inferred using the machine learnt algorithm that has been trained to predict labels indicative of expressions. The images of the users and the live image can be personalized by combining the images with personalization images of the users evincing a subset of the expressions.
-
公开(公告)号:US20180314881A1
公开(公告)日:2018-11-01
申请号:US15831823
申请日:2017-12-05
Applicant: Google LLC
Inventor: Avneesh Sud , Steven Hickson , Vivek Kwatra , Nicholas Dufour
Abstract: Images of a plurality of users are captured concurrently with the plurality of users evincing a plurality of expressions. The images are captured using one or more eye tracking sensors implemented in one or more head mounted devices (HMDs) worn by the plurality of first users. A machine learnt algorithm is trained to infer labels indicative of expressions of the users in the images. A live image of a user is captured using an eye tracking sensor implemented in an HMD worn by the user. A label of an expression evinced by the user in the live image is inferred using the machine learnt algorithm that has been trained to predict labels indicative of expressions. The images of the users and the live image can be personalized by combining the images with personalization images of the users evincing a subset of the expressions.
-
公开(公告)号:US20250032045A1
公开(公告)日:2025-01-30
申请号:US18792056
申请日:2024-08-01
Applicant: Google LLC
Inventor: Avneesh Sud , Steven Hickson , Vivek Kwatra , Nicholas Dufour
IPC: A61B5/00 , A61B5/16 , G01S3/00 , G06F3/01 , G06F18/214 , G06F18/2413 , G06N3/04 , G06N3/045 , G06N3/08 , G06V10/44 , G06V10/764 , G06V10/82 , G06V20/20 , G06V40/16 , G06V40/19 , H04N13/344 , H04N23/60 , H04N23/611
Abstract: Images of a plurality of users are captured concurrently with the plurality of users evincing a plurality of expressions. The images are captured using one or more eye tracking sensors implemented in one or more head mounted devices (HMDs) worn by the plurality of first users. A machine learnt algorithm is trained to infer labels indicative of expressions of the users in the images. A live image of a user is captured using an eye tracking sensor implemented in an HMD worn by the user. A label of an expression evinced by the user in the live image is inferred using the machine learnt algorithm that has been trained to predict labels indicative of expressions. The images of the users and the live image can be personalized by combining the images with personalization images of the users evincing a subset of the expressions.
-
公开(公告)号:US20210295025A1
公开(公告)日:2021-09-23
申请号:US17339128
申请日:2021-06-04
Applicant: Google LLC
Inventor: Avneesh Sud , Steven Hickson , Vivek Kwatra , Nicholas Dufour
IPC: G06K9/00 , A61B5/16 , G06N3/08 , G06N3/04 , H04N5/232 , G06K9/62 , G06F3/01 , A61B5/00 , G01S3/00 , G06K9/46
Abstract: Images of a plurality of users are captured concurrently with the plurality of users evincing a plurality of expressions. The images are captured using one or more eye tracking sensors implemented in one or more head mounted devices (HMDs) worn by the plurality of first users. A machine learnt algorithm is trained to infer labels indicative of expressions of the users in the images. A live image of a user is captured using an eye tracking sensor implemented in an HMD worn by the user. A label of an expression evinced by the user in the live image is inferred using the machine learnt algorithm that has been trained to predict labels indicative of expressions. The images of the users and the live image can be personalized by combining the images with personalization images of the users evincing a subset of the expressions.
-
-
-
-
-