-
公开(公告)号:US11868523B2
公开(公告)日:2024-01-09
申请号:US17305219
申请日:2021-07-01
Applicant: GOOGLE LLC
Inventor: Ivana Tosic Rodgers , Sean Ryan Francesco Fanello , Sofien Bouaziz , Rohit Kumar Pandey , Eric Aboussouan , Adarsh Prakash Murthy Kowdle
CPC classification number: G06F3/013 , G02B27/0093 , G02B27/0101 , G02B27/0172 , G06F18/23 , G06N3/08 , G02B2027/0138 , G02B2027/0178
Abstract: Techniques of tracking a user's gaze includes identifying a region of a display at which a gaze of a user is directed, the region including a plurality of pixels. By determining a region rather than a point, when the regions correspond to elements of a user interface, the improved technique enables a system to activate the element to which a determined region is selected. In some implementations, the system makes the determination using a classification engine including a convolutional neural network; such an engine takes as input images of the user's eye and outputs a list of probabilities that the gaze is directed to each of the regions.
-
公开(公告)号:US20230004216A1
公开(公告)日:2023-01-05
申请号:US17305219
申请日:2021-07-01
Applicant: GOOGLE LLC
Inventor: Ivana Tosic Rodgers , Sean Ryan Francesco Fanello , Sofien Bouaziz , Rohit Kumar Pandey , Eric Aboussouan , Adarsh Prakash Murthy Kowdle
Abstract: Techniques of tracking a user's gaze includes identifying a region of a display at which a gaze of a user is directed, the region including a plurality of pixels. By determining a region rather than a point, when the regions correspond to elements of a user interface, the improved technique enables a system to activate the element to which a determined region is selected. In some implementations, the system makes the determination using a classification engine including a convolutional neural network; such an engine takes as input images of the user's eye and outputs a list of probabilities that the gaze is directed to each of the regions.
-
公开(公告)号:US20220065620A1
公开(公告)日:2022-03-03
申请号:US17413847
申请日:2020-11-11
Applicant: GOOGLE LLC
Inventor: Sean Ryan Francesco Fanello , Kaiwen Guo , Peter Christopher Lincoln , Philip Lindsley Davidson , Jessica L. Busch , Xueming Yu , Geoffrey Harvey , Sergio Orts Escolano , Rohit Kumar Pandey , Jason Dourgarian , Danhang Tang , Adarsh Prakash Murthy Kowdle , Emily B. Cooper , Mingsong Dou , Graham Fyffe , Christoph Rhemann , Jonathan James Taylor , Shahram Izadi , Paul Ernest Debevec
IPC: G01B11/25 , G06T15/50 , G01B11/245 , G06T17/20
Abstract: A lighting stage includes a plurality of lights that project alternating spherical color gradient illumination patterns onto an object or human performer at a predetermined frequency. The lighting stage also includes a plurality of cameras that capture images of an object or human performer corresponding to the alternating spherical color gradient illumination patterns. The lighting stage also includes a plurality of depth sensors that capture depth maps of the object or human performer at the predetermined frequency. The lighting stage also includes (or is associated with) one or more processors that implement a machine learning algorithm to produce a three-dimensional (3D) model of the object or human performer. The 3D model includes relighting parameters used to relight the 3D model under different lighting conditions.
-
公开(公告)号:US11030773B2
公开(公告)日:2021-06-08
申请号:US16798881
申请日:2020-02-24
Applicant: Google LLC
Inventor: Jonathan James Taylor , Vladimir Tankovich , Danhang Tang , Cem Keskin , Adarsh Prakash Murthy Kowdle , Philip L. Davidson , Shahram Izadi , David Kim
Abstract: An electronic device estimates a pose of a hand by volumetrically deforming a signed distance field using a skinned tetrahedral mesh to locate a local minimum of an energy function, wherein the local minimum corresponds to the hand pose. The electronic device identifies a pose of the hand by fitting an implicit surface model of a hand to the pixels of a depth image that correspond to the hand. The electronic device uses a skinned tetrahedral mesh to warp space from a base pose to a deformed pose to define an articulated signed distance field from which the hand tracking module derives candidate poses of the hand. The electronic device then minimizes an energy function based on the distance of each corresponding pixel to identify the candidate pose that most closely approximates the pose of the hand.
-
公开(公告)号:US10937182B2
公开(公告)日:2021-03-02
申请号:US15994471
申请日:2018-05-31
Applicant: Google LLC
Inventor: Mingsong Dou , Sean Ryan Fanello , Adarsh Prakash Murthy Kowdle , Christoph Rhemann , Sameh Khamis , Philip L. Davidson , Shahram Izadi , Vladimir Tankovich
Abstract: An electronic device estimates a pose of one or more subjects in an environment based on estimating a correspondence between a data volume containing a data mesh based on a current frame captured by a depth camera and a reference volume containing a plurality of fused prior data frames based on spectral embedding and performing bidirectional non-rigid matching between the reference volume and the current data frame to refine the correspondence so as to support location-based functionality. The electronic device predicts correspondences between the data volume and the reference volume based on spectral embedding. The correspondences provide constraints that accelerate the convergence between the data volume and the reference volume. By tracking changes between the current data mesh frame and the reference volume, the electronic device avoids tracking failures that can occur when relying solely on a previous data mesh frame.
-
公开(公告)号:US11810313B2
公开(公告)日:2023-11-07
申请号:US17249095
申请日:2021-02-19
Applicant: GOOGLE LLC
Inventor: Vladimir Tankovich , Christian Haene , Sean Ryan Francesco Fanello , Yinda Zhang , Shahram Izadi , Sofien Bouaziz , Adarsh Prakash Murthy Kowdle , Sameh Khamis
CPC classification number: G06T7/593 , G06T3/0093 , G06T3/40 , G06T5/30 , H04N13/20 , G06T2207/20016 , G06T2207/20084 , H04N2013/0081
Abstract: According to an aspect, a real-time active stereo system includes a capture system configured to capture stereo data, where the stereo data includes a first input image and a second input image, and a depth sensing computing system configured to predict a depth map. The depth sensing computing system includes a feature extractor configured to extract features from the first and second images at a plurality of resolutions, an initialization engine configured to generate a plurality of depth estimations, where each of the plurality of depth estimations corresponds to a different resolution, and a propagation engine configured to iteratively refine the plurality of depth estimations based on image warping and spatial propagation.
-
公开(公告)号:US11756223B2
公开(公告)日:2023-09-12
申请号:US17344256
申请日:2021-06-10
Applicant: Google LLC
Inventor: Tim Phillip Wantland , Brandon Charles Barbello , Christopher Max Breithaupt , Michael John Schoenberg , Adarsh Prakash Murthy Kowdle , Bryan Woods , Anshuman Kumar
IPC: G06T7/593 , G06T7/174 , G06T19/20 , H04N13/128 , H04N13/00
CPC classification number: G06T7/593 , G06T7/174 , G06T19/20 , H04N13/128 , H04N2013/0081
Abstract: The methods and systems described herein provide for depth-aware image editing and interactive features. In particular, a computer application may provide image-related features that utilize a combination of a (a) the depth map, and (b) segmentation data to process one or more images, and generate an edited version of the one or more images.
-
公开(公告)号:US20230186575A1
公开(公告)日:2023-06-15
申请号:US17998903
申请日:2020-05-22
Applicant: GOOGLE LLC
Inventor: Eric Turner , Keisuke Tateno , Konstantine Nicholas John Tsotsos , Adarsh Prakash Murthy Kowdle , Vaibhav Gupta , Ambrus Csaszar
CPC classification number: G06T19/006 , G06T1/60
Abstract: A method including receiving a first depth image associated with a first frame at a first time of an augmented reality (AR) application, the first depth image representing at least a first portion of a real-world space storing the first depth image receiving a second depth image associated with a second frame at a second time, after the first time, of the AR application, the second depth image representing at least a second portion of the real-world space generating a real-world image by blending, at least, the stored first depth image with the second depth image receiving a rendered AR object combining the AR object in the real-world image and displaying the real-world image combined with the AR object.
-
公开(公告)号:US11100664B2
公开(公告)日:2021-08-24
申请号:US16720743
申请日:2019-12-19
Applicant: Google LLC
Inventor: Tim Phillip Wantland , Brandon Charles Barbello , Christopher Max Breithaupt , Michael John Schoenberg , Adarsh Prakash Murthy Kowdle , Bryan Woods , Anshuman Kumar
Abstract: The methods and systems described herein provide for depth-aware image editing and interactive features. In particular, a computer application may provide image-related features that utilize a combination of a (a) the depth map, and (b) segmentation data to process one or more images, and generate an edited version of the one or more images.
-
公开(公告)号:US11037026B2
公开(公告)日:2021-06-15
申请号:US16749626
申请日:2020-01-22
Applicant: Google LLC
Inventor: Sean Ryan Fanello , Julien Pascal Christophe Valentin , Adarsh Prakash Murthy Kowdle , Christoph Rhemann , Vladimir Tankovich , Philip L. Davidson , Shahram Izadi
IPC: G06K9/62
Abstract: Values of pixels in an image are mapped to a binary space using a first function that preserves characteristics of values of the pixels. Labels are iteratively assigned to the pixels in the image in parallel based on a second function. The label assigned to each pixel is determined based on values of a set of nearest-neighbor pixels. The first function is trained to map values of pixels in a set of training images to the binary space and the second function is trained to assign labels to the pixels in the set of training images. Considering only the nearest neighbors in the inference scheme results in a computational complexity that is independent of the size of the solution space and produces sufficient approximations of the true distribution when the solution for each pixel is most likely found in a small subset of the set of potential solutions.
-
-
-
-
-
-
-
-
-