-
公开(公告)号:US11995899B2
公开(公告)日:2024-05-28
申请号:US17302291
申请日:2021-04-29
Applicant: Google LLC
Inventor: Qinge Wu , Grant Yoshida , Catherine Boulanger , Erik Hubert Dolly Goossens , Cem Keskin , Sofien Bouaziz , Jonathan James Taylor , Nidhi Rathi , Seth Raphael
CPC classification number: G06V20/63 , G02B27/0093 , G02B27/017 , G06V10/255 , G02B2027/0138 , G02B2027/014 , G06V30/10
Abstract: A head-mounted device (HMD) can be configured to determine a request for recognizing at least one content item included within content framed within a display of the HMD. The HMD can be configured to initiate a head-tracking process that maintains a coordinate system with respect to the content, and a pointer-tracking process that tracks a pointer that is visible together with the content within the display. The HMD can be configured to capture a first image of the content and a second image of the content, the second image including the pointer. The HMD can be configured to map a location of the pointer within the second image to a corresponding image location within the first image, using the coordinate system, and provide the at least one content item from the corresponding image location.
-
公开(公告)号:US20220350997A1
公开(公告)日:2022-11-03
申请号:US17302291
申请日:2021-04-29
Applicant: Google LLC
Inventor: Qinge Wu , Grant Yoshida , Catherine Boulanger , Erik Hubert Dolly Goossens , Cem Keskin , Sofien Bouaziz , Jonathan James Taylor , Nidhi Rathi , Seth Raphael
Abstract: A head-mounted device (HMD) can be configured to determine a request for recognizing at least one content item included within content framed within a display of the HMD. The HMD can be configured to initiate a head-tracking process that maintains a coordinate system with respect to the content, and a pointer-tracking process that tracks a pointer that is visible together with the content within the display. The HMD can be configured to capture a first image of the content and a second image of the content, the second image including the pointer. The HMD can be configured to map a location of the pointer within the second image to a corresponding image location within the first image, using the coordinate system, and provide the at least one content item from the corresponding image location.
-
公开(公告)号:US20180350087A1
公开(公告)日:2018-12-06
申请号:US15994509
申请日:2018-05-31
Applicant: Google LLC
Inventor: Adarsh Prakash Murthy KOWDLE , Vladimir TANKOVICH , Danhang TANG , Cem KESKIN , Jonathan James Taylor , Philip L. DAVIDSON , Shahram IZADI , Sean Ryan FANELLO , Julien Pascal Christophe VALENTIN , Christoph RHEMANN , Mingsong DOU , Sameh KHAMIS , David KIM
IPC: G06T7/593 , G06T7/32 , G06T7/33 , H04N13/271
CPC classification number: G06T7/521 , G06T7/593 , G06T2207/10021 , G06T2207/10024 , G06T2207/10152 , G06T2207/20072 , G06T2207/20076 , G06T2207/20081
Abstract: An electronic device estimates a depth map of an environment based on stereo depth images captured by depth cameras having exposure times that are offset from each other in conjunction with illuminators pulsing illumination patterns into the environment. A processor of the electronic device matches small sections of the depth images from the cameras to each other and to corresponding patches of immediately preceding depth images (e.g., a spatio-temporal image patch “cube”). The processor computes a matching cost for each spatio-temporal image patch cube by converting each spatio-temporal image patch into binary codes and defining a cost function between two stereo image patches as the difference between the binary codes. The processor minimizes the matching cost to generate a disparity map, and optimizes the disparity map by rejecting outliers using a decision tree with learned pixel offsets and refining subpixels to generate a depth map of the environment.
-
公开(公告)号:US12066282B2
公开(公告)日:2024-08-20
申请号:US17413847
申请日:2020-11-11
Applicant: GOOGLE LLC
Inventor: Sean Ryan Francesco Fanello , Kaiwen Guo , Peter Christopher Lincoln , Philip Lindsley Davidson , Jessica L. Busch , Xueming Yu , Geoffrey Harvey , Sergio Orts Escolano , Rohit Kumar Pandey , Jason Dourgarian , Danhang Tang , Adarsh Prakash Murthy Kowdle , Emily B. Cooper , Mingsong Dou , Graham Fyffe , Christoph Rhemann , Jonathan James Taylor , Shahram Izadi , Paul Ernest Debevec
IPC: G01B11/25 , G01B11/245 , G06T15/50 , G06T17/20
CPC classification number: G01B11/2513 , G01B11/245 , G06T15/506 , G06T17/205
Abstract: A lighting stage includes a plurality of lights that project alternating spherical color gradient illumination patterns onto an object or human performer at a predetermined frequency. The lighting stage also includes a plurality of cameras that capture images of an object or human performer corresponding to the alternating spherical color gradient illumination patterns. The lighting stage also includes a plurality of depth sensors that capture depth maps of the object or human performer at the predetermined frequency. The lighting stage also includes (or is associated with) one or more processors that implement a machine learning algorithm to produce a three-dimensional (3D) model of the object or human performer. The 3D model includes relighting parameters used to relight the 3D model under different lighting conditions.
-
公开(公告)号:US20240020915A1
公开(公告)日:2024-01-18
申请号:US18353213
申请日:2023-07-17
Applicant: Google LLC
Inventor: Yinda Zhang , Feitong Tan , Sean Ryan Francesco Fanello , Abhimitra Meka , Sergio Orts Escolano , Danhang Tang , Rohit Kumar Pandey , Jonathan James Taylor
Abstract: Techniques include introducing a neural generator configured to produce novel faces that can be rendered at free camera viewpoints (e.g., at any angle with respect to the camera) and relit under an arbitrary high dynamic range (HDR) light map. A neural implicit intrinsic field takes a randomly sampled latent vector as input and produces as output per-point albedo, volume density, and reflectance properties for any queried 3D location. These outputs are aggregated via a volumetric rendering to produce low resolution albedo, diffuse shading, specular shading, and neural feature maps. The low resolution maps are then upsampled to produce high resolution maps and input into a neural renderer to produce relit images.
-
公开(公告)号:US20220065620A1
公开(公告)日:2022-03-03
申请号:US17413847
申请日:2020-11-11
Applicant: GOOGLE LLC
Inventor: Sean Ryan Francesco Fanello , Kaiwen Guo , Peter Christopher Lincoln , Philip Lindsley Davidson , Jessica L. Busch , Xueming Yu , Geoffrey Harvey , Sergio Orts Escolano , Rohit Kumar Pandey , Jason Dourgarian , Danhang Tang , Adarsh Prakash Murthy Kowdle , Emily B. Cooper , Mingsong Dou , Graham Fyffe , Christoph Rhemann , Jonathan James Taylor , Shahram Izadi , Paul Ernest Debevec
IPC: G01B11/25 , G06T15/50 , G01B11/245 , G06T17/20
Abstract: A lighting stage includes a plurality of lights that project alternating spherical color gradient illumination patterns onto an object or human performer at a predetermined frequency. The lighting stage also includes a plurality of cameras that capture images of an object or human performer corresponding to the alternating spherical color gradient illumination patterns. The lighting stage also includes a plurality of depth sensors that capture depth maps of the object or human performer at the predetermined frequency. The lighting stage also includes (or is associated with) one or more processors that implement a machine learning algorithm to produce a three-dimensional (3D) model of the object or human performer. The 3D model includes relighting parameters used to relight the 3D model under different lighting conditions.
-
公开(公告)号:US11030773B2
公开(公告)日:2021-06-08
申请号:US16798881
申请日:2020-02-24
Applicant: Google LLC
Inventor: Jonathan James Taylor , Vladimir Tankovich , Danhang Tang , Cem Keskin , Adarsh Prakash Murthy Kowdle , Philip L. Davidson , Shahram Izadi , David Kim
Abstract: An electronic device estimates a pose of a hand by volumetrically deforming a signed distance field using a skinned tetrahedral mesh to locate a local minimum of an energy function, wherein the local minimum corresponds to the hand pose. The electronic device identifies a pose of the hand by fitting an implicit surface model of a hand to the pixels of a depth image that correspond to the hand. The electronic device uses a skinned tetrahedral mesh to warp space from a base pose to a deformed pose to define an articulated signed distance field from which the hand tracking module derives candidate poses of the hand. The electronic device then minimizes an energy function based on the distance of each corresponding pixel to identify the candidate pose that most closely approximates the pose of the hand.
-
公开(公告)号:US11164337B2
公开(公告)日:2021-11-02
申请号:US16956730
申请日:2019-10-04
Applicant: Google LLC
Inventor: Edward T. Chang , Mykhaylo Kurinnyy , Zhaowei Wang , Jonathan James Taylor
IPC: G06T7/80 , G06T7/73 , H04N13/239 , H04N13/254
Abstract: This document describes techniques and systems that enable autocalibration for multiple cameras using near-infrared (NIR) illuminators. The techniques and systems include a user device that uses NIR images, including dot images and flood images, captured by the multiple cameras. The user device implements an autocalibration module that normalizes the contrast of each image. Then, the autocalibration module detects dot features in the dot images and detects flood features in the flood images. The autocalibration module uses the flood features to disambiguate the dot features in the dot images. Then, the autocalibration module uses the disambiguated dot features to determine new calibration parameters for recalibration of the multiple cameras. In some aspects, the autocalibration module uses the dot images, rather than the flood images, to detect the flood features for disambiguating the dot features.
-
公开(公告)号:US20210248782A1
公开(公告)日:2021-08-12
申请号:US16956730
申请日:2019-10-04
Applicant: Google LLC
Inventor: Edward T. Chang , Mykhaylo Kurinnyy , Zhaowei Wang , Jonathan James Taylor
IPC: G06T7/80 , G06T7/73 , H04N13/239 , H04N13/254
Abstract: This document describes techniques and systems that enable autocalibration for multiple cameras using near-infrared (NIR) illuminators. The techniques and systems include a user device that uses NIR images, including dot images and flood images, captured by the multiple cameras. The user device implements an autocalibration module that normalizes the contrast of each image. Then, the autocalibration module detects dot features in the dot images and detects flood features in the flood images. The autocalibration module uses the flood features to disambiguate the dot features in the dot images. Then, the autocalibration module uses the disambiguated dot features to determine new calibration parameters for recalibration of the multiple cameras. In some aspects, the autocalibration module uses the dot images, rather than the flood images, to detect the flood features for disambiguating the dot features.
-
公开(公告)号:US10824226B2
公开(公告)日:2020-11-03
申请号:US16002595
申请日:2018-06-07
Applicant: Google LLC
Abstract: An electronic device estimates a pose of a face by fitting a generative face model mesh to a depth map based on vertices of the face model mesh that are estimated to be visible from the point of view of a depth camera. A face tracking module of the electronic device receives a depth image of a face from a depth camera and generates a depth map of the face based on the depth image. The face tracking module identifies a pose of the face by fitting a face model mesh to the pixels of a depth map that correspond to the vertices of the face model mesh that are estimated to be visible from the point of view of the depth camera.
-
-
-
-
-
-
-
-
-