-
公开(公告)号:US20240355107A1
公开(公告)日:2024-10-24
申请号:US18684883
申请日:2021-08-23
Applicant: Google LLC
Inventor: Orly Liba , Michael Garth Milne , Navin Padman Sarma , Doron Kukliansky , Huizhong Chen , Yael Pritch Knaan
CPC classification number: G06V10/82 , G06T5/60 , G06V10/462 , G06T2207/20084 , G06T2207/20132 , G06V2201/07 , G06V2201/10
Abstract: A method includes receiving training data comprising a plurality of images. one or more identified objects in each of the plurality of images. and a detection score associated with each of the one or more identified objects. wherein the detection score for an object is indicative of a degree to which a portion of an image corresponds to the object. The method also includes training a neural network based on the training data to predict a distractor score for at least one object of the one or more identified objects in an input image, wherein the at least one object is selected based on an associated detection score, and wherein the distractor score for the at least one object is indicative of a perceived visual distraction caused by a presence of the at least one object in the input image. The method additionally includes outputting the trained neural network.
-
公开(公告)号:US11599747B2
公开(公告)日:2023-03-07
申请号:US17090948
申请日:2020-11-06
Applicant: Google LLC
Inventor: Yael Pritch Knaan , Marc Levoy , Neal Wadhwa , Rahul Garg , Sameer Ansari , Jiawen Chen
IPC: G06K9/62
Abstract: Apparatus and methods related to using machine learning to determine depth maps for dual pixel images of objects are provided. A computing device can receive a dual pixel image of at least a foreground object. The dual pixel image can include a plurality of dual pixels. A dual pixel of the plurality of dual pixels can include a left-side pixel and a right-side pixel that both represent light incident on a single dual pixel element used to capture the dual pixel image. The computing device can be used to train a machine learning system to determine a depth map associated with the dual pixel image. The computing device can provide the trained machine learning system.
-
公开(公告)号:US20230015117A1
公开(公告)日:2023-01-19
申请号:US17856370
申请日:2022-07-01
Applicant: Google LLC
Inventor: Kfir Aberman , David Edward Jacobs , Kai Jochen Kohlhoff , Michael Rubinstein , Yossi Gandelsman , Junfeng He , Inbar Mosseri , Yael Pritch Knaan
Abstract: Techniques for tuning an image editing operator for reducing a distractor in raw image data are presented herein. The image editing operator can access the raw image data and a mask. The mask can indicate a region of interest associated with the raw image data. The image editing operator can process the raw image data and the mask to generate processed image data. Additionally, a trained saliency model can process at least the processed image data within the region of interest to generate a saliency map that provides saliency values. Moreover, a saliency loss function can compare the saliency values provided by the saliency map for the processed image data within the region of interest to one or more target saliency values. Subsequently, the one or more parameter values of the image editing operator can be modified based at least in part on the saliency loss function.
-
公开(公告)号:US20200242788A1
公开(公告)日:2020-07-30
申请号:US16652568
申请日:2017-12-05
Applicant: Google LLC
Inventor: David Jacobs , Rahul Garg , Yael Pritch Knaan , Neal Wadhwa , Marc Levoy
Abstract: A camera may capture an image of a scene and use the image to generate a first and a second subpixel image of the scene. The pair of subpixel images may be represented by a first set of subpixels and a second set of subpixels from the image respectively. Each pixel of the image may include two green subpixels that are respectively represented in the first and second subpixel images. The camera may determine a disparity between a portion of the scene as represented by the pair of subpixel images and may estimate a depth map of the scene that indicates a depth of the portion relative to other portions of the scene based on the disparity and a baseline distance between the two green subpixels. A new version of the image may be generated with a focus upon the portion and with the other portions of the scene blurred.
-
公开(公告)号:US12217472B2
公开(公告)日:2025-02-04
申请号:US17968634
申请日:2022-10-18
Applicant: Google LLC
Inventor: Orly Liba , Nikhil Karnad , Nori Kanazawa , Yael Pritch Knaan , Huizhong Chen , Longqi Cai
IPC: G06V10/26 , G06T5/20 , G06T5/77 , G06T5/94 , G06T11/00 , G06V10/764 , G06V10/774 , G06V20/20
Abstract: A media application generates training data that includes a first set of visual media items and a second set of visual media items, where the first set of visual media items correspond to the second set of visual items and include distracting objects that are manually segmented. The media application trains a segmentation machine-learning model based on the training data to receive a visual media item with one or more distracting objects and to output a segmentation mask for one or more segmented objects that correspond to the one or more distracting objects.
-
6.
公开(公告)号:US12175642B2
公开(公告)日:2024-12-24
申请号:US17726720
申请日:2022-04-22
Applicant: Google LLC
Inventor: Noritsugu Kanazawa , Neal Wadhwa , Yael Pritch Knaan
Abstract: Systems and methods for augmenting images can utilize one or more image augmentation models and one or more texture transfer blocks. The image augmentation model can process input images and one or more segmentation masks to generate first output data. The first output data and the one or more segmentation masks can be processed with the texture transfer block to generate an augmented image. The input image can depict a scene with one or more occlusions, and the augmented image can depict the scene with the one or more occlusions replaced with predicted pixel data.
-
公开(公告)号:US12169911B2
公开(公告)日:2024-12-17
申请号:US18334700
申请日:2023-06-14
Applicant: Google LLC
Inventor: Kfir Aberman , Yotam Nitzan , Orly Liba , Yael Pritch Knaan , Qiurui He , Inbar Mosseri , Yossi Gandelsman , Michal Yarom
Abstract: Systems and methods for identifying a personalized prior within a generative model's latent vector space based on a set of images of a given subject. In some examples, the present technology may further include using the personalized prior to confine the inputs of a generative model to a latent vector space associated with the given subject, such that when the model is tasked with editing an image of the subject (e.g., to perform inpainting to fill in masked areas, improve resolution, or deblur the image), the subject's identifying features will be reflected in the images the model produces.
-
公开(公告)号:US20240296596A1
公开(公告)日:2024-09-05
申请号:US18569844
申请日:2023-08-23
Applicant: Google LLC
Inventor: Kfir Aberman , Nataniel Ruiz Gutierrez , Michael Rubinstein , Yuanzhen Li , Yael Pritch Knaan , Varun Jampani
IPC: G06T11/00 , G06V10/764
CPC classification number: G06T11/00 , G06V10/764 , G06V2201/07
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a text-to-image model so that the text-to-image model generates images that each depict a variable instance of an object class when the object class without the unique identifier is provided as a text input, and that generates images that each depict a same subject instance of the object class when the unique identifier is provided as the text input.
-
公开(公告)号:US20230325998A1
公开(公告)日:2023-10-12
申请号:US18334700
申请日:2023-06-14
Applicant: Google LLC
Inventor: Kfir Aberman , Yotam Nitzan , Orly Liba , Yael Pritch Knaan , Qiurui He , Inbar Mosseri , Yossi Gandelsman , Michal Yarom
CPC classification number: G06T5/50 , G06T5/001 , G06T3/40 , G06T2207/20081 , G06T2207/20084
Abstract: Systems and methods for identifying a personalized prior within a generative model's latent vector space based on a set of images of a given subject. In some examples, the present technology may further include using the personalized prior to confine the inputs of a generative model to a latent vector space associated with the given subject, such that when the model is tasked with editing an image of the subject (e.g., to perform inpainting to fill in masked areas, improve resolution, or deblur the image), the subject's identifying features will be reflected in the images the model produces.
-
公开(公告)号:US20220230323A1
公开(公告)日:2022-07-21
申请号:US17617560
申请日:2019-07-15
Applicant: Google LLC
Inventor: Orly Liba , Florian Kainz , Longqi Cai , Yael Pritch Knaan
IPC: G06T7/11 , G06T5/00 , G06V10/764
Abstract: A device automatically segments an image into different regions and automatically adjusts perceived exposure-levels or other characteristics associated with each of the different regions, to produce pictures that exceed expectations for the type of optics and camera equipment being used and in some cases, the pictures even resemble other high-quality photography created using professional equipment and photo editing software. A machine-learned model is trained to automatically segment an image into distinct regions. The model outputs one or more masks that define the distinct regions. The mask(s) are refined using a guided filter or other technique to ensure that edges of the mask(s) conform to edges of objects depicted in the image. By applying the mask(s) to the image, the device can individually adjust respective characteristics of each of the different regions to produce a higher-quality picture of a scene.
-
-
-
-
-
-
-
-
-