PROMPT-TO-PROMPT IMAGE EDITING WITH CROSS-ATTENTION CONTROL

    公开(公告)号:US20240037822A1

    公开(公告)日:2024-02-01

    申请号:US18228614

    申请日:2023-07-31

    Applicant: GOOGLE LLC

    CPC classification number: G06T11/60 G06F3/04845 G06F40/40

    Abstract: Some implementations are directed to editing a source image, where the source image is one generated based on processing a source natural language (NL) prompt using a Large-scale language-image (LLI) model. Those implementations edit the source image based on user interface input that indicates an edit to the source NL prompt, and optionally independent of any user interface input that specifies a mask in the source image and/or independent of any other user interface input. Some implementations of the present disclosure are additionally or alternatively directed to applying prompt-to-prompt editing techniques to editing a source image that is one generated based on a real image, and that approximates the real image.

    Techniques For Reducing Distractions In An Image

    公开(公告)号:US20230094723A1

    公开(公告)日:2023-03-30

    申请号:US17487741

    申请日:2021-09-28

    Applicant: Google LLC

    Abstract: Techniques for reducing a distractor object in a first image are presented herein. A system can access a mask and the first image. A distractor object in the first image can be inside a region of interest and can have a pixel with an original attribute. Additionally, the system can process, using a machine-learned inpainting model, the first image and the mask to generate an inpainted image. The pixel of the distractor object in the inpainted image can have an inpainted attribute in chromaticity channels. Moreover, the system can determine a palette transform based on a comparison of the first image and the inpainted image. The transform attribute can be different from the inpainted attribute. Furthermore, the system can process the first image to generate a recolorized image. The pixel in the recolorized image can have a recolorized attribute based on the transform attribute of the palette transform.

    Deep Saliency Prior
    4.
    发明申请

    公开(公告)号:US20230015117A1

    公开(公告)日:2023-01-19

    申请号:US17856370

    申请日:2022-07-01

    Applicant: Google LLC

    Abstract: Techniques for tuning an image editing operator for reducing a distractor in raw image data are presented herein. The image editing operator can access the raw image data and a mask. The mask can indicate a region of interest associated with the raw image data. The image editing operator can process the raw image data and the mask to generate processed image data. Additionally, a trained saliency model can process at least the processed image data within the region of interest to generate a saliency map that provides saliency values. Moreover, a saliency loss function can compare the saliency values provided by the saliency map for the processed image data within the region of interest to one or more target saliency values. Subsequently, the one or more parameter values of the image editing operator can be modified based at least in part on the saliency loss function.

    Machine Learning Models for Example-Guided Image Inpainting

    公开(公告)号:US20250037251A1

    公开(公告)日:2025-01-30

    申请号:US18717098

    申请日:2022-01-13

    Applicant: Google LLC

    Abstract: A method includes obtaining an input image having a region to be inpainted, an indication of the region to be inpainted, and a guide image. The method also includes determining, by an encoder model, a first latent representation of the input image and a second latent representation of the guide image, and generating a combined latent representation based on the first latent representation and the second latent representation. The method additionally includes generating, by a style generative adversarial network model and based on the combined latent representation, an intermediate output image that includes inpainted image content for the region to be inpainted in the input image. The method further includes generating, based on the input image, the indication of the region, and the intermediate output image, an output image representing the input image with the region to be inpainted including the inpainted image content from the intermediate output image.

    Techniques for Reducing Distractions in an Image

    公开(公告)号:US20240046532A1

    公开(公告)日:2024-02-08

    申请号:US18489539

    申请日:2023-10-18

    Applicant: Google LLC

    CPC classification number: G06T11/001 G06N20/00 G06T5/005 G06T11/60

    Abstract: Techniques for reducing a distractor object in a first image are presented herein. A system can access a mask and the first image. A distractor object in the first image can be inside a region of interest and can have a pixel with an original attribute. Additionally, the system can process, using a machine-learned inpainting model, the first image and the mask to generate an inpainted image. The pixel of the distractor object in the inpainted image can have an inpainted attribute in chromaticity channels. Moreover, the system can determine a palette transform based on a comparison of the first image and the inpainted image. The transform attribute can be different from the inpainted attribute. Furthermore, the system can process the first image to generate a recolorized image. The pixel in the recolorized image can have a recolorized attribute based on the transform attribute of the palette transform.

Patent Agency Ranking