Patent search ap:("Google LLC") AND inv:"Tianhao Zhang" Page 1

1.

Invention publication
IMAGE MANIPULATION BY TEXT INSTRUCTION (status: under examination, published)

Publication (announcement) number: US20240212246A1

Publication (announcement) date: 2024-06-27

Application number: US18400629

Application date: 2023-12-29

Applicant: Google LLC

Inventor: Tianhao Zhang, Weilong Yang, Honglak Lee, Hung-Yu Tseng, Irfan Aziz Essa, Lu Jiang

IPC: G06T11/60, G06N3/045, G06N3/088, G06T3/02, G06T3/40, G06T9/00

CPC classification number: G06T11/60, G06N3/045, G06N3/088, G06T3/02, G06T3/40, G06T9/002

Abstract: A method for generating an output image from an input image and an input text instruction that specifies a location and a modification of an edit applied to the input image using a neural network is described. The neural network includes an image encoder, an image decoder, and an instruction attention network. The method includes receiving the input image and the input text instruction; extracting, from the input image, an input image feature that represents features of the input image using the image encoder; generating a spatial feature and a modification feature from the input text instruction using the instruction attention network; generating an edited image feature from the input image feature, the spatial feature and the modification feature; and generating the output image from the edited image feature using the image decoder.
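The data flow named in this abstract (encode the image, derive a spatial feature and a modification feature from the instruction, combine them, decode) can be sketched as follows. This is a minimal illustration, not the patented implementation: the function `edit_image`, the four `toy_*` components, and the additive fusion rule are all hypothetical stand-ins, since the abstract does not state how the three features are combined.

```python
from typing import Callable, List, Tuple

Feature = List[float]


def edit_image(
    image: Feature,
    instruction: str,
    image_encoder: Callable[[Feature], Feature],
    instruction_attention: Callable[[str], Tuple[Feature, float]],
    fuse: Callable[[Feature, Feature, float], Feature],
    image_decoder: Callable[[Feature], Feature],
) -> Feature:
    """Run the four steps listed in the abstract, in order."""
    image_feature = image_encoder(image)
    # "Where" to edit and "what" to change both come from the instruction.
    spatial_feature, modification_feature = instruction_attention(instruction)
    edited_feature = fuse(image_feature, spatial_feature, modification_feature)
    return image_decoder(edited_feature)


# Hypothetical toy components (assumptions, not the patent's networks).
def toy_encoder(image: Feature) -> Feature:
    return list(image)


def toy_attention(instruction: str) -> Tuple[Feature, float]:
    # Spatial feature: a mask over a 4-value "image"; modification: a scalar.
    mask = [1.0, 1.0, 0.0, 0.0] if "left" in instruction else [0.0, 0.0, 1.0, 1.0]
    delta = 0.5 if "brighten" in instruction else -0.5
    return mask, delta


def toy_fuse(feature: Feature, mask: Feature, delta: float) -> Feature:
    # Assumed fusion rule: apply the modification only where the mask is set.
    return [f + m * delta for f, m in zip(feature, mask)]


def toy_decoder(feature: Feature) -> Feature:
    return list(feature)


result = edit_image(
    [0.25, 0.25, 0.25, 0.25],
    "brighten the left half",
    toy_encoder, toy_attention, toy_fuse, toy_decoder,
)
print(result)  # [0.75, 0.75, 0.25, 0.25]
```

Passing the components in as callables keeps the sketch independent of any particular network architecture; only the order of the steps is taken from the abstract.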

2.

Invention publication
ZOOM AGNOSTIC WATERMARK EXTRACTION (status: under examination, published)

Publication (announcement) number: US20230325961A1

Publication (announcement) date: 2023-10-12

Application number: US18008544

Application date: 2021-06-21

Applicant: Google LLC

Inventor: Dake He, Tianhao Zhang, Elnaz Barshan Tashnizi, Xiyang Luo, Huiwen Chang, Feng Yang, Ryan Matthew Haggarty

IPC: G06T1/00, G06T7/11, G06T3/40

CPC classification number: G06T1/005, G06T7/11, G06T3/40, G06T2201/0083, G06T2207/20081, G06T2201/0065

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining a visually imperceptible or a visually perceptible watermark and outputting a result based on the determination. A watermark decoder receives an input image. The watermark decoder applies a decoder machine learning model to decode a watermark at different levels of zoom. The watermark decoder determines whether a watermark was decoded to obtain a decoded watermark. The watermark decoder outputs a result based on the determination whether the watermark was decoded through application of the decoder machine learning model to the input image that includes outputting a zoomed output decoded through application of the decoder machine learning model to the input image.
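One way to read the decoding loop in this abstract is as repeated decode attempts over a set of zoom levels, reporting whether any attempt succeeded. The sketch below assumes that reading; `decode_at_zoom_levels`, `toy_rescale`, `toy_decode`, and the dictionary-based "image" are hypothetical names and data shapes introduced only for illustration, and the real decoder is a machine learning model rather than a lookup.

```python
from typing import Any, Callable, Dict, Optional, Sequence


def decode_at_zoom_levels(
    image: Any,
    zoom_levels: Sequence[float],
    rescale: Callable[[Any, float], Any],
    decode: Callable[[Any], Optional[str]],
) -> Dict[str, Any]:
    """Try the decoder at each zoom level and report the first success."""
    for zoom in zoom_levels:
        payload = decode(rescale(image, zoom))
        if payload is not None:
            return {"decoded": True, "zoom": zoom, "payload": payload}
    return {"decoded": False, "zoom": None, "payload": None}


# Hypothetical toy stand-ins: the "image" only records its current scale
# and the payload it carries.
def toy_rescale(image: Dict[str, Any], zoom: float) -> Dict[str, Any]:
    return {"scale": image["scale"] * zoom, "payload": image["payload"]}


def toy_decode(image: Dict[str, Any]) -> Optional[str]:
    # Assumption: the toy decoder only succeeds at the native scale 1.0.
    if abs(image["scale"] - 1.0) < 1e-9:
        return image["payload"]
    return None


zoomed_in = {"scale": 2.0, "payload": "abc"}
outcome = decode_at_zoom_levels(zoomed_in, [0.25, 0.5, 1.0, 2.0], toy_rescale, toy_decode)
print(outcome)  # {'decoded': True, 'zoom': 0.5, 'payload': 'abc'}
```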

3.

Invention grant
Image manipulation by text instruction (status: in force)

Publication (announcement) number: US11562518B2

Publication (announcement) date: 2023-01-24

Application number: US17340671

Application date: 2021-06-07

Applicant: Google LLC

Inventor: Tianhao Zhang, Weilong Yang, Honglak Lee, Hung-Yu Tseng, Irfan Aziz Essa, Lu Jiang

IPC: G06T11/60, G06T3/00, G06N3/04, G06N3/08, G06T3/40, G06T9/00

Abstract: A method for generating an output image from an input image and an input text instruction that specifies a location and a modification of an edit applied to the input image using a neural network is described. The neural network includes an image encoder, an image decoder, and an instruction attention network. The method includes receiving the input image and the input text instruction; extracting, from the input image, an input image feature that represents features of the input image using the image encoder; generating a spatial feature and a modification feature from the input text instruction using the instruction attention network; generating an edited image feature from the input image feature, the spatial feature and the modification feature; and generating the output image from the edited image feature using the image decoder.

4.

Invention application
IMAGE MANIPULATION BY TEXT INSTRUCTION (status: in force)

Publication (announcement) number: US20210383584A1

Publication (announcement) date: 2021-12-09

Application number: US17340671

Application date: 2021-06-07

Applicant: Google LLC

Inventor: Tianhao Zhang, Weilong Yang, Honglak Lee, Hung-Yu Tseng, Irfan Aziz Essa, Lu Jiang

IPC: G06T11/60, G06T3/00, G06T3/40, G06N3/08, G06N3/04

Abstract: A method for generating an output image from an input image and an input text instruction that specifies a location and a modification of an edit applied to the input image using a neural network is described. The neural network includes an image encoder, an image decoder, and an instruction attention network. The method includes receiving the input image and the input text instruction; extracting, from the input image, an input image feature that represents features of the input image using the image encoder; generating a spatial feature and a modification feature from the input text instruction using the instruction attention network; generating an edited image feature from the input image feature, the spatial feature and the modification feature; and generating the output image from the edited image feature using the image decoder.

5.

Invention grant
Image manipulation by text instruction (status: in force)

Publication (announcement) number: US11900517B2

Publication (announcement) date: 2024-02-13

Application number: US18085487

Application date: 2022-12-20

Applicant: Google LLC

Inventor: Tianhao Zhang, Weilong Yang, Honglak Lee, Hung-Yu Tseng, Irfan Aziz Essa, Lu Jiang

IPC: G06T11/60, G06T9/00, G06T3/00, G06N3/088, G06T3/40, G06N3/045

CPC classification number: G06T11/60, G06N3/045, G06N3/088, G06T3/0006, G06T3/40, G06T9/002

Abstract: A method for generating an output image from an input image and an input text instruction that specifies a location and a modification of an edit applied to the input image using a neural network is described. The neural network includes an image encoder, an image decoder, and an instruction attention network. The method includes receiving the input image and the input text instruction; extracting, from the input image, an input image feature that represents features of the input image using the image encoder; generating a spatial feature and a modification feature from the input text instruction using the instruction attention network; generating an edited image feature from the input image feature, the spatial feature and the modification feature; and generating the output image from the edited image feature using the image decoder.

6.

Invention publication
IMAGE MANIPULATION BY TEXT INSTRUCTION (status: under examination, published)

Publication (announcement) number: US20230177754A1

Publication (announcement) date: 2023-06-08

Application number: US18085487

Application date: 2022-12-20

Applicant: Google LLC

Inventor: Tianhao Zhang, Weilong Yang, Honglak Lee, Hung-Yu Tseng, Irfan Aziz Essa, Lu Jiang

IPC: G06T11/60, G06T3/00, G06N3/088, G06T3/40, G06T9/00, G06N3/045

CPC classification number: G06T11/60, G06T3/0006, G06N3/088, G06T3/40, G06T9/002, G06N3/045

Abstract: A method for generating an output image from an input image and an input text instruction that specifies a location and a modification of an edit applied to the input image using a neural network is described. The neural network includes an image encoder, an image decoder, and an instruction attention network. The method includes receiving the input image and the input text instruction; extracting, from the input image, an input image feature that represents features of the input image using the image encoder; generating a spatial feature and a modification feature from the input text instruction using the instruction attention network; generating an edited image feature from the input image feature, the spatial feature and the modification feature; and generating the output image from the edited image feature using the image decoder.

7.

Invention publication
ZOOM AGNOSTIC WATERMARK EXTRACTION (status: under examination, published)

Publication (announcement) number: US20230325959A1

Publication (announcement) date: 2023-10-12

Application number: US17926213

Application date: 2021-06-21

Applicant: Google LLC

Inventor: Dake He, Tianhao Zhang, Elnaz Barshan Tashnizi, Xiyang Luo, Huiwen Chang, Feng Yang, Ryan Matthew Haggarty

IPC: G06T1/00, G06T3/40, G06T5/20, G06V10/764

CPC classification number: G06T1/0021, G06T3/40, G06T5/20, G06V10/764, G06T2201/0065, G06T2207/20081

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for detecting and decoding a visually imperceptible or perceptible watermark. A watermark detection apparatus determines whether the particular image includes a visually imperceptible or perceptible watermark using a detector machine learning model. If the watermark detection apparatus detects a watermark, the particular image is routed to a watermark decoder. If the watermark detection apparatus cannot detect a watermark in the particular image, the particular image is filtered from further processing. The watermark decoder decodes the visually imperceptible or perceptible watermark detected in the particular image. After decoding, an item depicted in the particular image is validated based on data extracted from the decoded visually imperceptible or perceptible watermark.
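The detect, route, decode, and validate sequence in this abstract can be sketched as a small control flow. The names `process_image`, `toy_detector`, `toy_decoder`, and `toy_validator`, the dictionary-based image, and the three status strings are hypothetical and chosen only for illustration; in the patent the detector and decoder are machine learning models.

```python
from typing import Any, Callable, Dict


def process_image(
    image: Any,
    detector: Callable[[Any], bool],
    decoder: Callable[[Any], str],
    validator: Callable[[Any, str], bool],
) -> str:
    """Filter images with no detected watermark; decode and validate the rest."""
    if not detector(image):
        return "filtered"  # no watermark detected: skip further processing
    payload = decoder(image)  # image was routed to the watermark decoder
    return "valid" if validator(image, payload) else "invalid"


# Hypothetical toy stand-ins operating on a dictionary "image".
def toy_detector(image: Dict[str, str]) -> bool:
    return "watermark" in image


def toy_decoder(image: Dict[str, str]) -> str:
    return image["watermark"]


def toy_validator(image: Dict[str, str], payload: str) -> bool:
    # Assumption: validation means the decoded data names the depicted item.
    return payload == image["item"]


print(process_image({"item": "sku-1"}, toy_detector, toy_decoder, toy_validator))
print(process_image({"item": "sku-1", "watermark": "sku-1"}, toy_detector, toy_decoder, toy_validator))
print(process_image({"item": "sku-1", "watermark": "sku-9"}, toy_detector, toy_decoder, toy_validator))
```

Running the cheap detection check before decoding means images without a watermark never reach the decoder, which matches the filtering step the abstract describes.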
